Step 2 of 7
Similarity Deduplication
Remove duplicate keywords with similar embeddings
Remove duplicate keywords with similar embeddings
By comparing embedding similarity, we identify and merge semantically duplicate keywords, consolidating their search volumes.
Semantic deduplication using cosine similarity on embeddings
class DeduplicationResponse(BaseModel):
input_filename: str
output_filename: str
removed_filename: str
input_download_url: str
output_download_url: str
removed_download_url: str
original_count: int
final_count: int
removed_count: int
groups_found: int
volume_consolidated: float
similarity_threshold: float
created_at: str