SEO IRL
Pipeline
upload_file
Import & EmbedStep 1
filter_alt
DeduplicateStep 2
hub
ClusterStep 3
cleaning_services
DenoiseStep 4
lightbulb
Generate TopicsStep 5
content_copy
Deduplicate TopicsStep 6
account_tree
Label ParentsStep 7
Visualizers
grid_viewTreemap ViewhubRadial Tree
Generator
descriptionContent Brief
Presented by
Amin Foroutan
info
Tip:Click the slideshow Slide icon above to see the educational content about this step!
track_changesStep 1 of 7
Move to the next step with this

Keyword Embedding Generation

Generate semantic embeddings for your keywords

info
Required Format:CSV file with two columns:
  • keyword - Your keywords/search terms
  • volume - Search volume (numeric)
tuneAdvanced Settings
expand_more

What Are Embeddings?

Embeddings are mathematical representations of data that preserve relationships within your data.

Word → Embedding → Position

man

living being
feline
human
gender
royalty
verb

2D projection

x: royalty, y: inverse gender
man
woman
king
queen

Axes used for the projection: x = royalty, y = inverse of gender. Values shown are synthetic for demo.

Tip: click Next to step through word → vector → plot, or Play to auto animate.

codeBackend Code Structure

account_treeProcessing Pipeline

INPUT
keywords.csv
save_upload()
arrow_downward
load_and_clean_data()
→ Remove duplicates, validate
arrow_downward
generate_embeddings_parallel()
→ Split into batches
cloudOpenAI Batch 1
cloudOpenAI Batch 2
cloudOpenAI Batch 3
arrow_downward
write_output_csv()
→ Serialize vectors
OUTPUT
embeddings.csv
Example Data Transformation:
Input:
{
  "Keyword": "best running shoes"
}
arrow_forward
Output:
{
  "Keyword": "best running shoes",
  "Embedding": "[0.023, -0.15, ...]",
  "Is_Brand": false
}

starsKey Features

speed
Parallel Processing
ThreadPoolExecutor for concurrent API calls
storage
File Persistence
Timestamped storage for inputs/outputs
verified
Data Validation
CSV structure validation and cleaning
auto_awesome
OpenAI Integration
text-embedding-3-small model