Skip to content
GlossaryConceptUpdated May 2026

Embedding

noun · also: vector-store, rag, openai

What is embedding?

An embedding is a numerical vector representation of text (or images, audio) that captures meaning — close vectors mean similar content.

Definition

Full definition of embedding

You pass text to an embedding model (OpenAI's text-embedding-3, Cohere's embed-v3, Voyage's voyage-3) and get back a list of floats (typically 1024-3072 dimensions). Two pieces of text with similar meaning produce similar vectors, so you can search "by meaning" instead of "by keyword". The foundation of RAG, semantic search, recommendation, and clustering.

In practice

Embedding examples

Embedding output
Input: 'customer cancelled'. Output: [0.012, -0.089, 0.234, ... ] (1536 floats)
Used by

Apps that exemplify embedding

See embedding in action across real integrations.

FAQ

Common questions about embedding

Which embedding model is best?
For most tasks: OpenAI's text-embedding-3-small (cheap) or text-embedding-3-large. Cohere and Voyage often outperform on specific domains; benchmark on your data.
Why are embeddings expensive at scale?
They're cheap to generate (~$0.02 per 1M tokens), but storage and search costs scale linearly with corpus size. Plan for ~$0.50/month per million docs.