Embedding Model

What and how it can be used:

The Embedding Model component converts text into numerical vector representations (embeddings) that capture semantic meaning. These vectors enable similarity searches, document retrieval, and semantic understanding by representing text in a mathematical space where similar meanings are positioned close together.

When/how the component should be used:

Use when you need to convert text into vectors for similarity search or semantic matching
Use before vector storage, similarity search, clustering, or retrieval.
Required for indexing documents in Knowledge Base – Files
Create a flow, add a Knowledge Base – Files component, and then select a file containing text data, such as a PDF, that you can use to test the flow.
Add the Embedding Model core component, and then provide a valid OpenAI API key.
Add a Split Text component to your flow. This component splits text input into smaller chunks to be processed into embeddings.
Add a vector store component, such as the Chroma DB component, to your flow, and then configure the component to connect to your vector database. This component stores the generated embeddings so they can be used for similarity search.
Connect the components
Click Test Agent, and then enter a search query to retrieve text chunks that are most semantically similar to your query.

Connections with other components:

Tool components (Chroma DB)

Configurable settings:

Model Provider
Model Name
OpenAi Api Key

Default settings:

Model Provider
Model Name
OpenAi Api Key

Control Section:

Model Provider
Model Name
OpenAI API Key
Chunk Size
Max Retries
API Base URL
Dimensions
Chunk Size
Request Timeout
Max Retries
Show Progress Bar
Model Kwargs

Default values:

Model Provider = OpenAI
Model Name = text-embedding-3-small
Chunk Size = 1000
Max Retries = 3

Desired Behaviour:

Batch embeddings where possible
Consistent vector dimensionality
No semantic modification of text

Resources