Embeddings
Embedding models convert text into numerical vectors. These vectors can be used for tasks such as similarity search, clustering, and classification.
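All of the components on this page return a LangChain-style `Embeddings` object, which exposes `embed_documents` for batches of texts and `embed_query` for a single string. A minimal sketch of how downstream code consumes one (the concrete model class here is illustrative; substitute the output of any component below):

```python
# Minimal sketch of the LangChain Embeddings interface that the
# components on this page return. The model class is illustrative.
from langchain_openai import OpenAIEmbeddings

embedder = OpenAIEmbeddings(model="text-embedding-3-small")  # assumes OPENAI_API_KEY is set

# One vector per input document.
vectors = embedder.embed_documents(["first document", "second document"])
# A single vector for a query string.
query_vector = embedder.embed_query("a search query")

print(len(vectors), len(query_vector))  # e.g., 2 documents, 1536 dimensions each
```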
AI/ML
This component generates embeddings using the AI/ML API.
Parameters
Inputs

| Name | Type | Description |
|---|---|---|
| model_name | String | The name of the AI/ML embedding model to use |
| aiml_api_key | SecretString | API key for authenticating with the AI/ML service |

Outputs

| Name | Type | Description |
|---|---|---|
| embeddings | Embeddings | An instance of AIMLEmbeddingsImpl for generating embeddings |
Component code
aiml.py
```python
from langflow.base.embeddings.aiml_embeddings import AIMLEmbeddingsImpl
from langflow.base.embeddings.model import LCEmbeddingsModel
from langflow.field_typing import Embeddings
from langflow.inputs.inputs import DropdownInput
from langflow.io import SecretStrInput


class AIMLEmbeddingsComponent(LCEmbeddingsModel):
    display_name = "AI/ML Embeddings"
    description = "Generate embeddings using the AI/ML API."
    icon = "AI/ML"
    name = "AIMLEmbeddings"

    inputs = [
        DropdownInput(
            name="model_name",
            display_name="Model Name",
            options=[
                "text-embedding-3-small",
                "text-embedding-3-large",
                "text-embedding-ada-002",
            ],
            required=True,
        ),
        SecretStrInput(
            name="aiml_api_key",
            display_name="AI/ML API Key",
            value="AIML_API_KEY",
            required=True,
        ),
    ]

    def build_embeddings(self) -> Embeddings:
        return AIMLEmbeddingsImpl(
            api_key=self.aiml_api_key,
            model=self.model_name,
        )
```
Amazon Bedrock Embeddings
Use this component to load embedding models and generate embeddings with Amazon Bedrock.
This component requires an AWS account and access to Amazon Bedrock.
Parameters
| Name | Display Name | Info |
|---|---|---|
| credentials_profile_name | AWS Credentials Profile | Name of the AWS credentials profile in ~/.aws/credentials or ~/.aws/config |
| model_id | Model ID | ID of the model to call, e.g., amazon.titan-embed-text-v1 |
| endpoint_url | Endpoint URL | URL to set a specific service endpoint other than the default AWS endpoint |
| region_name | AWS Region | AWS region to use, e.g., us-west-2 |
Component code
amazon_bedrock.py
```python
from langflow.base.models.aws_constants import AWS_EMBEDDING_MODEL_IDS, AWS_REGIONS
from langflow.base.models.model import LCModelComponent
from langflow.field_typing import Embeddings
from langflow.inputs import SecretStrInput
from langflow.io import DropdownInput, MessageTextInput, Output


class AmazonBedrockEmbeddingsComponent(LCModelComponent):
    display_name: str = "Amazon Bedrock Embeddings"
    description: str = "Generate embeddings using Amazon Bedrock models."
    icon = "Amazon"
    name = "AmazonBedrockEmbeddings"

    inputs = [
        DropdownInput(
            name="model_id",
            display_name="Model Id",
            options=AWS_EMBEDDING_MODEL_IDS,
            value="amazon.titan-embed-text-v1",
        ),
        SecretStrInput(
            name="aws_access_key_id",
            display_name="AWS Access Key ID",
            info="The access key for your AWS account. "
            "Usually set in Python code as the environment variable 'AWS_ACCESS_KEY_ID'.",
            value="AWS_ACCESS_KEY_ID",
        ),
        SecretStrInput(
            name="aws_secret_access_key",
            display_name="AWS Secret Access Key",
            info="The secret key for your AWS account. "
            "Usually set in Python code as the environment variable 'AWS_SECRET_ACCESS_KEY'.",
            value="AWS_SECRET_ACCESS_KEY",
        ),
        SecretStrInput(
            name="aws_session_token",
            display_name="AWS Session Token",
            advanced=False,
            info="The session key for your AWS account. "
            "Only needed for temporary credentials. "
            "Usually set in Python code as the environment variable 'AWS_SESSION_TOKEN'.",
            value="AWS_SESSION_TOKEN",
        ),
        SecretStrInput(
            name="credentials_profile_name",
            display_name="Credentials Profile Name",
            advanced=True,
            info="The name of the profile to use from your "
            "~/.aws/credentials file. "
            "If not provided, the default profile will be used.",
            value="AWS_CREDENTIALS_PROFILE_NAME",
        ),
        DropdownInput(
            name="region_name",
            display_name="Region Name",
            value="us-east-1",
            options=AWS_REGIONS,
            info="The AWS region where your Bedrock resources are located.",
        ),
        MessageTextInput(
            name="endpoint_url",
            display_name="Endpoint URL",
            advanced=True,
            info="The URL of the AWS Bedrock endpoint to use.",
        ),
    ]

    outputs = [
        Output(display_name="Embeddings", name="embeddings", method="build_embeddings"),
    ]

    def build_embeddings(self) -> Embeddings:
        try:
            from langchain_aws import BedrockEmbeddings
        except ImportError as e:
            msg = "langchain_aws is not installed. Please install it with `pip install langchain_aws`."
            raise ImportError(msg) from e
        try:
            import boto3
        except ImportError as e:
            msg = "boto3 is not installed. Please install it with `pip install boto3`."
            raise ImportError(msg) from e

        if self.aws_access_key_id or self.aws_secret_access_key:
            session = boto3.Session(
                aws_access_key_id=self.aws_access_key_id,
                aws_secret_access_key=self.aws_secret_access_key,
                aws_session_token=self.aws_session_token,
            )
        elif self.credentials_profile_name:
            session = boto3.Session(profile_name=self.credentials_profile_name)
        else:
            session = boto3.Session()

        client_params = {}
        if self.endpoint_url:
            client_params["endpoint_url"] = self.endpoint_url
        if self.region_name:
            client_params["region_name"] = self.region_name

        boto3_client = session.client("bedrock-runtime", **client_params)
        return BedrockEmbeddings(
            credentials_profile_name=self.credentials_profile_name,
            client=boto3_client,
            model_id=self.model_id,
            endpoint_url=self.endpoint_url,
            region_name=self.region_name,
        )
```
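As a rough standalone sketch of what `build_embeddings` assembles, assuming `boto3` and `langchain-aws` are installed and using a placeholder profile name:

```python
# Standalone sketch of the component's credential path: a boto3 session
# (explicit keys, a named profile, or the default chain) feeding a
# bedrock-runtime client into BedrockEmbeddings.
import boto3
from langchain_aws import BedrockEmbeddings

session = boto3.Session(profile_name="default")  # or pass keys explicitly
client = session.client("bedrock-runtime", region_name="us-east-1")

embedder = BedrockEmbeddings(client=client, model_id="amazon.titan-embed-text-v1")
vector = embedder.embed_query("hello bedrock")
```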
Astra DB vectorize
Connect this component to the Embeddings port of the Astra DB vector store component to generate embeddings.
This component requires that your Astra DB database has a collection that uses a vectorize embedding provider integration. For more information and instructions, see Auto-generate embeddings with vectorize.
Parameters
| Name | Display Name | Info |
|---|---|---|
| provider | Embedding Provider | The embedding provider to use |
| model_name | Model Name | The embedding model to use |
| authentication | Authentication | The name of the API key in Astra KMS that stores your vectorize embedding provider credentials. (Not required if using an Astra-hosted embedding provider.) |
| provider_api_key | Provider API Key | An alternative to authentication: passes the provider's API key to Astra DB with each request. Use this when no provider secret is stored in Astra's key management system. |
| model_parameters | Model Parameters | Additional model parameters |
Component code
astra_vectorize.py
```python
from typing import Any

from langflow.custom import Component
from langflow.inputs.inputs import DictInput, DropdownInput, MessageTextInput, SecretStrInput
from langflow.template.field.base import Output


class AstraVectorizeComponent(Component):
    display_name: str = "Astra Vectorize [DEPRECATED]"
    description: str = (
        "Configuration options for Astra Vectorize server-side embeddings. "
        "This component is deprecated. Please use the Astra DB Component directly."
    )
    documentation: str = "https://docs.datastax.com/en/astra-db-serverless/databases/embedding-generation.html"
    icon = "AstraDB"
    name = "AstraVectorize"

    VECTORIZE_PROVIDERS_MAPPING = {
        "Azure OpenAI": ["azureOpenAI", ["text-embedding-3-small", "text-embedding-3-large", "text-embedding-ada-002"]],
        "Hugging Face - Dedicated": ["huggingfaceDedicated", ["endpoint-defined-model"]],
        "Hugging Face - Serverless": [
            "huggingface",
            [
                "sentence-transformers/all-MiniLM-L6-v2",
                "intfloat/multilingual-e5-large",
                "intfloat/multilingual-e5-large-instruct",
                "BAAI/bge-small-en-v1.5",
                "BAAI/bge-base-en-v1.5",
                "BAAI/bge-large-en-v1.5",
            ],
        ],
        "Jina AI": [
            "jinaAI",
            [
                "jina-embeddings-v2-base-en",
                "jina-embeddings-v2-base-de",
                "jina-embeddings-v2-base-es",
                "jina-embeddings-v2-base-code",
                "jina-embeddings-v2-base-zh",
            ],
        ],
        "Mistral AI": ["mistral", ["mistral-embed"]],
        "NVIDIA": ["nvidia", ["NV-Embed-QA"]],
        "OpenAI": ["openai", ["text-embedding-3-small", "text-embedding-3-large", "text-embedding-ada-002"]],
        "Upstage": ["upstageAI", ["solar-embedding-1-large"]],
        "Voyage AI": [
            "voyageAI",
            ["voyage-large-2-instruct", "voyage-law-2", "voyage-code-2", "voyage-large-2", "voyage-2"],
        ],
    }
    VECTORIZE_MODELS_STR = "\n\n".join(
        [provider + ": " + (", ".join(models[1])) for provider, models in VECTORIZE_PROVIDERS_MAPPING.items()]
    )

    inputs = [
        DropdownInput(
            name="provider",
            display_name="Provider",
            options=list(VECTORIZE_PROVIDERS_MAPPING.keys()),
            value="",
            required=True,
        ),
        MessageTextInput(
            name="model_name",
            display_name="Model Name",
            info="The embedding model to use for the selected provider. Each provider has a different set of models "
            f"available (full list at https://docs.datastax.com/en/astra-db-serverless/databases/embedding-generation.html):\n\n{VECTORIZE_MODELS_STR}",
            required=True,
        ),
        MessageTextInput(
            name="api_key_name",
            display_name="API Key name",
            info="The name of the embeddings provider API key stored on Astra. "
            "If set, it will override the 'ProviderKey' in the authentication parameters.",
        ),
        DictInput(
            name="authentication",
            display_name="Authentication Parameters",
            is_list=True,
            advanced=True,
        ),
        SecretStrInput(
            name="provider_api_key",
            display_name="Provider API Key",
            info="An alternative to the Astra Authentication that passes an API key for the provider with each request "
            "to Astra DB. "
            "This may be used when Vectorize is configured for the collection, "
            "but no corresponding provider secret is stored within Astra's key management system.",
            advanced=True,
        ),
        DictInput(
            name="model_parameters",
            display_name="Model Parameters",
            advanced=True,
            is_list=True,
        ),
    ]
    outputs = [
        Output(display_name="Vectorize", name="config", method="build_options", types=["dict"]),
    ]

    def build_options(self) -> dict[str, Any]:
        provider_value = self.VECTORIZE_PROVIDERS_MAPPING[self.provider][0]
        authentication = {**(self.authentication or {})}
        api_key_name = self.api_key_name
        if api_key_name:
            authentication["providerKey"] = api_key_name
        return {
            # must match astrapy.info.CollectionVectorServiceOptions
            "collection_vector_service_options": {
                "provider": provider_value,
                "modelName": self.model_name,
                "authentication": authentication,
                "parameters": self.model_parameters or {},
            },
            "collection_embedding_api_key": self.provider_api_key,
        }
```
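For reference, the dictionary that `build_options` returns has the following shape. The values shown are illustrative (the KMS key name is hypothetical), but the keys match `astrapy`'s `CollectionVectorServiceOptions` as noted in the code:

```python
# Illustrative output of build_options for the NVIDIA provider.
options = {
    "collection_vector_service_options": {
        "provider": "nvidia",
        "modelName": "NV-Embed-QA",
        "authentication": {"providerKey": "my-astra-kms-key-name"},  # hypothetical key name
        "parameters": {},
    },
    "collection_embedding_api_key": None,  # set when passing a provider key per request
}
```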
Azure OpenAI Embeddings
This component generates embeddings using Azure OpenAI models.
Use this component to create embeddings with Azure’s OpenAI service.
Make sure you have the necessary Azure credentials and have set up an Azure OpenAI resource.
Parameters
| Name | Display Name | Info |
|---|---|---|
| Azure Endpoint | Azure Endpoint | Your Azure endpoint, including the resource |
| Deployment Name | Deployment Name | The name of the deployment |
| API Version | API Version | The API version to use |
| API Key | API Key | The API key to access the Azure OpenAI service |
Component code
azure_openai.py
```python
from langchain_openai import AzureOpenAIEmbeddings

from langflow.base.models.model import LCModelComponent
from langflow.base.models.openai_constants import OPENAI_EMBEDDING_MODEL_NAMES
from langflow.field_typing import Embeddings
from langflow.io import DropdownInput, IntInput, MessageTextInput, Output, SecretStrInput


class AzureOpenAIEmbeddingsComponent(LCModelComponent):
    display_name: str = "Azure OpenAI Embeddings"
    description: str = "Generate embeddings using Azure OpenAI models."
    documentation: str = "https://python.langchain.com/docs/integrations/text_embedding/azureopenai"
    icon = "Azure"
    name = "AzureOpenAIEmbeddings"

    API_VERSION_OPTIONS = [
        "2022-12-01",
        "2023-03-15-preview",
        "2023-05-15",
        "2023-06-01-preview",
        "2023-07-01-preview",
        "2023-08-01-preview",
    ]

    inputs = [
        DropdownInput(
            name="model",
            display_name="Model",
            advanced=False,
            options=OPENAI_EMBEDDING_MODEL_NAMES,
            value=OPENAI_EMBEDDING_MODEL_NAMES[0],
        ),
        MessageTextInput(
            name="azure_endpoint",
            display_name="Azure Endpoint",
            required=True,
            info="Your Azure endpoint, including the resource. Example: `https://example-resource.azure.openai.com/`",
        ),
        MessageTextInput(
            name="azure_deployment",
            display_name="Deployment Name",
            required=True,
        ),
        DropdownInput(
            name="api_version",
            display_name="API Version",
            options=API_VERSION_OPTIONS,
            value=API_VERSION_OPTIONS[-1],
            advanced=True,
        ),
        SecretStrInput(
            name="api_key",
            display_name="API Key",
            required=True,
        ),
        IntInput(
            name="dimensions",
            display_name="Dimensions",
            info="The number of dimensions the resulting output embeddings should have. "
            "Only supported by certain models.",
            advanced=True,
        ),
    ]

    outputs = [
        Output(display_name="Embeddings", name="embeddings", method="build_embeddings"),
    ]

    def build_embeddings(self) -> Embeddings:
        try:
            embeddings = AzureOpenAIEmbeddings(
                model=self.model,
                azure_endpoint=self.azure_endpoint,
                azure_deployment=self.azure_deployment,
                api_version=self.api_version,
                api_key=self.api_key,
                dimensions=self.dimensions or None,
            )
        except Exception as e:
            msg = f"Could not connect to AzureOpenAIEmbeddings API: {e}"
            raise ValueError(msg) from e

        return embeddings
```
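A minimal direct-usage sketch of the underlying `AzureOpenAIEmbeddings` class, with placeholder endpoint and deployment values:

```python
# Direct-usage sketch; the endpoint and deployment name are placeholders
# for your own Azure OpenAI resource.
from langchain_openai import AzureOpenAIEmbeddings

embedder = AzureOpenAIEmbeddings(
    model="text-embedding-3-small",
    azure_endpoint="https://example-resource.azure.openai.com/",
    azure_deployment="my-embedding-deployment",  # hypothetical deployment name
    api_version="2023-08-01-preview",
    api_key="...",  # or set AZURE_OPENAI_API_KEY in the environment
)
print(len(embedder.embed_query("hello azure")))
```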
Cohere Embeddings
This component loads embedding models from Cohere.
Use this component to generate embeddings using Cohere’s AI models.
Ensure you have a valid Cohere API key.
Parameters
| Name | Display Name | Info |
|---|---|---|
| cohere_api_key | Cohere API Key | API key required to authenticate with the Cohere service |
| model | Model Name | Language model used for embedding text documents and performing queries |
| truncate | Truncate | Whether to truncate the input text to fit within the model’s constraints |
Component code
cohere.py
```python
from langchain_community.embeddings.cohere import CohereEmbeddings

from langflow.base.models.model import LCModelComponent
from langflow.field_typing import Embeddings
from langflow.io import DropdownInput, FloatInput, IntInput, MessageTextInput, Output, SecretStrInput


class CohereEmbeddingsComponent(LCModelComponent):
    display_name = "Cohere Embeddings"
    description = "Generate embeddings using Cohere models."
    icon = "Cohere"
    name = "CohereEmbeddings"

    inputs = [
        SecretStrInput(name="cohere_api_key", display_name="Cohere API Key"),
        DropdownInput(
            name="model",
            display_name="Model",
            advanced=True,
            options=[
                "embed-english-v2.0",
                "embed-multilingual-v2.0",
                "embed-english-light-v2.0",
                "embed-multilingual-light-v2.0",
            ],
            value="embed-english-v2.0",
        ),
        MessageTextInput(name="truncate", display_name="Truncate", advanced=True),
        IntInput(name="max_retries", display_name="Max Retries", value=3, advanced=True),
        MessageTextInput(name="user_agent", display_name="User Agent", advanced=True, value="langchain"),
        FloatInput(name="request_timeout", display_name="Request Timeout", advanced=True),
    ]

    outputs = [
        Output(display_name="Embeddings", name="embeddings", method="build_embeddings"),
    ]

    def build_embeddings(self) -> Embeddings:
        return CohereEmbeddings(
            cohere_api_key=self.cohere_api_key,
            model=self.model,
            truncate=self.truncate,
            max_retries=self.max_retries,
            user_agent=self.user_agent,
            request_timeout=self.request_timeout or None,
        )
```
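A direct-usage sketch of the wrapped `CohereEmbeddings` class, using the same parameters the component passes. The `"END"` truncate value is an assumption based on Cohere's embed API, which accepts `NONE`, `START`, and `END`:

```python
# Direct-usage sketch mirroring the component's parameters.
from langchain_community.embeddings.cohere import CohereEmbeddings

embedder = CohereEmbeddings(
    cohere_api_key="...",  # placeholder
    model="embed-english-v2.0",
    truncate="END",  # assumed Cohere truncate option
    max_retries=3,
    user_agent="langchain",
)
vectors = embedder.embed_documents(["short text", "another text"])
```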
Embedding similarity
This component computes selected forms of similarity between two embedding vectors.
Parameters
Inputs

| Name | Display Name | Info |
|---|---|---|
| embedding_vectors | Embedding Vectors | A list containing exactly two data objects with embedding vectors to compare. |
| similarity_metric | Similarity Metric | Select the similarity metric to use. Options: "Cosine Similarity", "Euclidean Distance", "Manhattan Distance". |

Outputs

| Name | Display Name | Info |
|---|---|---|
| similarity_data | Similarity Data | Data object containing the computed similarity score and additional information. |
Component code
similarity.py
```python
import numpy as np

from langflow.custom import Component
from langflow.io import DataInput, DropdownInput, Output
from langflow.schema import Data


class EmbeddingSimilarityComponent(Component):
    display_name: str = "Embedding Similarity"
    description: str = "Compute selected form of similarity between two embedding vectors."
    icon = "equal"

    inputs = [
        DataInput(
            name="embedding_vectors",
            display_name="Embedding Vectors",
            info="A list containing exactly two data objects with embedding vectors to compare.",
            is_list=True,
        ),
        DropdownInput(
            name="similarity_metric",
            display_name="Similarity Metric",
            info="Select the similarity metric to use.",
            options=["Cosine Similarity", "Euclidean Distance", "Manhattan Distance"],
            value="Cosine Similarity",
        ),
    ]

    outputs = [
        Output(display_name="Similarity Data", name="similarity_data", method="compute_similarity"),
    ]

    def compute_similarity(self) -> Data:
        embedding_vectors: list[Data] = self.embedding_vectors

        # Assert that the list contains exactly two Data objects
        if len(embedding_vectors) != 2:  # noqa: PLR2004
            msg = "Exactly two embedding vectors are required."
            raise ValueError(msg)

        embedding_1 = np.array(embedding_vectors[0].data["embeddings"])
        embedding_2 = np.array(embedding_vectors[1].data["embeddings"])

        if embedding_1.shape != embedding_2.shape:
            similarity_score = {"error": "Embeddings must have the same dimensions."}
        else:
            similarity_metric = self.similarity_metric

            if similarity_metric == "Cosine Similarity":
                score = np.dot(embedding_1, embedding_2) / (np.linalg.norm(embedding_1) * np.linalg.norm(embedding_2))
                similarity_score = {"cosine_similarity": score}

            elif similarity_metric == "Euclidean Distance":
                score = np.linalg.norm(embedding_1 - embedding_2)
                similarity_score = {"euclidean_distance": score}

            elif similarity_metric == "Manhattan Distance":
                score = np.sum(np.abs(embedding_1 - embedding_2))
                similarity_score = {"manhattan_distance": score}

        # Create a Data object to encapsulate the similarity score and additional information
        similarity_data = Data(
            data={
                "embedding_1": embedding_vectors[0].data["embeddings"],
                "embedding_2": embedding_vectors[1].data["embeddings"],
                "similarity_score": similarity_score,
            },
            text_key="similarity_score",
        )

        self.status = similarity_data
        return similarity_data
```
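To make the three metrics concrete, here is a small worked example with NumPy, independent of the component:

```python
# Worked example of the three metrics on two small vectors.
import numpy as np

a = np.array([1.0, 0.0, 1.0])
b = np.array([1.0, 1.0, 0.0])

cosine = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))  # 1 / (sqrt(2)*sqrt(2)) = 0.5
euclidean = np.linalg.norm(a - b)                                # |(0,-1,1)| = sqrt(2)
manhattan = np.sum(np.abs(a - b))                                # 0 + 1 + 1 = 2.0
print(cosine, euclidean, manhattan)
```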
Google generative AI embeddings
This component connects to Google’s generative AI embedding service using the GoogleGenerativeAIEmbeddings class from the langchain-google-genai package.
Parameters
Inputs

| Name | Display Name | Info |
|---|---|---|
| api_key | API Key | Secret API key for accessing Google’s generative AI service (required) |
| model_name | Model Name | Name of the embedding model to use (default: "models/text-embedding-004") |

Outputs

| Name | Display Name | Info |
|---|---|---|
| embeddings | Embeddings | Built GoogleGenerativeAIEmbeddings object |
Component code
google_generative_ai.py
```python
# from langflow.field_typing import Data
import numpy as np

# TODO: remove ignore once the google package is published with types
from google.ai.generativelanguage_v1beta.types import BatchEmbedContentsRequest
from langchain_core.embeddings import Embeddings
from langchain_google_genai import GoogleGenerativeAIEmbeddings
from langchain_google_genai._common import GoogleGenerativeAIError

from langflow.custom import Component
from langflow.io import MessageTextInput, Output, SecretStrInput


class GoogleGenerativeAIEmbeddingsComponent(Component):
    display_name = "Google Generative AI Embeddings"
    description = (
        "Connect to Google's generative AI embeddings service using the GoogleGenerativeAIEmbeddings class, "
        "found in the langchain-google-genai package."
    )
    documentation: str = "https://python.langchain.com/v0.2/docs/integrations/text_embedding/google_generative_ai/"
    icon = "Google"
    name = "Google Generative AI Embeddings"

    inputs = [
        SecretStrInput(name="api_key", display_name="API Key"),
        MessageTextInput(name="model_name", display_name="Model Name", value="models/text-embedding-004"),
    ]

    outputs = [
        Output(display_name="Embeddings", name="embeddings", method="build_embeddings"),
    ]

    def build_embeddings(self) -> Embeddings:
        if not self.api_key:
            msg = "API Key is required"
            raise ValueError(msg)

        class HotaGoogleGenerativeAIEmbeddings(GoogleGenerativeAIEmbeddings):
            def __init__(self, *args, **kwargs) -> None:
                super(GoogleGenerativeAIEmbeddings, self).__init__(*args, **kwargs)

            def embed_documents(
                self,
                texts: list[str],
                *,
                batch_size: int = 100,
                task_type: str | None = None,
                titles: list[str] | None = None,
                output_dimensionality: int | None = 1536,
            ) -> list[list[float]]:
                """Embed a list of strings.

                Google Generative AI currently sets a max batch size of 100 strings.

                Args:
                    texts: List[str] The list of strings to embed.
                    batch_size: [int] The batch size of embeddings to send to the model
                    task_type: task_type (https://ai.google.dev/api/rest/v1/TaskType)
                    titles: An optional list of titles for texts provided.
                        Only applicable when TaskType is RETRIEVAL_DOCUMENT.
                    output_dimensionality: Optional reduced dimension for the output embedding.
                        https://ai.google.dev/api/rest/v1/models/batchEmbedContents#EmbedContentRequest

                Returns:
                    List of embeddings, one for each text.
                """
                embeddings: list[list[float]] = []
                batch_start_index = 0
                for batch in GoogleGenerativeAIEmbeddings._prepare_batches(texts, batch_size):
                    if titles:
                        titles_batch = titles[batch_start_index : batch_start_index + len(batch)]
                        batch_start_index += len(batch)
                    else:
                        titles_batch = [None] * len(batch)  # type: ignore[list-item]

                    requests = [
                        self._prepare_request(
                            text=text,
                            task_type=task_type,
                            title=title,
                            output_dimensionality=output_dimensionality,
                        )
                        for text, title in zip(batch, titles_batch, strict=True)
                    ]

                    try:
                        result = self.client.batch_embed_contents(
                            BatchEmbedContentsRequest(requests=requests, model=self.model)
                        )
                    except Exception as e:
                        msg = f"Error embedding content: {e}"
                        raise GoogleGenerativeAIError(msg) from e
                    embeddings.extend([list(np.pad(e.values, (0, 768), "constant")) for e in result.embeddings])
                return embeddings

            def embed_query(
                self,
                text: str,
                task_type: str | None = None,
                title: str | None = None,
                output_dimensionality: int | None = 1536,
            ) -> list[float]:
                """Embed a text.

                Args:
                    text: The text to embed.
                    task_type: task_type (https://ai.google.dev/api/rest/v1/TaskType)
                    title: An optional title for the text.
                        Only applicable when TaskType is RETRIEVAL_DOCUMENT.
                    output_dimensionality: Optional reduced dimension for the output embedding.
                        https://ai.google.dev/api/rest/v1/models/batchEmbedContents#EmbedContentRequest

                Returns:
                    Embedding for the text.
                """
                task_type = task_type or "RETRIEVAL_QUERY"
                return self.embed_documents(
                    [text],
                    task_type=task_type,
                    titles=[title] if title else None,
                    output_dimensionality=output_dimensionality,
                )[0]

        return HotaGoogleGenerativeAIEmbeddings(model=self.model_name, google_api_key=self.api_key)
```
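A minimal sketch of using the underlying `GoogleGenerativeAIEmbeddings` class directly (without the component's padding subclass), assuming a valid Google AI API key:

```python
# Direct-usage sketch with the component's default model name.
from langchain_google_genai import GoogleGenerativeAIEmbeddings

embedder = GoogleGenerativeAIEmbeddings(
    model="models/text-embedding-004",
    google_api_key="...",  # placeholder
)
vector = embedder.embed_query("hello gemini")
```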
Hugging Face Embeddings
This component is deprecated as of Langflow version 1.0.18. Instead, use the Hugging Face API Embeddings component.
This component loads embedding models from HuggingFace.
Use this component to generate embeddings using locally downloaded Hugging Face models. Ensure you have sufficient computational resources to run the models.
Parameters
| Name | Display Name | Info |
|---|---|---|
| Cache Folder | Cache Folder | Folder path to cache HuggingFace models |
| Encode Kwargs | Encoding Arguments | Additional arguments for the encoding process |
| Model Kwargs | Model Arguments | Additional arguments for the model |
| Model Name | Model Name | Name of the HuggingFace model to use |
| Multi Process | Multi-Process | Whether to use multiple processes |
Hugging Face embeddings Inference API
This component generates embeddings using Hugging Face Inference API models.
Use this component to create embeddings with Hugging Face’s hosted models. Ensure you have a valid Hugging Face API key.
Parameters
| Name | Display Name | Info |
|---|---|---|
| API Key | API Key | API key for accessing the Hugging Face Inference API |
| API URL | API URL | URL of the Hugging Face Inference API |
| Model Name | Model Name | Name of the model to use for embeddings |
| Cache Folder | Cache Folder | Folder path to cache Hugging Face models |
| Encode Kwargs | Encoding Arguments | Additional arguments for the encoding process |
| Model Kwargs | Model Arguments | Additional arguments for the model |
| Multi Process | Multi-Process | Whether to use multiple processes |
Component code
huggingface_inference_api.py
```python
from urllib.parse import urlparse

import requests
from langchain_community.embeddings.huggingface import HuggingFaceInferenceAPIEmbeddings
from pydantic.v1.types import SecretStr
from tenacity import retry, stop_after_attempt, wait_fixed

from langflow.base.embeddings.model import LCEmbeddingsModel
from langflow.field_typing import Embeddings
from langflow.io import MessageTextInput, Output, SecretStrInput


class HuggingFaceInferenceAPIEmbeddingsComponent(LCEmbeddingsModel):
    display_name = "HuggingFace Embeddings Inference"
    description = "Generate embeddings using HuggingFace Text Embeddings Inference (TEI)"
    documentation = "https://huggingface.co/docs/text-embeddings-inference/index"
    icon = "HuggingFace"
    name = "HuggingFaceInferenceAPIEmbeddings"

    inputs = [
        SecretStrInput(
            name="api_key",
            display_name="API Key",
            advanced=True,
            info="Required for non-local inference endpoints. Local inference does not require an API Key.",
        ),
        MessageTextInput(
            name="inference_endpoint",
            display_name="Inference Endpoint",
            required=True,
            value="https://api-inference.huggingface.co/models/",
            info="Custom inference endpoint URL.",
        ),
        MessageTextInput(
            name="model_name",
            display_name="Model Name",
            value="BAAI/bge-large-en-v1.5",
            info="The name of the model to use for text embeddings.",
        ),
    ]

    outputs = [
        Output(display_name="Embeddings", name="embeddings", method="build_embeddings"),
    ]

    def validate_inference_endpoint(self, inference_endpoint: str) -> bool:
        parsed_url = urlparse(inference_endpoint)
        if not all([parsed_url.scheme, parsed_url.netloc]):
            msg = (
                f"Invalid inference endpoint format: '{self.inference_endpoint}'. "
                "Please ensure the URL includes both a scheme (e.g., 'http://' or 'https://') and a domain name. "
                "Example: 'http://localhost:8080' or 'https://api.example.com'"
            )
            raise ValueError(msg)

        try:
            response = requests.get(f"{inference_endpoint}/health", timeout=5)
        except requests.RequestException as e:
            msg = (
                f"Inference endpoint '{inference_endpoint}' is not responding. "
                "Please ensure the URL is correct and the service is running."
            )
            raise ValueError(msg) from e

        if response.status_code != requests.codes.ok:
            msg = f"HuggingFace health check failed: {response.status_code}"
            raise ValueError(msg)
        # returning True to solve linting error
        return True

    def get_api_url(self) -> str:
        if "huggingface" in self.inference_endpoint.lower():
            return f"{self.inference_endpoint}{self.model_name}"
        return self.inference_endpoint

    @retry(stop=stop_after_attempt(3), wait=wait_fixed(2))
    def create_huggingface_embeddings(
        self, api_key: SecretStr, api_url: str, model_name: str
    ) -> HuggingFaceInferenceAPIEmbeddings:
        return HuggingFaceInferenceAPIEmbeddings(api_key=api_key, api_url=api_url, model_name=model_name)

    def build_embeddings(self) -> Embeddings:
        api_url = self.get_api_url()

        is_local_url = api_url.startswith(("http://localhost", "http://127.0.0.1"))

        if not self.api_key and is_local_url:
            self.validate_inference_endpoint(api_url)
            api_key = SecretStr("DummyAPIKeyForLocalDeployment")
        elif not self.api_key:
            msg = "API Key is required for non-local inference endpoints"
            raise ValueError(msg)
        else:
            api_key = SecretStr(self.api_key).get_secret_value()

        try:
            return self.create_huggingface_embeddings(api_key, api_url, self.model_name)
        except Exception as e:
            msg = "Could not connect to HuggingFace Inference API."
            raise ValueError(msg) from e
```
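A sketch of pointing the same wrapper at a local Text Embeddings Inference (TEI) server, mirroring the component's local-deployment path; the URL and port are illustrative:

```python
# Local TEI sketch: the dummy key mirrors what the component substitutes
# for local endpoints, where no real key is needed.
from langchain_community.embeddings.huggingface import HuggingFaceInferenceAPIEmbeddings
from pydantic.v1.types import SecretStr

embedder = HuggingFaceInferenceAPIEmbeddings(
    api_key=SecretStr("DummyAPIKeyForLocalDeployment"),  # local TEI ignores the key
    api_url="http://localhost:8080",  # illustrative local endpoint
    model_name="BAAI/bge-large-en-v1.5",
)
vectors = embedder.embed_documents(["local inference test"])
```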
MistralAI Embeddings
This component generates embeddings using MistralAI models.
Parameters
Inputs

| Name | Type | Description |
|---|---|---|
| model | String | The MistralAI model to use (default: "mistral-embed") |
| mistral_api_key | SecretString | API key for authenticating with MistralAI |
| max_concurrent_requests | Integer | Maximum number of concurrent API requests (default: 64) |
| max_retries | Integer | Maximum number of retry attempts for failed requests (default: 5) |
| timeout | Integer | Request timeout in seconds (default: 120) |
| endpoint | String | Custom API endpoint URL (default: "https://api.mistral.ai/v1/") |

Outputs

| Name | Type | Description |
|---|---|---|
| embeddings | Embeddings | MistralAIEmbeddings instance for generating embeddings |
Component code
mistral.py
```python
from langchain_mistralai.embeddings import MistralAIEmbeddings
from pydantic.v1 import SecretStr

from langflow.base.models.model import LCModelComponent
from langflow.field_typing import Embeddings
from langflow.io import DropdownInput, IntInput, MessageTextInput, Output, SecretStrInput


class MistralAIEmbeddingsComponent(LCModelComponent):
    display_name = "MistralAI Embeddings"
    description = "Generate embeddings using MistralAI models."
    icon = "MistralAI"
    name = "MistalAIEmbeddings"

    inputs = [
        DropdownInput(
            name="model",
            display_name="Model",
            advanced=False,
            options=["mistral-embed"],
            value="mistral-embed",
        ),
        SecretStrInput(name="mistral_api_key", display_name="Mistral API Key"),
        IntInput(
            name="max_concurrent_requests",
            display_name="Max Concurrent Requests",
            advanced=True,
            value=64,
        ),
        IntInput(name="max_retries", display_name="Max Retries", advanced=True, value=5),
        IntInput(name="timeout", display_name="Request Timeout", advanced=True, value=120),
        MessageTextInput(
            name="endpoint",
            display_name="API Endpoint",
            advanced=True,
            value="https://api.mistral.ai/v1/",
        ),
    ]

    outputs = [
        Output(display_name="Embeddings", name="embeddings", method="build_embeddings"),
    ]

    def build_embeddings(self) -> Embeddings:
        if not self.mistral_api_key:
            msg = "Mistral API Key is required"
            raise ValueError(msg)

        api_key = SecretStr(self.mistral_api_key).get_secret_value()

        return MistralAIEmbeddings(
            api_key=api_key,
            model=self.model,
            endpoint=self.endpoint,
            max_concurrent_requests=self.max_concurrent_requests,
            max_retries=self.max_retries,
            timeout=self.timeout,
        )
```
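A direct-usage sketch mirroring the component's defaults, with a placeholder key:

```python
# Direct-usage sketch; the component unwraps a SecretStr before passing
# the key, so a plain string works here as well.
from langchain_mistralai.embeddings import MistralAIEmbeddings

embedder = MistralAIEmbeddings(
    api_key="...",  # placeholder
    model="mistral-embed",
)
vector = embedder.embed_query("bonjour")
```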
NVIDIA
This component generates embeddings using NVIDIA models.
Parameters
Inputs

| Name | Type | Description |
|---|---|---|
| model | String | The NVIDIA model to use for embeddings (e.g., nvidia/nv-embed-v1) |
| base_url | String | Base URL for the NVIDIA API (default: https://integrate.api.nvidia.com/v1) |
| nvidia_api_key | SecretString | API key for authenticating with NVIDIA’s service |
| temperature | Float | Model temperature for embedding generation (default: 0.1) |

Outputs

| Name | Type | Description |
|---|---|---|
| embeddings | Embeddings | NVIDIAEmbeddings instance for generating embeddings |
Component code
nvidia.py
```python
from typing import Any

from langflow.base.embeddings.model import LCEmbeddingsModel
from langflow.field_typing import Embeddings
from langflow.inputs.inputs import DropdownInput, SecretStrInput
from langflow.io import FloatInput, MessageTextInput
from langflow.schema.dotdict import dotdict


class NVIDIAEmbeddingsComponent(LCEmbeddingsModel):
    display_name: str = "NVIDIA Embeddings"
    description: str = "Generate embeddings using NVIDIA models."
    icon = "NVIDIA"

    inputs = [
        DropdownInput(
            name="model",
            display_name="Model",
            options=[
                "nvidia/nv-embed-v1",
                "snowflake/arctic-embed-l",
            ],
            value="nvidia/nv-embed-v1",
        ),
        MessageTextInput(
            name="base_url",
            display_name="NVIDIA Base URL",
            refresh_button=True,
            value="https://integrate.api.nvidia.com/v1",
        ),
        SecretStrInput(
            name="nvidia_api_key",
            display_name="NVIDIA API Key",
            info="The NVIDIA API Key.",
            advanced=False,
            value="NVIDIA_API_KEY",
        ),
        FloatInput(
            name="temperature",
            display_name="Model Temperature",
            value=0.1,
            advanced=True,
        ),
    ]

    def update_build_config(self, build_config: dotdict, field_value: Any, field_name: str | None = None):
        if field_name == "base_url" and field_value:
            try:
                build_model = self.build_embeddings()
                ids = [model.id for model in build_model.available_models]
                build_config["model"]["options"] = ids
                build_config["model"]["value"] = ids[0]
            except Exception as e:
                msg = f"Error getting model names: {e}"
                raise ValueError(msg) from e
        return build_config

    def build_embeddings(self) -> Embeddings:
        try:
            from langchain_nvidia_ai_endpoints import NVIDIAEmbeddings
        except ImportError as e:
            msg = "Please install langchain-nvidia-ai-endpoints to use the Nvidia model."
            raise ImportError(msg) from e
        try:
            output = NVIDIAEmbeddings(
                model=self.model,
                base_url=self.base_url,
                temperature=self.temperature,
                nvidia_api_key=self.nvidia_api_key,
            )
        except Exception as e:
            msg = f"Could not connect to NVIDIA API. Error: {e}"
            raise ValueError(msg) from e
        return output
```
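A sketch of the model-refresh behavior that `update_build_config` relies on: per the component code above, `NVIDIAEmbeddings` exposes `available_models`, from which the dropdown options are rebuilt whenever the base URL changes.

```python
# Sketch: list the models reachable at the configured endpoint, the same
# call the component uses to repopulate its Model dropdown.
from langchain_nvidia_ai_endpoints import NVIDIAEmbeddings

embedder = NVIDIAEmbeddings(
    model="nvidia/nv-embed-v1",
    base_url="https://integrate.api.nvidia.com/v1",
    nvidia_api_key="...",  # placeholder
)
print([m.id for m in embedder.available_models])
```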
Ollama Embeddings
This component generates embeddings using Ollama models.
Use this component to create embeddings with locally run Ollama models.
Ensure you have Ollama set up and running on your system.
Parameters
| Name | Display Name | Info |
|---|---|---|
| Ollama Model | Model Name | Name of the Ollama model to use |
| Ollama Base URL | Base URL | Base URL of the Ollama API |
| Model Temperature | Temperature | Temperature parameter for the model |
Component code
ollama.py
```python
from langchain_ollama import OllamaEmbeddings

from langflow.base.models.model import LCModelComponent
from langflow.field_typing import Embeddings
from langflow.io import MessageTextInput, Output


class OllamaEmbeddingsComponent(LCModelComponent):
    display_name: str = "Ollama Embeddings"
    description: str = "Generate embeddings using Ollama models."
    documentation = "https://python.langchain.com/docs/integrations/text_embedding/ollama"
    icon = "Ollama"
    name = "OllamaEmbeddings"

    inputs = [
        MessageTextInput(
            name="model",
            display_name="Ollama Model",
            value="nomic-embed-text",
        ),
        MessageTextInput(
            name="base_url",
            display_name="Ollama Base URL",
            value="http://localhost:11434",
        ),
    ]

    outputs = [
        Output(display_name="Embeddings", name="embeddings", method="build_embeddings"),
    ]

    def build_embeddings(self) -> Embeddings:
        try:
            output = OllamaEmbeddings(model=self.model, base_url=self.base_url)
        except Exception as e:
            msg = "Could not connect to Ollama API."
            raise ValueError(msg) from e
        return output
```
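A minimal sketch against a local Ollama server, assuming the model has already been pulled (`ollama pull nomic-embed-text`):

```python
# Direct-usage sketch with the component's default model and URL.
from langchain_ollama import OllamaEmbeddings

embedder = OllamaEmbeddings(model="nomic-embed-text", base_url="http://localhost:11434")
vector = embedder.embed_query("hello ollama")
```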
OpenAI Embeddings
This component loads embedding models from OpenAI.
Use this component to generate embeddings using OpenAI’s models.
Ensure you have a valid OpenAI API key and sufficient quota.
Parameters
| Name | Display Name | Info |
|---|---|---|
| OpenAI API Key | API Key | The API key to use for accessing the OpenAI API |
| Default Headers | Default Headers | Default headers for the HTTP requests |
| Default Query | Default Query | Default query parameters for the HTTP requests |
| Allowed Special | Allowed Special Tokens | Special tokens allowed for processing |
| Disallowed Special | Disallowed Special Tokens | Special tokens disallowed for processing |
| Chunk Size | Chunk Size | Chunk size for processing |
| Client | HTTP Client | HTTP client for making requests |
| Deployment | Deployment | Deployment name for the model |
| Embedding Context Length | Context Length | Length of embedding context |
| Max Retries | Max Retries | Maximum number of retries for failed requests |
| Model | Model Name | Name of the model to use |
| Model Kwargs | Model Arguments | Additional keyword arguments for the model |
| OpenAI API Base | API Base URL | Base URL of the OpenAI API |
| OpenAI API Type | API Type | Type of the OpenAI API |
| OpenAI API Version | API Version | Version of the OpenAI API |
| OpenAI Organization | Organization | Organization associated with the API key |
| OpenAI Proxy | Proxy | Proxy server for the requests |
| Request Timeout | Request Timeout | Timeout for the HTTP requests |
| Show Progress Bar | Show Progress | Whether to show a progress bar for processing |
| Skip Empty | Skip Empty | Whether to skip empty inputs |
| TikToken Enable | Enable TikToken | Whether to enable TikToken |
| TikToken Model Name | TikToken Model | Name of the TikToken model |
Component code
openai.py
```python
from langchain_openai import OpenAIEmbeddings

from langflow.base.embeddings.model import LCEmbeddingsModel
from langflow.base.models.openai_constants import OPENAI_EMBEDDING_MODEL_NAMES
from langflow.field_typing import Embeddings
from langflow.io import BoolInput, DictInput, DropdownInput, FloatInput, IntInput, MessageTextInput, SecretStrInput


class OpenAIEmbeddingsComponent(LCEmbeddingsModel):
    display_name = "OpenAI Embeddings"
    description = "Generate embeddings using OpenAI models."
    icon = "OpenAI"
    name = "OpenAIEmbeddings"

    inputs = [
        DictInput(
            name="default_headers",
            display_name="Default Headers",
            advanced=True,
            info="Default headers to use for the API request.",
        ),
        DictInput(
            name="default_query",
            display_name="Default Query",
            advanced=True,
            info="Default query parameters to use for the API request.",
        ),
        IntInput(name="chunk_size", display_name="Chunk Size", advanced=True, value=1000),
        MessageTextInput(name="client", display_name="Client", advanced=True),
        MessageTextInput(name="deployment", display_name="Deployment", advanced=True),
        IntInput(name="embedding_ctx_length", display_name="Embedding Context Length", advanced=True, value=1536),
        IntInput(name="max_retries", display_name="Max Retries", value=3, advanced=True),
        DropdownInput(
            name="model",
            display_name="Model",
            advanced=False,
            options=OPENAI_EMBEDDING_MODEL_NAMES,
            value="text-embedding-3-small",
        ),
        DictInput(name="model_kwargs", display_name="Model Kwargs", advanced=True),
        SecretStrInput(name="openai_api_key", display_name="OpenAI API Key", value="OPENAI_API_KEY"),
        MessageTextInput(name="openai_api_base", display_name="OpenAI API Base", advanced=True),
        MessageTextInput(name="openai_api_type", display_name="OpenAI API Type", advanced=True),
        MessageTextInput(name="openai_api_version", display_name="OpenAI API Version", advanced=True),
        MessageTextInput(
            name="openai_organization",
            display_name="OpenAI Organization",
            advanced=True,
        ),
        MessageTextInput(name="openai_proxy", display_name="OpenAI Proxy", advanced=True),
        FloatInput(name="request_timeout", display_name="Request Timeout", advanced=True),
        BoolInput(name="show_progress_bar", display_name="Show Progress Bar", advanced=True),
        BoolInput(name="skip_empty", display_name="Skip Empty", advanced=True),
        MessageTextInput(
            name="tiktoken_model_name",
            display_name="TikToken Model Name",
            advanced=True,
        ),
        BoolInput(
            name="tiktoken_enable",
            display_name="TikToken Enable",
            advanced=True,
            value=True,
            info="If False, you must have transformers installed.",
        ),
        IntInput(
            name="dimensions",
            display_name="Dimensions",
            info="The number of dimensions the resulting output embeddings should have. "
            "Only supported by certain models.",
            advanced=True,
        ),
    ]

    def build_embeddings(self) -> Embeddings:
        return OpenAIEmbeddings(
            client=self.client or None,
            model=self.model,
            dimensions=self.dimensions or None,
            deployment=self.deployment or None,
            api_version=self.openai_api_version or None,
            base_url=self.openai_api_base or None,
            openai_api_type=self.openai_api_type or None,
            openai_proxy=self.openai_proxy or None,
            embedding_ctx_length=self.embedding_ctx_length,
            api_key=self.openai_api_key or None,
            organization=self.openai_organization or None,
            allowed_special="all",
            disallowed_special="all",
            chunk_size=self.chunk_size,
            max_retries=self.max_retries,
            timeout=self.request_timeout or None,
            tiktoken_enabled=self.tiktoken_enable,
            tiktoken_model_name=self.tiktoken_model_name or None,
            show_progress_bar=self.show_progress_bar,
            model_kwargs=self.model_kwargs,
            skip_empty=self.skip_empty,
            default_headers=self.default_headers or None,
            default_query=self.default_query or None,
        )
```
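A sketch of the `dimensions` option, which the `text-embedding-3` models support for shortened output vectors (assumes `OPENAI_API_KEY` is set in the environment):

```python
# The dimensions parameter truncates the output vector on the server side.
from langchain_openai import OpenAIEmbeddings

full = OpenAIEmbeddings(model="text-embedding-3-small")                  # 1536 dims by default
short = OpenAIEmbeddings(model="text-embedding-3-small", dimensions=256)  # shortened vectors

print(len(full.embed_query("x")), len(short.embed_query("x")))  # 1536 256
```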
Text embedder
This component generates embeddings for a given message using a specified embedding model.
Parameters
Inputs

| Name | Display Name | Info |
|---|---|---|
| embedding_model | Embedding Model | The embedding model to use for generating embeddings. |
| message | Message | The message for which to generate embeddings. |

Outputs

| Name | Display Name | Info |
|---|---|---|
| embeddings | Embedding Data | Data object containing the original text and its embedding vector. |
Component code
text_embedder.py
```python
import logging
from typing import TYPE_CHECKING

from langflow.custom import Component
from langflow.io import HandleInput, MessageInput, Output
from langflow.schema import Data

if TYPE_CHECKING:
    from langflow.field_typing import Embeddings
    from langflow.schema.message import Message


class TextEmbedderComponent(Component):
    display_name: str = "Text Embedder"
    description: str = "Generate embeddings for a given message using the specified embedding model."
    icon = "binary"

    inputs = [
        HandleInput(
            name="embedding_model",
            display_name="Embedding Model",
            info="The embedding model to use for generating embeddings.",
            input_types=["Embeddings"],
        ),
        MessageInput(
            name="message",
            display_name="Message",
            info="The message to generate embeddings for.",
        ),
    ]
    outputs = [
        Output(display_name="Embedding Data", name="embeddings", method="generate_embeddings"),
    ]

    def generate_embeddings(self) -> Data:
        try:
            embedding_model: Embeddings = self.embedding_model
            message: Message = self.message

            # Validate embedding model
            if not embedding_model:
                msg = "Embedding model not provided"
                raise ValueError(msg)

            # Extract the text content from the message
            text_content = message.text if message and message.text else ""
            if not text_content:
                msg = "No text content found in message"
                raise ValueError(msg)

            # Check if the embedding model has the required attributes
            if not hasattr(embedding_model, "client") or not embedding_model.client:
                msg = "Embedding model client not properly initialized"
                raise ValueError(msg)

            # Ensure the base URL has proper protocol
            if hasattr(embedding_model.client, "base_url"):
                base_url = embedding_model.client.base_url
                if not base_url.startswith(("http://", "https://")):
                    embedding_model.client.base_url = f"https://{base_url}"

            # Generate embeddings using the provided embedding model
            embeddings = embedding_model.embed_documents([text_content])

            # Validate embeddings output
            if not embeddings or not isinstance(embeddings, list):
                msg = "Invalid embeddings generated"
                raise ValueError(msg)

            embedding_vector = embeddings[0]
        except Exception as e:
            logging.exception("Error generating embeddings")
            # Return empty data with error status
            error_data = Data(data={"text": "", "embeddings": [], "error": str(e)})
            self.status = {"error": str(e)}
            return error_data

        # Create a Data object to encapsulate the results
        result_data = Data(data={"text": text_content, "embeddings": embedding_vector})
        self.status = {"text": text_content, "embeddings": embedding_vector}
        return result_data
```
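For reference, the shapes of the `Data` payloads that `generate_embeddings` returns on success and on failure (the values shown are illustrative):

```python
# Illustrative Data payloads, matching the component code above.
from langflow.schema import Data

success = Data(data={"text": "hello world", "embeddings": [0.012, -0.034, 0.056]})  # vector truncated
failure = Data(data={"text": "", "embeddings": [], "error": "Embedding model not provided"})
```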
VertexAI Embeddings
This component wraps the Google Vertex AI Embeddings API.
Use this component to generate embeddings using Google’s Vertex AI service.
Ensure you have the necessary Google Cloud credentials and permissions.
Parameters
| Name | Display Name | Info |
|---|---|---|
| credentials | Credentials | The default custom credentials to use |
| location | Location | The default location to use when making API calls |
| max_output_tokens | Max Output Tokens | Token limit for text output from one prompt |
| model_name | Model Name | The name of the Vertex AI large language model |
| project | Project | The default GCP project to use when making Vertex API calls |
| request_parallelism | Request Parallelism | The amount of parallelism allowed for requests |
| temperature | Temperature | Tunes the degree of randomness in text generations |
| top_k | Top K | How the model selects tokens for output |
| top_p | Top P | Probability threshold for token selection |
| tuned_model_name | Tuned Model Name | The name of a tuned model (overrides model_name if provided) |
| verbose | Verbose | Controls the level of detail in the output |
Component code
vertexai.py
```python
from langflow.base.models.model import LCModelComponent
from langflow.field_typing import Embeddings
from langflow.io import BoolInput, FileInput, FloatInput, IntInput, MessageTextInput, Output


class VertexAIEmbeddingsComponent(LCModelComponent):
    display_name = "VertexAI Embeddings"
    description = "Generate embeddings using Google Cloud VertexAI models."
    icon = "VertexAI"
    name = "VertexAIEmbeddings"

    inputs = [
        FileInput(
            name="credentials",
            display_name="Credentials",
            info="JSON credentials file. Leave empty to fallback to environment variables",
            value="",
            file_types=["json"],
        ),
        MessageTextInput(name="location", display_name="Location", value="us-central1", advanced=True),
        MessageTextInput(name="project", display_name="Project", info="The project ID.", advanced=True),
        IntInput(name="max_output_tokens", display_name="Max Output Tokens", advanced=True),
        IntInput(name="max_retries", display_name="Max Retries", value=1, advanced=True),
        MessageTextInput(name="model_name", display_name="Model Name", value="textembedding-gecko"),
        IntInput(name="n", display_name="N", value=1, advanced=True),
        IntInput(name="request_parallelism", value=5, display_name="Request Parallelism", advanced=True),
        MessageTextInput(name="stop_sequences", display_name="Stop", advanced=True, is_list=True),
        BoolInput(name="streaming", display_name="Streaming", value=False, advanced=True),
        FloatInput(name="temperature", value=0.0, display_name="Temperature"),
        IntInput(name="top_k", display_name="Top K", advanced=True),
        FloatInput(name="top_p", display_name="Top P", value=0.95, advanced=True),
    ]

    outputs = [
        Output(display_name="Embeddings", name="embeddings", method="build_embeddings"),
    ]

    def build_embeddings(self) -> Embeddings:
        try:
            from langchain_google_vertexai import VertexAIEmbeddings
        except ImportError as e:
            msg = "Please install the langchain-google-vertexai package to use the VertexAIEmbeddings component."
            raise ImportError(msg) from e

        from google.oauth2 import service_account

        if self.credentials:
            gcloud_credentials = service_account.Credentials.from_service_account_file(self.credentials)
        else:
            # will fallback to environment variable or inferred from gcloud CLI
            gcloud_credentials = None
        return VertexAIEmbeddings(
            credentials=gcloud_credentials,
            location=self.location,
            max_output_tokens=self.max_output_tokens or None,
            max_retries=self.max_retries,
            model_name=self.model_name,
            n=self.n,
            project=self.project,
            request_parallelism=self.request_parallelism,
            stop=self.stop_sequences or None,
            streaming=self.streaming,
            temperature=self.temperature,
            top_k=self.top_k or None,
            top_p=self.top_p,
        )
```
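A credentials sketch mirroring the component's fallback logic, with a placeholder service-account file and project ID:

```python
# Explicit service-account credentials, or None to fall back to the
# environment / gcloud CLI, as the component does.
from google.oauth2 import service_account
from langchain_google_vertexai import VertexAIEmbeddings

creds = service_account.Credentials.from_service_account_file("sa.json")  # placeholder path
embedder = VertexAIEmbeddings(
    credentials=creds,
    project="my-gcp-project",  # placeholder project ID
    location="us-central1",
    model_name="textembedding-gecko",
)
vector = embedder.embed_query("hello vertex")
```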