Generate and store embeddings in Astra DB Serverless databases

There are two ways to load embeddings into Serverless (vector) databases:

What are embedding providers?

Embedding providers are services that help you generate embeddings for your data to perform vector search queries.

The provider handles infrastructure, model maintenance, and other tasks necessary to generate embeddings from embedding models.

Providers may use one or more models to generate embeddings. When choosing an embedding provider, consider factors like the available embedding models, vector dimensions, supported data types, quality, accuracy, and scalability.

Astra-hosted integrations for vectorize

Vectorize generates embeddings through supported embedding provider integrations.

DataStax-managed embedding provider integrations are hosted within Astra. These integrations don’t require your own embedding provider account or credentials because they are managed by Astra. However, there are restrictions on the available regions and configuration options.

Databases in supported regions can configure collections and tables to automatically use these integrations:

Embedding provider Documentation

NVIDIA

Get started

External integrations for vectorize

An external embedding provider integration uses your embedding provider account to generate embeddings. You can incur billed charges for this use according to your agreement with your provider.

To use an external embedding provider with Astra vectorize, you must attach your embedding provider account to your Astra organization by enabling the embedding provider integration in your Astra organization. Then, you can attach the embedding provider integration to a collection or table.

All providers follow the same general integration process. However, each provider has specific configuration options, such as models, dimensions, credentials, and other parameters. For complete instructions, see the documentation for your embedding provider:

Embedding provider Documentation

Azure OpenAI

Get started

Hugging Face - Dedicated

Get started

Hugging Face - Serverless

Get started

Jina AI

Get started

Mistral AI

Get started

OpenAI

Get started

Upstage

Get started

Voyage AI

Get started

Was this helpful?

Give Feedback

How can we improve the documentation?

© Copyright IBM Corporation 2026 | Privacy policy | Terms of use Manage Privacy Choices

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries. Kubernetes is the registered trademark of the Linux Foundation.

General Inquiries: Contact IBM