Migrate to Astra Managed Clusters
The migration process includes exporting data from its original location, uploading or importing your data to your new Astra DB databases, and then updating your applications to connect to your new databases.
Migrate your data
DataStax offers several options to help migrate your data to Astra DB.
Migrate from DSE, HCD, and other Apache Cassandra® clusters
The following tools are designed to migrate Cassandra table data into a Cassandra-compatible cluster, such as Astra DB:
-
DataStax Zero Downtime Migration (ZDM) tools: Move your data to Astra DB with minimal configuration changes and little or no downtime.
The ZDM tools orchestrate live read and write traffic between your original database and new Astra DB database while you use a data migration tool to migrate your preexisting data to the new database. Data migration tools include CDM and DSBulk.
All new writes are sent to both databases, and your original database remains the source of truth for reads until you’re ready to permanently switch to your Astra DB database.
You can terminate the migration at any time before the final cutover with no impact to your original database.
-
Cassandra Data Migrator (CDM): Migrate and validate tables between origin Cassandra clusters and target Astra DB databases, with available logging and reconciliation support.
You can use CDM alone or in conjunction with the ZDM tools.
-
DataStax Bulk Loader (DSBulk) is an OSS command-line tool that you can use to extract and load CSV and JSON files containing Cassandra table data. You can use DSBulk to bring data from Cassandra, DSE, or HCD into Astra DB, as well as move data between collections and tables in Astra DB databases.
Use the following table to learn which tools are compatible with your current database provider or service. If you have questions about migrating from a specific source to Astra DB, contact your DataStax account representative or IBM Support.
| Origin | Astra DB Sideloader | Cassandra Data Migrator (CDM) | ZDM Proxy | DataStax Bulk Loader (DSBulk) |
|---|---|---|---|---|
Aiven for Cassandra |
||||
Amazon Keyspaces |
||||
Apache Cassandra® OSS 3.11 or later |
||||
Apache Cassandra® OSS 3.10 or earlier |
||||
Azure Cosmos DB (Cassandra API) |
||||
Azure Managed Instance for Apache Cassandra |
||||
DSE 5.1 or later |
||||
DSE 5.0 or earlier |
||||
Instaclustr Managed Cassandra |
||||
K8ssandra (self-managed) |
||||
Scylla Cloud |
||||
Scylla OSS or Enterprise |
||||
Yugabyte Aeon (YCQL) |
||||
Yugabyte OSS or Anywhere |
Migrate from non-Cassandra sources
Because Astra DB is based on Apache Cassandra, it expects data to be in a format that is compatible with Cassandra table schemas.
You can use techniques like super shredding to flatten, normalize, and map schemaless or semi-structured JSON/CSV data into a Cassandra-compatible fixed schema, and then load the data into Astra DB with DSBulk or other tools. However, super shredding can be complex and cumbersome, depending on the structure (or lack thereof) of the source data.
Migrate your code
After migrating your data to Astra DB, your applications can connect exclusively to your Astra DB databases.
Aside from the database connection, your code might not require any other changes if you already use a compatible driver and CQL statements.
For more information about connecting to Astra DB and migrating your applications, see the following:
Get support for your migration
If you need help planning or executing your migration to Astra, contact your DataStax account representative or IBM Support.
If you have a subscription to IBM Elite Support for Apache Cassandra, contact IBM Elite Support or your account representative to see if your plan includes migration assistance.
For any observed problems with ZDM Proxy or the other open-source ZDM and data migration tools, you can report an issue in their respective GitHub repositories:
-
ZDM Proxy Automation repository (includes ZDM Proxy Automation and ZDM Utility)