Data distribution and replication

In Hyper-Converged Database (HCD), data distribution and replication go together. HCD organizes data by table and identifies each row by its primary key; the partition key portion of that key determines which node stores the row. To ensure reliability and fault tolerance, HCD stores multiple copies of each row, called replicas, on different nodes. The copy created when a row is first written also counts as a replica. All replicas are equally important; there is no primary replica.
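The relationship between a row's key, the node that stores it, and its replicas can be illustrated with a small token-ring sketch. This is a simplified model, not HCD's implementation: the `Ring` class, the `token_for` helper, and the node names are hypothetical, MD5 stands in for the real partitioner, each node owns a single token, and replicas are placed on the next nodes clockwise around the ring without regard to racks or datacenters.

```python
import bisect
import hashlib


def token_for(key: str) -> int:
    """Hash a key to a 64-bit token (MD5 is a stand-in, not HCD's partitioner)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


class Ring:
    """Toy token ring: one token per node, replicas placed clockwise."""

    def __init__(self, nodes, replication_factor=3):
        if replication_factor > len(nodes):
            raise ValueError("replication factor cannot exceed the node count")
        self.replication_factor = replication_factor
        # Each node's position on the ring is derived from its (unique) name.
        self._ring = sorted((token_for(name), name) for name in nodes)
        self._tokens = [token for token, _ in self._ring]

    def replicas(self, partition_key: str):
        """Return the nodes that hold a copy of the row with this partition key."""
        token = token_for(partition_key)
        # First node whose token is >= the key's token, wrapping past the end.
        start = bisect.bisect_left(self._tokens, token) % len(self._ring)
        return [
            self._ring[(start + i) % len(self._ring)][1]
            for i in range(self.replication_factor)
        ]


if __name__ == "__main__":
    ring = Ring(["node-a", "node-b", "node-c", "node-d"], replication_factor=3)
    # Every node in the returned list holds an equal copy; none is "primary".
    print(ring.replicas("user:42"))
```

Because placement depends only on the hash of the partition key, any node can compute where a row's replicas live without consulting a central coordinator, which is consistent with the statement that no replica is primary.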

Learn how data is distributed to the nodes in a cluster.

The following features affect replication:
