Data distribution and replication

In Hyper-Converged Database (HCD), data distribution and replication go together. HCD organizes data by table and uses a primary key to identify unique records, helping determine the node on which to store data. Replicas are copies of rows stored on multiple nodes to ensure reliability and fault tolerance. A replica also refers to data first written. All replicas are equally important; there is no primary replica.

Learn the important concept of how the data is distributed to the nodes in a cluster.

Features affecting replication include:

Was this helpful?

Give Feedback

How can we improve the documentation?

© 2024 DataStax | Privacy policy | Terms of use

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries. Kubernetes is the registered trademark of the Linux Foundation.

General Inquiries: +1 (650) 389-6000, info@datastax.com