Replication and consistency

In Cassandra-based databases you can configure both the replication and consistency of your data. Configure the replication factor to control the process of copying data to multiple replica nodes, ensuring its availability and durability. Set the consistency level to specify how many replica nodes must acknowledge a request for it to succeed.

Replication

Astra DB automatically manages replication, ensuring data is distributed across multiple cloud availability zones for fault tolerance and high availability. You cannot manually configure replication in Astra DB.

To configure data replication in Cassandra-based databases, you need to define a keyspace and specify the replication strategy and replication factor. The replication strategy controls how the data is distributed across the cluster. The replication factor determines the number of copies of data that are stored in the cluster. All tables in a keyspace use the same replication strategy and replication factor.

To ensure availability, most production databases should use a replication factor of 3.

Configure replication

DataStax recommends the NetworkTopologyStrategy to distribute data in single or multi-datacenter clusters.

Single datacenter
Multiple datacenters

To create a keyspace with 3 replicas in the default datacenter, use the following CQL command:

CREATE KEYSPACE user_profiles WITH replication = {
  'class': 'NetworkTopologyStrategy',
  'datacenter1': 3
};

To create a keyspace with 3 replicas in the east datacenter and 3 replicas in the west datacenter, use the following CQL command:

CREATE KEYSPACE user_profiles WITH replication = {
  'class': 'NetworkTopologyStrategy',
  'east': 3,
  'west': 3
};

Consistency level

In Cassandra-based databases, the consistency level together with the replication factor determines the number of replica nodes that must acknowledge a read or write operation for it to succeed. You can configure the consistency level for both read and write operations.

You can set the consistency level in the driver, in the connection, or for individual operations.

Write consistency

The database will always attempt to write data to the number of replica nodes specified by the replication factor. Write operations will only succeed if the number of nodes specified by the consistency level acknowledge the write operation.

Low write consistency levels can result in different nodes holding different versions of the data. Cassandra-based databases are eventually consistent since they use internal mechanisms to synchronize data to all nodes over time.

Write consistency levels

Level Description Usage

Level	Description	Usage
`ALL`	All replica nodes must acknowledge the write.	This write consistency level provides the highest consistency, the highest latency, and the lowest availability of any level.
`QUORUM`	A quorum of replica nodes across all datacenters must acknowledge the write.	Cross-datacenter communication may incur extra latency.
`ONE`	At least one replica node must acknowledge the write.	Use for high availability and low consistency. Note: Astra DB does not support write consistency level `ONE`.
`TWO`	At least two replica nodes must acknowledge the write.	Similar to `ONE`.
`THREE`	At least three replica nodes must acknowledge the write.	Similar to `TWO`.
`ANY`	At least one replica node must acknowledge the write, or if no replica nodes are available, a coordinator node must store a hint. If all replica nodes are down at write time, the data will not be available until the replica nodes for that partition have recovered.	This write consistency level provides the lowest latency, the highest write availability, and the lowest consistency. Note: Astra DB does not support write consistency level `ANY`.
`LOCAL_QUORUM`	A quorum of replica nodes in the local datacenter must acknowledge the write. Avoids latency of cross-datacenter communication.	Use `LOCAL_QUORUM` to maintain consistency within a single datacenter in a multiple-datacenter cluster.
`EACH_QUORUM`	A quorum of replica nodes in the each datacenter must acknowledge the write.	Use `EACH_QUORUM` in multi-datacenter clusters to strictly maintain consistency at the same level in each datacenter. Writes will fail if any datacenter fails to achieve a quorum.
`LOCAL_ONE`	At least one replica node in the local datacenter must acknowledge the write.	Use `LOCAL_ONE` to a consistency level of `ONE` without cross-datacenter communication. Note: Astra DB does not support write consistency level `LOCAL_ONE`.
`SERIAL`	Write consistency level `SERIAL` is a special case for compare and set operations like `INSERT`, `UPDATE`, and `DELETE` operations that use an `IF` clause.	Use to achieve linearizable consistency for lightweight transactions.
`LOCAL_SERIAL`	Write consistency level `LOCAL_SERIAL` is the same as `SERIAL`, but confined to the local datacenter.	Use to achieve linearizable consistency for lightweight transactions.

ALL

All replica nodes must acknowledge the write.

This write consistency level provides the highest consistency, the highest latency, and the lowest availability of any level.

QUORUM

A quorum of replica nodes across all datacenters must acknowledge the write.

Cross-datacenter communication may incur extra latency.

ONE

At least one replica node must acknowledge the write.

Use for high availability and low consistency.

Note: Astra DB does not support write consistency level ONE.

TWO

At least two replica nodes must acknowledge the write.

Similar to ONE.

THREE

At least three replica nodes must acknowledge the write.

Similar to TWO.

ANY

At least one replica node must acknowledge the write, or if no replica nodes are available, a coordinator node must store a hint. If all replica nodes are down at write time, the data will not be available until the replica nodes for that partition have recovered.

This write consistency level provides the lowest latency, the highest write availability, and the lowest consistency.

Note: Astra DB does not support write consistency level ANY.

LOCAL_QUORUM

A quorum of replica nodes in the local datacenter must acknowledge the write. Avoids latency of cross-datacenter communication.

Use LOCAL_QUORUM to maintain consistency within a single datacenter in a multiple-datacenter cluster.

EACH_QUORUM

A quorum of replica nodes in the each datacenter must acknowledge the write.

Use EACH_QUORUM in multi-datacenter clusters to strictly maintain consistency at the same level in each datacenter. Writes will fail if any datacenter fails to achieve a quorum.

LOCAL_ONE

At least one replica node in the local datacenter must acknowledge the write.

Use LOCAL_ONE to a consistency level of ONE without cross-datacenter communication.

Note: Astra DB does not support write consistency level LOCAL_ONE.

SERIAL

Write consistency level SERIAL is a special case for compare and set operations like INSERT, UPDATE, and DELETE operations that use an IF clause.

Use to achieve linearizable consistency for lightweight transactions.

LOCAL_SERIAL

Write consistency level LOCAL_SERIAL is the same as SERIAL, but confined to the local datacenter.

Use to achieve linearizable consistency for lightweight transactions.

Read consistency

The database will only attempt to read data from the number of replica nodes specified by the consistency level. Read operations will only succeed if the number of nodes specified by the consistency level acknowledge the read operation.

Read consistency levels

Level Description Usage

Level	Description	Usage
`ALL`	Queries return the most recent data from all replica nodes in the cluster. All replica nodes must must respond.	This read consistency level provides the highest consistency, the highest latency, and the lowest availability of any level.
`QUORUM`	Queries return the most recent data from a quorum of replica nodes across all datacenters.	Cross-datacenter communication may incur extra latency.
`ONE`	Queries return data from the closest replica.	Use for high availability and low consistency.
`TWO`	Queries return the most recent data from two of the closest replicas. Two replica nodes must respond.	Similar to `ONE`.
`THREE`	Queries return the most recent data from three of the closest replicas. Three replica nodes must respond.	Similar to `TWO`.
`LOCAL_QUORUM`	Queries returns the most recent data from a quorum of replicas in the current datacenter. `LOCAL_QUORUM` avoids latency of cross-datacenter communication.	Use `LOCAL_QUORUM` to maintain consistency within a single datacenter in a multiple-datacenter cluster.
`EACH_QUORUM`	Queries return the most recent data from a quorum of replica nodes in each datacenter has responded.	Use `EACH_QUORUM` in multiple datacenter clusters to ensure data consistency at the same level in each datacenter. Queries will fail if any datacenter fails to achieve a quorum.
`LOCAL_ONE`	Queries return data from the closest replica node in the local datacenter.	Use `LOCAL_ONE` to achieve a consistency level of `ONE` without cross-datacenter communications.
`SERIAL`	Read consistency level `SERIAL` is a special case for querying data that may be involved in in-flight lightweight transactions.	Use to achieve linearizable consistency for lightweight transactions.
`LOCAL_SERIAL`	Read consistency level `LOCAL_SERIAL` is the same as `SERIAL`, but confined to the local datacenter.	Use to achieve linearizable consistency for lightweight transactions.

ALL

Queries return the most recent data from all replica nodes in the cluster. All replica nodes must must respond.

This read consistency level provides the highest consistency, the highest latency, and the lowest availability of any level.

QUORUM

Queries return the most recent data from a quorum of replica nodes across all datacenters.

Cross-datacenter communication may incur extra latency.

ONE

Queries return data from the closest replica.

Use for high availability and low consistency.

TWO

Queries return the most recent data from two of the closest replicas. Two replica nodes must respond.

Similar to ONE.

THREE

Queries return the most recent data from three of the closest replicas. Three replica nodes must respond.

Similar to TWO.

LOCAL_QUORUM

Queries returns the most recent data from a quorum of replicas in the current datacenter. LOCAL_QUORUM avoids latency of cross-datacenter communication.

Use LOCAL_QUORUM to maintain consistency within a single datacenter in a multiple-datacenter cluster.

EACH_QUORUM

Queries return the most recent data from a quorum of replica nodes in each datacenter has responded.

Use EACH_QUORUM in multiple datacenter clusters to ensure data consistency at the same level in each datacenter. Queries will fail if any datacenter fails to achieve a quorum.

LOCAL_ONE

Queries return data from the closest replica node in the local datacenter.

Use LOCAL_ONE to achieve a consistency level of ONE without cross-datacenter communications.

SERIAL

Read consistency level SERIAL is a special case for querying data that may be involved in in-flight lightweight transactions.

Use to achieve linearizable consistency for lightweight transactions.

LOCAL_SERIAL

Read consistency level LOCAL_SERIAL is the same as SERIAL, but confined to the local datacenter.

Use to achieve linearizable consistency for lightweight transactions.

Immediate consistency

Immediate consistency ensures that read operations always return the most recent version of data. You can configure read and write consistency levels to achieve immediate consistency. Immediate consistency is sometimes referred to as strong consistency.

The simplest way to ensure immediate consistency is to use the consistency level ALL for both read and write operations. With this approach, both read and write operations will only succeed if all replica nodes respond to the operation. While this guarantees immediate consistency, it also results in the highest latency and lowest availablity.

You can use different consistency levels for read and write operations to achieve immediate consistency. Selecting different consistency levels for read and write operations can help you balance consistency, availability, and latency.

The formula for immediate consistency is:

Write Consistency Level + Read Consistency Level > Replication Factor

The following table contains some examples of write and read consistency levels that achieve immediate consistency.

Write Consistency Level Read Consistency Level Description

Write Consistency Level	Read Consistency Level	Description
`LOCAL_QUORUM`	`LOCAL_QUORUM`	Average latency and availability. Consistency levels are met in the local datacenter. Suitable for balanced workloads.
`ALL`	`ONE`	High write latency, low write availability, low read latency, high read availability. Suitable for read-heavy workloads.
`ONE`	`ALL`	Low write latency, high write availability, high read latency, low read availability. Suitable for write-heavy workloads.

LOCAL_QUORUM

Average latency and availability. Consistency levels are met in the local datacenter. Suitable for balanced workloads.

ALL

ONE

High write latency, low write availability, low read latency, high read availability. Suitable for read-heavy workloads.

ONE

ALL

Low write latency, high write availability, high read latency, low read availability. Suitable for write-heavy workloads.

Replication and consistency

Replication

Configure replication

Consistency level

Write consistency

Read consistency

Immediate consistency

Was this helpful?

Give Feedback