Adding single-token nodes to a cluster
Steps for adding nodes in single-token architecture clusters, not vnodes.
Steps for adding nodes in single-token architecture clusters, not vnodes.
- Add capacity by doubling the cluster size: Adding capacity by doubling (or tripling or quadrupling) the number of nodes is less complicated when assigning tokens. Using this method, existing nodes keep their existing token assignments, and the new nodes are assigned tokens that bisect (or trisect) the existing token ranges.
- Add capacity for a non-uniform number of nodes: When increasing capacity with this method, you must recalculate tokens for the entire cluster, and assign the new tokens to the existing nodes.
Procedure
- Calculate the tokens for the nodes based on your expansion strategy using the Token Generating Tool.
- Install Cassandra and configure Cassandra on each new node.
- If Cassandra starts automatically (Debian), stop the node and clear the data.
-
Configure cassandra.yaml on each new
node:
- auto_bootstrap: If false, set it to
true.
This option is not listed in the default cassandra.yaml configuration file and defaults to true.
- cluster_name
- cassandra.yaml configuration file/broadcast_address: Usually leave blank. Otherwise, use the IP address or host name that other Cassandra nodes use to connect to the new node.
- endpoint_snitch
- initial_token:
Set according to your token calculations.CAUTION: If this property has no value, Cassandra assigns the node a random token range and results in a badly unbalanced ring.
- seed_provider:
Make sure that the new node lists at least one seed node in the existing
cluster.
Seed nodes cannot bootstrap. Make sure the new nodes are not listed in the -seeds list. Do not make all nodes seed nodes. See Internode communications (gossip).
- Change any other non-default settings in the new nodes to match the existing nodes. Use the diff command to find and merge any differences between the nodes.
- auto_bootstrap: If false, set it to
true.
- Depending on the snitch, assign the datacenter and rack names in the cassandra-topology.properties or cassandra-rackdc.properties for each node.
-
Start Cassandra on each new node
in two minutes intervals with consistent.rangemovement
turned on:
- Package installations: To each bootstrapped node, add the
following option to the
/usr/share/cassandra/cassandra-env.sh file and then
start
Cassandra:
JVM_OPTS="$JVM_OPTS -Dcassandra.consistent.rangemovement=true
- Tarball
installations:
$ bin/cassandra -Dcassandra.consistent.rangemovement=true
- Package installations: To each bootstrapped node, add the
following option to the
/usr/share/cassandra/cassandra-env.sh file and then
start
Cassandra:
The following operations are resource intensive and should be done during low-usage times.
- After the new nodes are fully bootstrapped, use nodetool move to assign the new initial_token value to each node that requires one, one node at a time.
-
After all nodes have their new tokens assigned, run nodetool cleanup on each node in the cluster and wait for cleanup to
complete on each node before doing the next node.
This step removes the keys that no longer belong to the previously existing nodes.
The location of the cassandra-rackdc.properties file depends on the type of installation:Package installations /etc/cassandra/cassandra-rackdc.properties Tarball installations install_location/conf/cassandra-rackdc.properties The location of the cassandra-topology.properties file depends on the type of installation:Package installations /etc/cassandra/cassandra-topology.properties Tarball installations install_location/conf/cassandra-topology.properties The location of the cassandra.yaml file depends on the type of installation:Package installations /etc/cassandra/cassandra.yaml Tarball installations install_location/resources/cassandra/conf/cassandra.yaml