nodetool cleanup

Cleans up keyspaces and partition keys no longer belonging to a node.

Cleans up keyspaces and partition keys no longer belonging to a node.

Synopsis

nodetool options cleanup [keyspace_name [table_name] […] ]
Cassandra tarball installations:
installation_location/cassandra/bin
Table 1. Options
Short Long Description
-h --host Hostname or IP address.
-j --job Number of sstables to cleanup simultaneously; 0 uses all available compaction threads.
-p --port Port number.
-pwf --password-file Password file path.
-pw --password Password.
-u --username Remote JMX agent username.
keyspace Keyspace name.
table One or more table names, separated by a space.
-- Separates an option from an argument that could be mistaken for a option.

Description

Use this command to remove unwanted data after adding a new node to the cluster. Cassandra does not automatically remove data from nodes that lose part of their partition range to a newly added node. Run nodetool cleanup on the source node and on neighboring nodes that shared the same subrange after the new node is up and running. Failure to run this command after adding a node causes Cassandra to include the old data to rebalance the load on that node. Running the nodetool cleanup command causes a temporary increase in disk space usage proportional to the size of your largest SSTable. Disk I/O occurs when running this command.

Optionally, this command takes a list of table names. If you do not specify a keyspace, this command cleans all keyspaces no longer belonging to a node.