nodetool scrub
Creates a snapshot and then rebuilds SSTables for one or more tables.
Creates a snapshot and then rebuilds SSTables for one or more tables on a node. If possible use nodetool upgradesstables instead of scrub.
Scrub automatically discards broken data and removes any tombstone rows that have exceeded the grace period of the table. If partition key values do not match the column data type, the partition is considered corrupt and the process automatically stops.
nodetool
scrub
resets all SSTables back to Level 0 and requires recompaction of
all SSTables.Synopsis
nodetool [connection_options] scrub [-j num_jobs] [-n] [-ns] [-r] [-s] [--] [keyspace_name table_name [table_name ...]]
Syntax conventions | Description |
---|---|
UPPERCASE | Literal keyword. |
Lowercase | Not literal. |
Italics |
Variable value. Replace with a valid option or user-defined value. |
[ ] |
Optional. Square brackets ( [ ] ) surround optional command
arguments. Do not type the square brackets. |
( ) |
Group. Parentheses ( ( ) ) identify a group to choose from. Do
not type the parentheses. |
| |
Or. A vertical bar ( | ) separates alternative elements. Type
any one of the elements. Do not type the vertical bar. |
... |
Repeatable. An ellipsis ( ... ) indicates that you can repeat
the syntax element as often as required. |
'Literal string' |
Single quotation ( ' ) marks must surround literal strings in
CQL statements. Use single quotation marks to preserve upper case. |
{ key:value } |
Map collection. Braces ( { } ) enclose map collections or key
value pairs. A colon separates the key and the value. |
<datatype1,datatype2> |
Set, list, map, or tuple. Angle brackets ( < > ) enclose
data types in a set, list, map, or tuple. Separate the data types with a comma.
|
cql_statement; |
End CQL statement. A semicolon ( ; ) terminates all CQL
statements. |
[ -- ] |
Separate the command line options from the command arguments with two hyphens (
-- ). This syntax is useful when arguments might be mistaken for
command line options. |
' <schema> ... </schema>
' |
Search CQL only: Single quotation marks ( ' ) surround an entire
XML schema declaration. |
@xml_entity='xml_entity_type' |
Search CQL only: Identify the entity and literal value to overwrite the XML element in the schema and solrconfig files. |
Definition
The short form and long form parameters are comma-separated.
Connection options
- -h, --host hostname
- The hostname or IP address of a remote node or nodes. When omitted, the default is the local machine.
- -p, --port jmx_port
- The JMX port number.
- -pw, --password jmxpassword
- The JMX password for authenticating with secure JMX. If a password is not provided, you are prompted to enter one.
- -pwf, --password-file jmx_password_filepath
- The filepath to the file that stores JMX authentication credentials.
- -u, --username jmx_username
- The username for authenticating with secure JMX.
Command arguments
--
- Separates an option from an argument that could be mistaken for a option.
- -j, --jobs num_jobs
-
- num_jobs - Number of SSTables affected simultaneously. Default: 2.
- 0 - Use all available compaction threads.
- keyspace_name
- The keyspace name.
- -n, --no-validate
- Do not validate columns using column validator.
- -ns, --no-snapshot
- If disablesnapshot is false, scrubbed tables are snapshotted first. Default: false.
- -r, --reinsert-overflowed-ttl
- Rewrite rows with overflowed expiration date affected by CASSANDRA-14092 with the maximum supported expiration date of 2038-01-19T03:14:06+00:00. The rows are rewritten with the original timestamp incremented by one millisecond to override/supersede any potential tombstone that may have been generated during compaction of the affected rows.
- -s, --skip-corrupted
- Skip corrupted partitions even when scrubbing counter tables. Default is false.
- table_name
- One or more table names, separated by a space.
Examples
Scrub a single table on a keyspace
nodetool scrub cycling cyclist_id
Scrub an entire keyspace while skipping corrupted partitions
nodetool scrub -s cycling