The sstablesplit utility

Use this tool to split SSTables files into multiple SSTables of a maximum designated size.

Use this tool to split SSTables files into multiple SSTables of a maximum designated size. For example, if SizeTieredCompactionStrategy was used for a major compaction and results in a excessively large SSTable, it's a good idea to split the table because won't get compacted again until the next huge compaction.

Cassandra must be stopped to use this tool:
  • Packaged installs:
    $ sudo service cassandra stop
  • Tarball installs:
    $ ps auwx | grep cassandra
    $ sudo kill pid

Usage:

  • Packaged installs: sstablessplit [options] <filename> [<filename>]*
  • Tarball installs: install_location/bin/sstablessplit [options] <filename> [<filename>]*
Example:
$ sstablesplit -s 40 /var/lib/cassandra/data/Keyspace1/Standard1/*
Options
Flag Option Description
--debug Display stack traces.
-h --help Display help.
  --no-snapshot Do not snapshot the SSTables before splitting.
-s --size <size> Maximum size in MB for the output SSTables (default: 50).
-v --verbose Verbose output.