sstablesplit

Use this tool to split SSTable files into multiple SSTables of a designated maximum size. For example, if SizeTieredCompactionStrategy was used for a major compaction and produced an excessively large SSTable, it is a good idea to split that SSTable, because it will not be compacted again until the next huge compaction.

DataStax Enterprise must be stopped to use this tool.

Usage:

sstablesplit [options] <filename> [<filename>]*

Tarball and Installer No-Services path:

<installation_location>/resources/cassandra/tools/bin

Example:

sstablesplit -s 40 /var/lib/cassandra/data/Keyspace1/Standard1/*
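
The example above passes every file in the table directory to the tool. As a rough sketch of a narrower selection, the helper below lists only the data files that exceed a given size, so that only those are handed to sstablesplit. The function name list_large_sstables is hypothetical, and the sketch assumes that SSTable data files end in -Data.db.

```shell
# list_large_sstables DIR MB
# Hypothetical helper (not part of sstablesplit): print every *-Data.db
# file in DIR whose size is greater than MB megabytes.
list_large_sstables() {
    dir="$1"
    limit_bytes=$(( $2 * 1024 * 1024 ))
    for f in "$dir"/*-Data.db; do
        # Skip the unexpanded pattern when no file matches.
        [ -f "$f" ] || continue
        # Arithmetic expansion strips any padding that wc adds.
        bytes=$(( $(wc -c < "$f") ))
        if [ "$bytes" -gt "$limit_bytes" ]; then
            echo "$f"
        fi
    done
}
```

With DataStax Enterprise stopped, the output could then be passed on, for example: sstablesplit -s 40 $(list_large_sstables /var/lib/cassandra/data/Keyspace1/Standard1 40). This unquoted form assumes the paths contain no whitespace.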

Options

Flag    Option           Description
        --debug          Display stack traces.
-h      --help           Display help.
        --no-snapshot    Do not snapshot the SSTables before splitting.
-s      --size <size>    Maximum size in megabytes (MB) for the output SSTables (default: 50).
-v      --verbose        Verbose output.

Some tools require a larger heap size. For example, OutOfMemoryError exceptions may be resolved by increasing the heap size for sstablesplit.

To increase the heap size for sstablesplit to 8GB, change the following line in the tools/bin/sstablesplit shell script from:

MAX_HEAP_SIZE="256M"

to:

MAX_HEAP_SIZE="8G"
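
The manual edit above can also be scripted. The following is a minimal sketch, not an official procedure: set_max_heap is a hypothetical helper name, and it assumes the script assigns MAX_HEAP_SIZE on a line of its own, possibly indented. It keeps a .bak copy of the original script so the change can be reverted.

```shell
# set_max_heap SCRIPT SIZE
# Hypothetical helper: rewrite the MAX_HEAP_SIZE assignment in SCRIPT
# to SIZE (for example 8G), keeping the original as SCRIPT.bak.
set_max_heap() {
    script="$1"
    size="$2"
    # Back up the script before touching it.
    cp "$script" "$script.bak" || return 1
    # Replace only lines that assign MAX_HEAP_SIZE, preserving indentation.
    # Writing through a redirect keeps the script's existing permissions.
    sed "s/^\([[:space:]]*\)MAX_HEAP_SIZE=.*/\1MAX_HEAP_SIZE=\"$size\"/" \
        "$script.bak" > "$script"
}
```

A call would take the path to the tools/bin/sstablesplit script under your installation location as the first argument and 8G as the second.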

