Configuring tiered storage

About this task

DSE Tiered Storage segregates hot and cold data to designated paths on different gradations of storage media, which can facilitate cost savings for data storage. Configure the storage tier strategy and paths to each tier in dse.yaml within an Lifecycle Manager (LCM) configuration profile. Run an install or configure job in LCM to push the configuration to all applicable nodes.

Procedure

  1. Click Config Profiles from the Lifecycle Manager navigation menu.

  2. Click the edit icon for the configuration profile to edit, or click Add config profile if you have not already created a profile.

  3. In the Config Profile pane under the Cassandra section, click dse.yaml.

  4. Scroll down to the Miscellaneous pane and locate tiered_storage_options.

    tieredStorageOptions

  5. Click + Add an item under the tiered_storage_options option.

    The Add tiered_storage_opions dialog displays.

    tieredStorageStrategyLCMcp

  6. Enter a strategy_name.

    For DSE clusters to use the tiered storage strategy, the strategy must be referenced in the config of the table schema. See Configuring DSE Tiered Storage.

  7. Optional Under local_options, click + Add an item to add local configuration options to overwrite the tiered storage settings in the table schema.

    The class or the tiered storage configuration name cannot be overwritten. For more details and an example, see Testing DSE Tiered Storage configurations.

    The Add local_options dialog displays.

    1. Enter a key and value for each local option. For example, type max_tier_ages for key and 3600 for its value.

    2. Click Save.

  8. Under tiers, click + Add an item to define the paths to the storage tiers.

    The Add tiers dialog displays.

    1. Enter the paths to the directory for each tier in the strategy.

      Tier order of each path matters. Add the paths for each tier in the order that represents each successive tier in a strategy. Each tier added is used in the order listed, with the top tier typically routing data to the fastest storage media.

      Ensure the directories referenced in the path already exist. LCM does not create the directories. If the directories do not exist when the installation or configuration job runs, DSE does not start up.

    2. Click Save to add the strategy and tiers to the configuration profile.

  9. Click Save to save the Config Profile.

What’s next

  1. Apply tiered storage to a table schema using CREATE or ALTER table statements, and define the maximum age of data in each tier. See Configuring DSE Tiered Storage and Testing DSE Tiered Storage configurations for complete details.

    CREATE TABLE ks.tbl (k INT, c INT, v INT, PRIMARY KEY (k, c))
    WITH COMPACTION={'class':'org.apache.cassandra.db.compaction.TieredCompactionStrategy',
        'tiering_strategy': 'TimeWindowStorageStrategy',
        'config': 'strategy1',
        'max_tier_ages': '3600,7200'};
  2. Go to the Clusters workspace in Lifecycle Manager and select the configuration profile to apply at the cluster, datacenter, or node level. Nodes can inherit configuration profile settings from the cluster or datacenter levels, or have settings at the node level that take precedence.

  3. Run a configuration job to push the configuration to all of the applicable nodes.

  4. Add dashboard graphs and alerts to monitor data distribution amongst storage tiers using the tiered storage metrics available in OpsCenter.

Was this helpful?

Give Feedback

How can we improve the documentation?

© 2024 DataStax | Privacy policy | Terms of use

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries. Kubernetes is the registered trademark of the Linux Foundation.

General Inquiries: +1 (650) 389-6000, info@datastax.com