Bulk uploading S3 backups using the AWS CLI
Use the AWS CLI instead of the AWS SDK when bulk loading backups to Amazon S3 locations. Using the S3 CLI is a Labs feature that must be enabled.
Use the AWS CLI instead of the AWS SDK when bulk loading backups to Amazon S3 locations. Using the AWS CLI rather than the AWS SDK can result in a performance increase, with a noticeable decrease in the time it takes to complete a backup. This is an OpsCenter Labs feature (that is, under ongoing development but available for use). The feature is available in OpsCenter versions 6.1.3 and later.
For more information, see AWS CLI in the Amazon documentation.
address.yaml
The location of the address.yaml file depends on the type of installation:- Package installations: /var/lib/datastax-agent/conf/address.yaml
- Tarball installations: install_location/conf/address.yaml
cluster_name.conf
The location of the cluster_name.conf file depends on the type of installation:- Package installations: /etc/opscenter/clusters/cluster_name.conf
- Tarball installations: install_location/conf/clusters/cluster_name.conf
Prerequisites
- Install the AWS CLI package on every node. DataStax recommends using the Amazon
bundled installer method and upgrading to the latest version of AWS CLI if it is
already installed. See Install the AWS CLI using the bundled
installer in the Amazon documentation for installation
procedures.Tip: As a recommended best practice for OpsCenter, install the AWS CLI bundle using APT as follows:
sudo apt-get install -y unzip curl 'https://s3.amazonaws.com/aws-cli/awscli-bundle.zip' -o awscli-bundle.zip unzip awscli-bundle.zip sudo ./awscli-bundle/install -i /usr/local/aws -b /usr/local/bin/aws
Important: Regardless of the install procedure used, make sure that the AWS CLI package is installed in the PATH of thecassandra
user, or whichever user the DataStax agent runs as. - Add an S3 location for backups.
Procedure
-
Open cluster_name.conf for editing. Substitute
cluster_name with the name of your cluster. Setting agent
options through the cluster configuration file sets the corresponding property
in address.yaml on every node.
If necessitated by your environment, open address.yaml for editing and configure at the node level. Do so for every node that requires a specific configuration override.
-
Add the following configuration option:
[labs] use_s3_cli = True
- Save the configuration file or files.
- Restart the OpsCenter daemon.
- Optional: If you made changes to address.yaml, restart the DataStax agents.