Install DataStax Enterprise 5.1 using the binary tarball

You can install DataStax Enterprise (DSE) 5.1 on a bare metal or virtual machine (VM) environment using a binary tarball. This installation method lets you run DSE on Linux platforms as a standalone process, with or without root permissions. These instructions also describe how to store data and logs in either a default directory or a custom directory.

New installations of DSE create a default cassandra user in the database. This user can present a security risk in production environments.

To learn how to change or delete the default cassandra user, see Creating superuser accounts.

Prerequisites

Ensure that you have the following:

A supported platform.
- For RHEL-compatible platforms, make sure to enable Extra Packages for Enterprise Linux (EPEL).
A compatible version of Java 8.

Because of the end of public updates for Oracle JRE/JDK 8, DataStax recommends that you use the latest release of a Technology Compatibility Kit (TCK)-certified version of OpenJDK 8. You must use version 1.8.0_151 or later. For more information, see Installing the JDK.
Python 3.6 through 3.11, or Python 2.7.x. Each listed version provides support for cqlsh, but DataStax recommends using Python 3.11.
For production installations, see Recommended production settings.

Download and extract the tarball

By downloading DataStax Enterprise, you agree to the DataStax MSA, including the DSE Supplement.

Open a terminal and verify that you have Java 8 installed:

java -version

Result

Here’s an example result for OpenJDK:

openjdk version "1.8.0_171"
OpenJDK Runtime Environment (build 1.8.0_171-8u171-b11-0ubuntu0.16.04.1-b11)
OpenJDK 64-Bit Server VM (build 25.171-b11, mixed mode)

Here’s an example result for Oracle Java:

java version "1.8.0_181"
Java(TM) SE Runtime Environment (build 1.8.0_181-b13)
Java HotSpot(TM) 64-Bit Server VM (build 25.181-b13, mixed mode)

Ensure the output shows Java 8. If it does not, follow the instructions in Installing the JDK.

Download and extract the tarball file manually, or use curl:
- Manual download
- curl download
Download the DSE 5.1 tarball file from the DataStax Downloads website. Refer to the DSE 5.1 release notes for the latest patch version.

Extract the tarball file into the directory where you want to install DSE 5.1.

sudo tar -xzvf dse-5.1.48-bin.tar.gz -C INSTALLATION_DIRECTORY

The files expand into a DSE_DIRECTORY that contains the version number of the release, such as dse-5.1.48.
Use curl to download the tarball file:

curl -L -O https://downloads.datastax.com/enterprise/dse-5.1.48-bin.tar.gz

Extract the tarball file into the directory where you want to install DSE 5.1.

sudo tar -xzvf dse-5.1.48-bin.tar.gz -C INSTALLATION_DIRECTORY

The files expand into a DSE_DIRECTORY that contains the name of the release, such as /dse-5.1.48.

Configure data and log directories

Use either the default DSE data and log directory locations, or define your own custom directory locations:

Default directory locations
Custom directory locations

DSE stores its runtime data and logs in default locations unless otherwise configured in cassandra.yaml. Make sure these directories exist and have the correct ownership before starting DSE.

Run the following commands to create and change ownership for the default DSE data and log directories:

sudo mkdir -p /var/lib/cassandra /var/log/cassandra

sudo chown -R cassandra:cassandra /var/lib/cassandra /var/log/cassandra

If you plan to run DSE as a different user, replace cassandra:cassandra with that user and group.

Follow these steps to store DSE data and logs in custom locations:

Create and change ownership for your custom DSE data and log directories. For example:

sudo mkdir -p \
  CUSTOM_DIRECTORY/dse-data/commitlog \
  CUSTOM_DIRECTORY/dse-data/saved_caches \
  CUSTOM_DIRECTORY/dse-data/hints \
  CUSTOM_DIRECTORY/cdc_raw

sudo chown -R cassandra:cassandra \
  CUSTOM_DIRECTORY/dse-data \
  CUSTOM_DIRECTORY/cdc_raw

Replace CUSTOM_DIRECTORY with the absolute path to your custom directory locations.

If you plan to run DSE as a different user, replace cassandra:cassandra with that user and group.

Update the following settings in the cassandra.yaml file to point to your custom directory locations:

data_file_directories:
  - CUSTOM_DIRECTORY/dse-data
commitlog_directory: CUSTOM_DIRECTORY/dse-data/commitlog
saved_caches_directory: CUSTOM_DIRECTORY/dse-data/saved_caches
hints_directory: CUSTOM_DIRECTORY/dse-data/hints
cdc_raw_directory: CUSTOM_DIRECTORY/cdc_raw

Replace CUSTOM_DIRECTORY with the absolute path to your custom directory locations.

Where is the cassandra.yaml file?

The location of the cassandra.yaml file depends on the type of installation:

Installation Type Location

Installation Type	Location
Package installations + Installer-Services installations	`/etc/dse/cassandra/cassandra.yaml`
Tarball installations + Installer-No Services installations	`<installation_location>/resources/cassandra/conf/cassandra.yaml`

Package installations + Installer-Services installations

/etc/dse/cassandra/cassandra.yaml

Tarball installations + Installer-No Services installations

<installation_location>/resources/cassandra/conf/cassandra.yaml

Optional: If you plan to use DSE Analytics, you can use either the default Spark data and log directory locations or define your own custom directory locations:

Default directory locations
Custom directory locations

When you enable DSE Analytics, the Spark components write temporary data and logs to specific directories unless otherwise configured in spark-env.sh or dse.yaml. Make sure these directories exist and have the correct ownership before starting DSE.

Run the following commands to create and change ownership for the default Spark data and log directories:

sudo mkdir -p \
  /var/lib/dsefs \
  /var/lib/spark \
  /var/lib/spark/rdd \
  /var/lib/spark/worker \
  /var/log/spark \
  /var/log/spark/master \
  /var/log/spark/alwayson_sql

sudo chown -R cassandra:cassandra /var/lib/dsefs /var/lib/spark /var/log/spark

If you plan to run DSE as a different user, replace cassandra:cassandra with that user and group.

Follow these steps to store Spark data and logs in custom locations:

Create and change ownership for your custom Spark data and log directories. For example:

sudo mkdir -p \
  CUSTOM_DIRECTORY/dsefs \
  CUSTOM_DIRECTORY/spark/rdd \
  CUSTOM_DIRECTORY/spark/worker \
  CUSTOM_DIRECTORY/spark/log/worker \
  CUSTOM_DIRECTORY/spark/log/master \
  CUSTOM_DIRECTORY/spark/log/alwayson_sql

sudo chown -R cassandra:cassandra \
  CUSTOM_DIRECTORY/dsefs \
  CUSTOM_DIRECTORY/spark

Replace CUSTOM_DIRECTORY with the absolute path to your custom directory locations.

If you plan to run DSE as a different user, replace cassandra:cassandra with that user and group.

Update the following environment variables in the spark-env.sh file to point to your custom directory locations:

export SPARK_WORKER_DIR="CUSTOM_DIRECTORY/spark/worker"
export SPARK_EXECUTOR_DIRS="CUSTOM_DIRECTORY/spark/rdd"
export SPARK_WORKER_LOG_DIR="CUSTOM_DIRECTORY/spark/log/worker"
export SPARK_MASTER_LOG_DIR="CUSTOM_DIRECTORY/spark/log/master"
export ALWAYSON_SQL_LOG_DIR="CUSTOM_DIRECTORY/spark/log/alwayson_sql"

Replace CUSTOM_DIRECTORY with the absolute path to your custom directory locations.

Where is the spark-env.sh file?

The default location of the spark-env.sh file depends on the type of installation:

Installation Type Location

Installation Type	Location
Package installations + Installer-Services installations	`/etc/dse/spark/spark-env.sh`
Tarball installations + Installer-No Services installations	`<installation_location>/resources/spark/conf/spark-env.sh`

Package installations + Installer-Services installations

/etc/dse/spark/spark-env.sh

Tarball installations + Installer-No Services installations

<installation_location>/resources/spark/conf/spark-env.sh

In the dse.yaml file, update the location of the DataStax Enterprise file system (DSEFS) work directory to point to your custom location:

dsefs_options:
  work_dir: CUSTOM_DIRECTORY/dsefs

Replace CUSTOM_DIRECTORY with the absolute path to your custom directory location.

Where is the dse.yaml file?

The location of the dse.yaml file depends on the type of installation:

Installation Type Location

Installation Type	Location
Package installations + Installer-Services installations	`/etc/dse/dse.yaml`
Tarball installations + Installer-No Services installations	`<installation_location>/resources/dse/conf/dse.yaml`

Package installations + Installer-Services installations

/etc/dse/dse.yaml

Tarball installations + Installer-No Services installations

<installation_location>/resources/dse/conf/dse.yaml

Start DSE

From the directory where you extracted the tarball files, run the following command to start DSE as a standalone process:
```
DSE_DIRECTORY/bin/dse cassandra
```
For other startup options, see Starting DataStax Enterprise as a stand-alone process.

Verify that DSE is running:

DSE_DIRECTORY/bin/nodetool status

Result

Datacenter: Cassandra
=====================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address    Load       Tokens  Owns    Host ID                               Rack
UN  127.0.0.1  82.43 KB   128     ?       40725dc8-7843-43ae-9c98-7c532b1f517e  rack1

Next steps

Change or delete the default cassandra user.
Review specific changes for DSE 5.1 in the DSE 5.1 release notes.
Connect to DSE using the cqlsh tool.
Configure startup options: service or standalone.
Configure DataStax Enterprise settings for DSE Advanced Security, In-Memory, DSE Advanced Replication, DSE Multi-Instance, DSE Tiered Storage, and more.
Review default file locations for tarball installations.
Change logging locations after installation.
Start and stop DataStax Enterprise.
Prepare DataStax Enterprise for production.
Review recommended production settings.

Install DataStax Enterprise 5.1 using the binary tarball

Prerequisites

Download and extract the tarball

Configure data and log directories

Start DSE

Next steps

Was this helpful?

Give Feedback