Instructions for installing DataStax Enterprise 5.0 on any supported Linux-based
platform.
Use these instructions for installing with root permissions on Linux-based platform
using the binary tarball. To install earlier
versions, see Installing DataStax Enterprise 5.0.x patch releases.
Important: DataStax Enterprise 5.0 uses Apache
Cassandra 3.0 and CQL 3.3.
When installed from the binary tarball, DataStax Enterprise runs as a stand-alone
process.
Prerequisites
Also see Recommended production settings and the
DataStax Enterprise Reference Architecture
white paper.
Hardware requirements
Requirement |
Minimum |
Production |
CPUs |
2 |
16 |
Memory |
8 GB |
24 GB |
Data directory |
20 GB |
200 GB |
Commit log directory |
20 GB |
200 GB |
Saved caches directory |
20 GB |
200 GB |
Logs directory |
20 GB |
200 GB |
Production requirements depend on the
volume of data and workload. |
Note: The latest version of DataStax Enterprise 5.0.x is 5.0.15.
Procedure
These steps install DataStax Enterprise. They do not install OpsCenter, DataStax
Studio, DataStax Agent, or DevCenter. After installing, you must configure and
start DataStax Enterprise.
Important: End User License Agreement (EULA). By downloading DataStax products, you
confirm that you agree to the processing of information as described in the DataStax
website privacy policy and agree to the website terms of use.
In a terminal window:
-
Verify that a required version of Java is installed:
If not Oracle Java 8, or OpenJDK, see Installing the JDK.
Important: Package management tools do not install Oracle Java.
-
Download and extract the DataStax Enterprise tarball using your
To install the latest version (5.0.15):
curl -L https://downloads.datastax.com/enterprise/dse-5.0.15-bin.tar.gz | tar xz
To install earlier versions, use the version number in the above command. For
example:
curl -L https://downloads.datastax.com/enterprise/dse-5.0.3-bin.tar.gz | tar xz
The latest version of DataStax Enterprise 5.0.x is 5.0.15.
The files are downloaded and extracted into the dse-5.0.15 directory.
-
To use the default data and logging directory locations, create the directories
and change ownership:
- /var/lib/cassandra
- /var/log/cassandra (includes audit directory,
debug.log, gremlin.log, solrvalidation.log, system.log)
- /var/lib/dsefs
- /var/lib/spark
- /var/log/spark
- /var/lib/spark/rdd
- /var/lib/spark/worker
sudo mkdir -p /var/lib/cassandra; sudo chown -R $USER:$GROUP /var/lib/cassandra
$ sudo mkdir -p /var/log/cassandra; sudo chown -R $USER:$GROUP /var/log/cassandra
$ sudo mkdir -p /var/lib/dsefs; sudo chown -R $USER:$GROUP /var/lib/dsefs
-
Make the directories for data and logging directories:
-
For example:
mkdir installation_location/dse-data
$ cd dse-data
$ mkdir commitlog
$ mkdir saved_caches
$ mkdir hints
-
Go the directory containing the
file:
cd install_location/resources/cassandra/conf
-
Edit the following lines in the cassandra.yaml
file:
The
location of the
cassandra.yaml file
depends on the type of installation:
Installer-Services |
/etc/dse/cassandra/cassandra.yaml |
Package installations |
/etc/dse/cassandra/cassandra.yaml |
Installer-No Services |
install_location/resources/cassandra/conf/cassandra.yaml |
Tarball installations |
install_location/resources/cassandra/conf/cassandra.yaml |
data_file_directories: install_location/dse-data
commitlog_directory: install_location/dse-data/commitlog
saved_caches_directory: install_location/dse-data/saved_caches
hints_directory: install_location/dse-data/hints
- Optional:
If using DataStax analytics, to define your own Spark directory
locations:
-
Make the directories for the Spark lib and
log directories.
-
Edit the file to match
the locations of your Spark lib and
log directories, as described in Configuring Spark nodes.
The default
location of the
spark-env.sh file depends on
the type of installation:
Installer-Services and Package
installations |
/etc/dse/spark/spark-env.sh |
Installer-No Services and Tarball
installations |
install_location/resources/spark/conf/spark-env.sh |
-
Make a directory for the DSEFS data directory and set its location in
dsefs_options.
- Optional:
Single-node cluster installations only:
-
Start DataStax Enterprise from the installation directory:
-
Verify that DataStax Enterprise is running:
If using
vnodes:
nodetool status
Datacenter: Cassandra
=====================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
-- Address Load Tokens Owns Host ID Rack
UN 127.0.0.1 82.43 KB 128 ? 40725dc8-7843-43ae-9c98-7c532b1f517e rack1
If using not using
vnodes:
nodetool status
Datacenter: Analytics
=====================
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
-- Address Load Owns Host ID Token Rack
UN 172.16.222.136 103.24 KB ? 3c1d0657-0990-4f78-a3c0-3e0c37fc3a06 1647352612226902707 rack1
What's next
The
location of the
cassandra.yaml file
depends on the type of installation:
Installer-Services |
/etc/dse/cassandra/cassandra.yaml |
Package installations |
/etc/dse/cassandra/cassandra.yaml |
Installer-No Services |
install_location/resources/cassandra/conf/cassandra.yaml |
Tarball installations |
install_location/resources/cassandra/conf/cassandra.yaml |
The location of
the
dse.yaml file depends
on the type of installation:
Installer-Services |
/etc/dse/dse.yaml |
Package installations |
/etc/dse/dse.yaml |
Installer-No Services |
install_location/resources/dse/conf/dse.yaml |
Tarball installations |
install_location/resources/dse/conf/dse.yaml |
DataStax Enterprise demos