Installing DataStax Enterprise 5.0 using the binary tarball 

Instructions for installing DataStax Enterprise 5.0 on any supported Linux-based platform.

Use these instructions for installing with root permissions on Linux-based platform using the binary tarball. To install earlier versions, see Installing DataStax Enterprise 5.0.x patch releases.

Important: DataStax Enterprise 5.0 uses Apache Cassandra 3.0 and CQL 3.3.

When installed from the binary tarball, DataStax Enterprise runs as a stand-alone process.

Prerequisites

Hardware requirements
Requirement Minimum Production
CPUs 2 16
Memory 8 GB 24 GB
Data directory 20 GB 200 GB
Commit log directory 20 GB 200 GB
Saved caches directory 20 GB 200 GB
Logs directory 20 GB 200 GB
Production requirements depend on the volume of data and workload.

Also see Recommended production settings and the DataStax Enterprise Reference Architecture white paper.

Procedure

These steps install DataStax Enterprise. They do not install OpsCenter, DataStax Studio, DataStax Agent, or DevCenter. After installing, you must configure and start DataStax Enterprise.

Note: The latest version of DataStax Enterprise is 5.0.9.

In a terminal window:

  1. Verify that a required version of Java is installed:
    $ java -version

    If not Oracle Java 8, or OpenJDK, see Installing the JDK.

    Important: Package management tools do not install Oracle Java.
  2. Download and extract the DataStax Enterprise tarball using your DataStax Academy account credentials.

    To install the latest version (5.0.9):

    $ curl --user dsa_email_address:password -L https://downloads.datastax.com/enterprise/dse.tar.gz | tar xz

    To install earlier versions, add the version number in the above command. For example:

    $ curl --user daenerys@targaryen.com:Dothraki_warlord -L https://downloads.datastax.com/enterprise/dse-5.0.3-bin.tar.gz | tar xz
    The latest version of DataStax Enterprise is 5.0.9.
    where dsa_email_address and password are your DataStax Academy account credentials.
    Attention: Depending on your environment, you might need to replace @ in your email address with %40 and escape any character in your password that is used in your operating system's command line. Examples: \! and \|.
    CAUTION:
    If you choose to run the above command, your password is retained in the shell history. To avoid this DataStax recommends using curl with the --netrc or --netrc-file option. Alternately, download the tarball from DataStax downloads.

    The DataStax Installer has options to install developer related tools: Javadoc, DataStax Enterprise demos, DataStax Studio, and the DSE Graph Loader.

    The files are downloaded and extracted into the dse-5.0.9 directory.

  3. If you want to use the default data and logging directory locations, create and change ownership as follows:
    Note: If you want to define your own directory locations, see 4 and 5.
    • /var/lib/cassandra
    • /var/log/cassandra (includes audit directory, debug.log, gremlin.log, solrvalidation.log, system.log)
    • /var/lib/dsefs
    • /var/lib/spark
    • /var/log/spark
    $ sudo mkdir -p /var/lib/cassandra; sudo chown -R  $USER:$GROUP /var/lib/cassandra
    $ sudo mkdir -p /var/log/cassandra; sudo chown -R  $USER:$GROUP /var/log/cassandra
    $ sudo mkdir -p /var/lib/dsefs; sudo chown -R  $USER:$GROUP /var/lib/dsefs
    $ sudo mkdir -p /var/lib/spark; sudo chown -R  $USER:$GROUP /var/lib/spark
    $ sudo mkdir -p /var/log/spark; sudo chown -R  $USER:$GROUP /var/log/spark
  4. Optional: To define your own data and logging directories:
    1. Make the directories for data and logging directories. For example:
      $ mkdir installation_location/dse-data
      $ cd dse-data
      $ mkdir commitlog
      $ mkdir saved_caches
      $ mkdir hints
    2. Go the directory containing the cassandra.yaml file:
      $ cd install_location/resources/cassandra/conf
    3. Edit the following lines in the cassandra.yaml file:
      The location of the cassandra.yaml file depends on the type of installation:
      Installer-Services /etc/dse/cassandra/cassandra.yaml
      Package installations /etc/dse/cassandra/cassandra.yaml
      Installer-No Services install_location/resources/cassandra/conf/cassandra.yaml
      Tarball installations install_location/resources/cassandra/conf/cassandra.yaml
      data_file_directories: install_location/dse-data
      commitlog_directory: install_location/dse-data/commitlog
      saved_caches_directory: install_location/dse-data/saved_caches
      hints_directory: install_location/dse-data/hints
  5. Optional: To define your own Spark directory locations:
    1. Make the directories for the Spark lib and log directories.
    2. Edit the spark-env.sh file to match the locations of your Spark lib and log directories, as described in Configuring Spark nodes.
      The default location of the spark-env.sh file depends on the type of installation:
      Installer-Services and Package installations /etc/dse/spark/spark-env.sh
      Installer-No Services and Tarball installations install_location/resources/spark/conf/spark-env.sh
    3. Make a directory for the DSEFS data directory and set its location in dsefs_options.

    DataStax Enterprise is ready for additional configuration.

  6. Optional: Single-node cluster installations only:
    1. Start DataStax Enterprise from the installation directory:
      $ bin/dse cassandra
      Note: For other start options, see Starting DataStax Enterprise as a stand-alone process.
    2. Verify that DataStax Enterprise is running:
      If using vnodes:
      $ nodetool status
      Datacenter: Cassandra
      =====================
      Status=Up/Down
      |/ State=Normal/Leaving/Joining/Moving
      --  Address    Load       Tokens  Owns    Host ID                               Rack
      UN  127.0.0.1  82.43 KB   128     ?       40725dc8-7843-43ae-9c98-7c532b1f517e  rack1
      If using not using vnodes:
      $ nodetool status
      Datacenter: Analytics
      =====================
      Status=Up/Down
      |/ State=Normal/Leaving/Joining/Moving
      --  Address         Load       Owns    Host ID                               Token                 Rack
      UN  172.16.222.136  103.24 KB  ?       3c1d0657-0990-4f78-a3c0-3e0c37fc3a06  1647352612226902707   rack1

What's next

The location of the cassandra.yaml file depends on the type of installation:
Installer-Services /etc/dse/cassandra/cassandra.yaml
Package installations /etc/dse/cassandra/cassandra.yaml
Installer-No Services install_location/resources/cassandra/conf/cassandra.yaml
Tarball installations install_location/resources/cassandra/conf/cassandra.yaml
The location of the dse.yaml file depends on the type of installation:
Installer-Services /etc/dse/dse.yaml
Package installations /etc/dse/dse.yaml
Installer-No Services install_location/resources/dse/conf/dse.yaml
Tarball installations install_location/resources/dse/conf/dse.yaml