Using the DataStax Installer to install using command line or properties file commands

Instructions for installing DataStax Enterprise 5.0 using the DataStax Installer using command line or properties file commands.

Instructions for installing DataStax Enterprise using the DataStax Installer using command line or properties file commands.

Important: DataStax Enterprise 5.0 uses Apache Cassandra 3.0 and CQL 3.3.

Prerequisites

Also see Recommended production settings and the DataStax Enterprise Reference Architecture white paper.

Hardware requirements
Requirement Minimum Production
CPUs 2 16
Memory 8 GB 24 GB
Data directory 20 GB 200 GB
Commit log directory 20 GB 200 GB
Saved caches directory 20 GB 200 GB
Logs directory 20 GB 200 GB
Production requirements depend on the volume of data and workload.

About the installer 

The DataStax installer installs DataStax Enterprise and DataStax Studio if developer_install is specified. It does not install OpsCenter or DevCenter. See Installing OpsCenter or Installing DevCenter.

You can use this installer to set cassandra.yaml and dse.yaml properties with the following template files:
  • cassandra_yaml_template
  • dse_yaml_template
Note: The latest version of DataStax Enterprise 5.0.x is 5.0.15.

The DataStax Enterprise download page provides a sample_install_5.0.prop file.

Procedure

Important: End User License Agreement (EULA). By downloading DataStax products, you confirm that you agree to the processing of information as described in the DataStax website privacy policy and agree to the website terms of use.

In a terminal window:

  1. Download the installer for your computer from the DataStax download page or use the following command:
    curl -O https://downloads.datastax.com/enterprise/DataStaxEnterprise-5.0.15-linux-x64-installer.run
    Note: To install earlier versions, replace the version number in the above command. To view the available versions, see the Release Notes.
  2. From the directory where you downloaded the install file, change the permission to executable:
    chmod +x DataStaxEnterprise-5.0.15-linux-x64-installer.run
  3. You can either use the command line or a properties file:
    • Command line:
      $ sudo ./DataStaxEnterprise-5.0.15-linux-x64-installer.run --option argument --option argument ... --option argument --mode unattended
      For available options, see the table below. Be sure to add "--" to the option. For example:
      $ sudo ./DataStaxEnterprise-5.0.15-linux-x64-installer.run --enable_vnodes 0 --mode unattended

      The installer uses the default value for any unspecified option.

    • Properties file:
      $ sudo ./DataStaxEnterprise-5.0.15-linux-x64-installer.run --optionfile option_file_name --mode unattended
      where option_file_name is the name of the file containing the installation options. For example:
      $ sudo ./DataStaxEnterprise-5.0.15-linux-x64-installer.run --optionfile ../datastax/DC4-analytics.prop --mode unattended
      Properties file format: option=argument. For example:
      install_type=simple
      update_system=1

      The properties file options are the same as the command line options without the --.

      Custom installation location example:
      sudo ./DataStaxEnterprise-5.0.15-linux-x64-installer.run --optionfile ../datastax/custom_location.prop --mode unattended
      system_install=no_services
      prefix=/home/dse
    Unattended install options
    Option Argument Description
    Install options

    prefix
    (No services installations only)

    install_location

    Set installation location available only for No Services installations.

    For upgrade installations, No Services installations use the previous installation location.

    cassandra_yaml_template file_name Template for cassandra.yaml file.
    dse_yaml_template file_name Template for dse.yaml file.
    logs_dir log_location Set the log directory for Cassandra, Spark, Hadoop, Pig, and Hive data.
    developer_install developer_install Install demos and Javadoc.
    do_drain 0 (no) or 1 (yes) Drain existing node before installing. Default: 1
    start_services 0 (no) or 1 (yes) Start services after installation. Default: 1
    update_system 0 (no) or 1 (yes) Update some system packages and dependencies. Does not upgrade or install major components such as Java. Default: 1 for root user, 0 for others.
    Set up node type options
    install_type simple or advanced Set install type:
    • simple - installs DataStax Enterprise using the default path names and options.
    • advanced - configure path names and options. See Install Type.
    Default: simple
    system_install Use one of the following:
    • no_services
    • services_and_utilities
    Set up DataStax Enterprise as a service or standalone process. If installed as a service, installs DataStax Enterprise in system locations. Default: services_and_utilities for root user, no_services for others.

    For Services installations, the installer puts DataStax Enterprise in system locations and /usr/share/dse.

    For upgrade installations, No Services installations use the previous installation location. For Services installations, the installer puts DataStax Enterprise in system locations and /usr/share/dse.

    enable_analytics 0 (no) or 1 (yes) Enable or disable DSE Analytics. Default: 0
    analytics_type analytics_type:
    Set analytics type:
    • spark_only - only enable Spark (Default).
    • spark_integrated - enable Spark + Integrated Hadoop (DSE Hadoop).
    enable_search 0 (no) or 1 (yes) DSE Search (Solr) simplifies using search applications for data that is stored in a Cassandra database. Default: 0
    enable_graph 0 (no) or 1 (yes) DSE Graph is a graph database for managing, analyzing, and searching highly-connected data. Default: 0
    enable_advrepl 0 (no) or 1 (yes) DSE Advanced Replication supports one-way, distributed-data replication from remote clusters that might experience periods of network or internet downtime. Default: 0
    opscenter_address opscenter_address Launches OpsCenter in your default browser.
    Preflight check options

    run_pfc
    (Services installations only)

    0 (no) or 1 (yes) Preflight tool is a collection of tests that can be run on a node to detect and fix a configuration. The tool canPre-flight detect and fix many invalid or suboptimal configuration settings. Not available in No Services installations.

    pfc_fix_issues
    (Services installations only)

    0 (no) or 1 (yes) Attempt to fix invalid or suboptimal configuration settings. Not available in No Services installations.

    pfc_ssd
    (Services installations only)

    paths_to_SSDs Paths to the Solid State Drives. Separate the drives by a comma. Not available in No Services installations.

    pfc_devices
    (Services installations only)

    paths_to_hard_drives Paths to your hard drives. Separate the drives by a comma. Not available in No Services installations.

    pfc_disk_duration
    (Services installations only)

    Threads Per Disk Benchmark Set to simulate a normal load. Not available in No Services installations.

    pfc_disk_threads
    (Services installations only)

    Set to simulate a normal load. Not available in No Services installations.
    User and user group options

    cassandra_user

    user_name User name for running service. Because the DataStax Agent relies on user cassandra, DataStax does not recommend changing the default. If changed, you must manually install, update, and configure the DataStax Agent.
    cassandra_group group_name Group name for running service. See above.
    cassandra.yaml options

    These values override options set in the cassandra_yaml_template.

    cassandra_commitlog_dir directory commitlog_dir
    cassandra_data_dir directory data_file_directories
    cassandra_hints_dir directory hints_directory
    cassandra_saved_caches_dir directory saved_caches_directory
    cassandra_hints_dir directory hints_directory
    enable_vnodes 0 (no) or 1 (yes) Enable or disable virtual nodes (vnodes). Default: 1 for Cassandra nodes, 0 for others.
    interface IP_address Default interface to use for listening on all services.
    listen_address IP_address listen_address
    num_tokens number_of_tokens num_token
    ring_name name Name of ring.
    rpc_address IP_address rpc_address
    rpc_port port_number rpc_port
    seeds Comma separated list of seed IP_addresses Seed list for this node.

    Do not make all nodes seed nodes. See Internode communications (gossip).

    ssl_storage_port port_number ssl_storage_port
    storage_port port_number storage_port
    The location of the dse.yaml file depends on the type of installation:
    Installer-Services /etc/dse/dse.yaml
    Package installations /etc/dse/dse.yaml
    Installer-No Services install_location/resources/dse/conf/dse.yaml
    Tarball installations install_location/resources/dse/conf/dse.yaml
    The location of the cassandra.yaml file depends on the type of installation:
    Installer-Services /etc/dse/cassandra/cassandra.yaml
    Package installations /etc/dse/cassandra/cassandra.yaml
    Installer-No Services install_location/resources/cassandra/conf/cassandra.yaml
    Tarball installations install_location/resources/cassandra/conf/cassandra.yaml

Results

DataStax Enterprise is ready for additional configuration.

What's next

The location of the cassandra.yaml file depends on the type of installation:
Installer-Services /etc/dse/cassandra/cassandra.yaml
Package installations /etc/dse/cassandra/cassandra.yaml
Installer-No Services install_location/resources/cassandra/conf/cassandra.yaml
Tarball installations install_location/resources/cassandra/conf/cassandra.yaml
The location of the dse.yaml file depends on the type of installation:
Installer-Services /etc/dse/dse.yaml
Package installations /etc/dse/dse.yaml
Installer-No Services install_location/resources/dse/conf/dse.yaml
Tarball installations install_location/resources/dse/conf/dse.yaml