Installing a DataStax Enterprise cluster on Amazon EC2
Alternatively, you can launch a single node from the DataStax AMI and then use OpsCenter to create the cluster.
The DataStax AMI does the following:
- Installs the latest version of DataStax Enterprise on an Ubuntu 12.04 LTS (Precise Pangolin) image (Ubuntu Cloud 20140227 release) with kernel 3.8+.
- Installs Oracle Java 7.
- Installs metrics and utility tools such as dstat, ethtool, make, gcc, and s3cmd.
- Uses RAID0 ephemeral disks for data storage and commit logs.
- Provides a choice of virtualization types: PV (paravirtualization) or HVM (hardware-assisted virtual machine).
- Launches EBS-backed instances for faster start-up; EBS is not used for database storage.
- Uses the private interface for intra-cluster communication.
- Starts the nodes in the specified type: Cassandra (transactional), DSE Analytics, or DSE Search.
- Sets the seed nodes cluster-wide.
- Installs DataStax OpsCenter on the first node in the cluster (by default).
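The networking items in the list above (private interface for intra-cluster traffic, a cluster-wide seed list) correspond to the `listen_address` and `seed_provider` settings in `cassandra.yaml`. The following is a minimal sketch of how such a fragment could be generated; it is not the AMI's own provisioning code. The function name `render_network_settings` and the IP addresses are hypothetical and used only for illustration.

```python
def render_network_settings(private_ip, seed_ips):
    """Return cassandra.yaml-style text for the listen address and seed list.

    private_ip and seed_ips are illustrative inputs, not values from the AMI.
    """
    # Cassandra expects the seeds as a single comma-separated string.
    seeds = ",".join(seed_ips)
    lines = [
        f"listen_address: {private_ip}",
        "seed_provider:",
        "    - class_name: org.apache.cassandra.locator.SimpleSeedProvider",
        "      parameters:",
        f'          - seeds: "{seeds}"',
    ]
    return "\n".join(lines)


if __name__ == "__main__":
    # Hypothetical private addresses for a two-seed cluster.
    print(render_network_settings("10.0.0.11", ["10.0.0.11", "10.0.0.12"]))
```

Every node would carry the same seed list but its own private address as `listen_address`.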
The DataStax AMI is intended for a single region and availability zone. When creating an EC2 cluster that spans multiple regions and availability zones, use OpsCenter to set up your cluster instead. You can use any of the supported platforms. It is best practice to use the same platform on all nodes. If your cluster was instantiated using the DataStax AMI, use Ubuntu for the additional nodes. The following topics describe OpsCenter provisioning:
For production Cassandra clusters on EC2, see Production deployment planning. Stripe the ephemeral disks as RAID0, and put both the data directory and the commit log on that volume. In practice, this has proved better than putting the commit log on the root volume, which is also a shared resource. For more data redundancy, consider deploying your Cassandra cluster across multiple availability zones or using OpsCenter to back up to S3. Also see .
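The layout recommendation above can be checked mechanically. The sketch below assumes the RAID0 volume is mounted at `/raid0` (an assumption, adjust to your mount point) and verifies that the configured data directories and commit log directory all sit beneath it. The helper names `on_volume` and `check_layout` are hypothetical; the check is a pure path comparison and does not touch the filesystem.

```python
from pathlib import PurePosixPath


def on_volume(path, mount_point):
    """True if path is the mount point itself or lies beneath it."""
    p = PurePosixPath(path)
    m = PurePosixPath(mount_point)
    return p == m or m in p.parents


def check_layout(data_dirs, commitlog_dir, mount_point="/raid0"):
    """Return the directories that are NOT on the given volume.

    mount_point defaults to /raid0, which is an assumed location.
    """
    candidates = list(data_dirs) + [commitlog_dir]
    return [d for d in candidates if not on_volume(d, mount_point)]


if __name__ == "__main__":
    # Commit log left on the root volume: flagged by the check.
    print(check_layout(["/raid0/cassandra/data"],
                       "/var/lib/cassandra/commitlog"))
```

An empty result means both the data directory and the commit log are on the striped volume.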