BYOH Prerequisites and installation
Configure BYOH datacenters to isolate workloads.
Install DataStax Enterprise on all nodes: the nodes in the Hadoop cluster and the additional nodes outside the Hadoop cluster. Configure the additional nodes in one or more BYOH datacenters to isolate workloads. In a BYOH datacenter, run sequential data loads, not random OLTP loads or Solr data loads.
Use separate datacenters to deploy mixed workloads. Within the same datacenter, do not mix nodes that run the DSE Hadoop integrated Job Tracker and Task Trackers with nodes that run external Hadoop services. In BYOH mode, run external Hadoop services on the same nodes as Cassandra. Although you can enable CFS on these Cassandra nodes as a startup option, using CFS as a primary data store is not recommended.
Prerequisites
- Installation of a functioning CDH or HDP Hadoop cluster.
- Installation and configuration of these master services on the Hadoop cluster:
  - Job Tracker or Resource Manager (required)
  - HDFS Name Node (required)
  - Secondary Name Node or High Availability Name Nodes (required)
  - At least one set of HDFS Data Nodes (required externally)
The BYOH nodes must be able to communicate with the HDFS Data Node that is located outside the BYOH datacenter.
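One way to confirm this connectivity before installing is to run a plain TCP check from each BYOH node. The following is a minimal sketch, not part of the BYOH tooling: the function name can_reach is hypothetical, and the host and port you pass in are assumptions that depend on your cluster. Port 50010 is the default Data Node data transfer port (dfs.datanode.address) in Hadoop 1.x and 2.x, but your distribution may be configured differently.

```python
import socket


def can_reach(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration; it only proves that the port
    accepts connections, not that HDFS itself is healthy.
    """
    try:
        # create_connection resolves the host name and opens a TCP socket.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution errors.
        return False
```

For example, calling can_reach("datanode1.example.com", 50010) from a BYOH node (with a placeholder host name replaced by your own) should return True if the Data Node is reachable on that port.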
During the installation procedure, you install only the required Hadoop components in the BYOH datacenter: Task Trackers/Node Managers and optional clients, MapReduce, Hive, and Pig. Install Hadoop on the same paths on all nodes. The CLASSPATH variables that BYOH uses must work on all nodes.
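Because the same CLASSPATH has to resolve on every node, it can help to check each entry on each node after installation. The sketch below is an illustration under stated assumptions, not a DataStax utility: the function name missing_classpath_entries is hypothetical, and it treats an entry ending in * as a Java-style directory wildcard that is satisfied when the directory exists.

```python
import os
from typing import List


def missing_classpath_entries(classpath: str) -> List[str]:
    """Return the CLASSPATH entries that do not exist on this node.

    Hypothetical helper for illustration only.
    """
    missing = []
    for entry in classpath.split(os.pathsep):
        if not entry:
            continue  # skip empty segments, such as a trailing separator
        if entry.endswith("*"):
            # Java-style wildcard (dir/*): require the directory to exist.
            found = os.path.isdir(os.path.dirname(entry))
        else:
            found = os.path.exists(entry)
        if not found:
            missing.append(entry)
    return missing
```

Running missing_classpath_entries(os.environ.get("CLASSPATH", "")) on each node and comparing the results is one way to spot nodes where Hadoop was installed on a different path; an empty list means every entry resolved on that node.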