Segregating workloads in a cluster

About using real-time (Cassandra), integrated Hadoop or Spark/Shark (Analytics), an external Hadoop system, or search (Solr) nodes in the same cluster.

A common question is how to use real-time (Cassandra), integrated Hadoop or Spark/Shark (Analytics), an external Hadoop system, or search (Solr) nodes in the same cluster. Within the same data center, attempting to run Solr on some nodes and real-time queries, analytics, or external Hadoop on other nodes does not work. The answer is to organize the nodes running different workloads into virtual data centers.

The answer is to organize the nodes running different workloads into virtual data centers.

Replicating data across data centers

You set up replication for Solr nodes exactly as you do for other nodes in a Cassandra cluster, by creating a keyspace. You can change the replication of a keyspace after creating it.

DataStax, Titan, and TitanDB are registered trademark of DataStax, Inc. and its subsidiaries in the United States and/or other countries.

Apache Cassandra, Apache, Tomcat, Lucene, Solr, Hadoop, Spark, TinkerPop, and Cassandra are trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.