Segregating workloads in a cluster

About using real-time (Cassandra), integrated Hadoop or Spark/Shark (Analytics), an external Hadoop system, or search (Solr) nodes in the same cluster.

A common question is how to use real-time (Cassandra), integrated Hadoop or Spark/Shark (Analytics), an external Hadoop system, or search (Solr) nodes in the same cluster. Within the same data center, attempting to run Solr on some nodes and real-time queries, analytics, or external Hadoop on other nodes does not work. The answer is to organize the nodes running different workloads into virtual data centers.

Replicating data across data centers

You set up replication for Solr nodes exactly as you do for other nodes in a Cassandra cluster, by creating a keyspace. You can change the replication of a keyspace after creating it.
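
As a rough illustration of the two steps just described, the following Python sketch assembles the CQL text for creating a keyspace and for later changing its replication. It only builds strings and does not connect to a cluster. The helper names (replication_map, create_keyspace, alter_keyspace), the keyspace name demo_ks, and the data center names Cassandra, Solr, and Analytics are assumptions made for the example. Use the data center names that your snitch actually reports. NetworkTopologyStrategy is assumed here because it lets you set a replica count per data center.

```python
def replication_map(dc_replicas):
    """Render a CQL replication map with one replica count per data center.

    dc_replicas maps a data center name to its replica count. The names
    passed in below are hypothetical; they must match what the snitch reports.
    """
    if not dc_replicas:
        raise ValueError("at least one data center is required")
    parts = ["'class': 'NetworkTopologyStrategy'"]
    for dc, count in dc_replicas.items():
        if not isinstance(count, int) or count < 0:
            raise ValueError(f"invalid replica count for {dc}: {count!r}")
        parts.append(f"'{dc}': {count}")
    return "{" + ", ".join(parts) + "}"


def create_keyspace(name, dc_replicas):
    """Build a CREATE KEYSPACE statement for the given data centers."""
    return f"CREATE KEYSPACE {name} WITH replication = {replication_map(dc_replicas)};"


def alter_keyspace(name, dc_replicas):
    """Build an ALTER KEYSPACE statement that changes replication later."""
    return f"ALTER KEYSPACE {name} WITH replication = {replication_map(dc_replicas)};"


if __name__ == "__main__":
    # Hypothetical layout: real-time nodes in one data center, search in another.
    print(create_keyspace("demo_ks", {"Cassandra": 3, "Solr": 2}))
    # Later, extend replication to a (hypothetical) analytics data center.
    print(alter_keyspace("demo_ks", {"Cassandra": 3, "Solr": 2, "Analytics": 1}))
```

The generated statements could then be run in cqlsh. After changing the replication of an existing keyspace, a repair is typically needed so that the new replicas receive the existing data.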