Capacity planning
Use a discovery process to develop a plan to ensure sufficient memory resources.
Using DSE Search/Solr is memory-intensive. This discovery process is intended to help you, the DSE Search/Solr administrator, develop a plan for having sufficient memory resources to meet the needs of your users.
Overview
First, you estimate how large your Solr index will grow by indexing a number of documents on a single node, executing typical user queries, and then examining the field cache memory usage for heap allocation. Repeat this process using a greater number of documents until you get a feel for the size of the index for the maximum number of documents that a single node can handle. You can then determine how many servers to deploy for a cluster and the optimal heap size. The index should be stored on SSDs or should fit in the system IO cache.
- Optimal heap size per node
- Good estimate about the number of nodes needed for your application.
The replication factor can be increased for more queries per second.
Prerequisites
You need to have the following hardware and data:
- GB of RAM, size to be determined during capacity planning
- SSD or spinning disk
- N documents indexed on a single test node
- A complete set of sample queries to be executed
- The total number of documents the system should support