Configuring Apache Spark
Configuring Apache Spark™ for DataStax Enterprise (DSE) includes:
- Configuring Apache Spark nodes
-
Modify the settings for Spark nodes security, performance, and logging.
- Automatic Apache Spark Master election
-
Spark Master elections are automatically managed.
- Configuring Apache Spark logging options
-
Configure Spark logging options.
- Running Apache Spark processes as separate users
-
Spark processes can be configured to run as separate operating system users.
- Configuring the Apache Spark history server
-
Load the event logs from Spark jobs that were run with event logging enabled.
- Setting Spark Cassandra Connector-specific properties
-
Use the Spark Cassandra Connector options to configure DSE Spark.
- Creating a DSE Analytics Solo datacenter
-
DSE Analytics Solo datacenters do not store any database or search data, but are strictly used for analytics processing. They are used in conjunction with one or more datacenters that contain database data.
- Apache Spark JVMs and memory management
-
Spark jobs running on DSE are divided among several different JVM processes.