Analyze data using Apache Spark
Apache Spark™ is the default mode when you start an analytics node in a packaged installation.
- About Apache Spark
-
Information about Spark architecture and capabilities.
- Use Apache Spark with DataStax Enterprise
-
DataStax Enterprise (DSE) integrates with Apache Spark to allow distributed analytic applications to run using database data.
- Configure Apache Spark
-
Configuring Apache Spark includes setting Spark properties for DSE and the database, enabling Spark apps, and setting permissions.
- Use Apache Spark modules with DataStax Enterprise
-
Spark Streaming, Spark SQL, and MLlib are modules that extend the capabilities of Apache Spark.
- Use AlwaysOn SQL service
-
AlwaysOn SQL is a high availability service that responds to SQL queries from JDBC and ODBC applications.
- Access DataStax Enterprise data from external Apache Spark clusters
-
Information on accessing data in DSE clusters from external Spark clusters, or Bring Your Own Spark (BYOS).
- Use the Apache Spark Jobserver
-
DSE includes Spark Jobserver, a REST interface for submitting and managing Spark jobs.
- DSE Spark Connector API documentation