Analyze data using Apache Spark

Apache Spark™ is the default mode when you start an analytics node in a packaged installation.

About Apache Spark: Information about Spark architecture and capabilities.
Use Apache Spark with DataStax Enterprise: DataStax Enterprise (DSE) integrates with Apache Spark to allow distributed analytic applications to run using database data.
Configuring Spark: Configuring Apache Spark includes setting Spark properties for DSE and the database, enabling Spark apps, and setting permissions.
Use Apache Spark modules with DataStax Enterprise: Spark Streaming, Spark SQL, and MLlib are modules that extend the capabilities of Apache Spark.
Use AlwaysOn SQL service: AlwaysOn SQL is a high availability service that responds to SQL queries from JDBC and ODBC applications.
Access DataStax Enterprise data from external Apache Spark clusters: Information on accessing data in DSE clusters from external Spark clusters, or Bring Your Own Spark (BYOS).
Use the Apache Spark Jobserver: DSE includes Spark Jobserver, a REST interface for submitting and managing Spark jobs.
DSE Spark Connector API documentation

Give Feedback