Using Apache Spark™ modules with DataStax Enterprise

Getting started with Spark Streaming

Spark Streaming lets you consume live data streams from sources such as Akka™, Kafka™, and Twitter™. Spark applications can then analyze the streamed data and store the results in the database. The getting-started example uses Scala.
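As a rough illustration of the consume-then-analyze flow described above, the following minimal Scala sketch counts words arriving on a stream. It is not the DataStax example itself. The socket source on `localhost:9999`, the one-second batch interval, the `local[2]` master, and the object name `StreamingWordCountSketch` are all assumptions chosen to keep the sketch small. A real application would more likely read from a source such as Kafka and write the results to a database table rather than print them.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

// Hypothetical object name; not taken from the DataStax documentation.
object StreamingWordCountSketch {
  def main(args: Array[String]): Unit = {
    // Assumption: local mode with two threads (one receiver, one processor).
    val conf = new SparkConf()
      .setAppName("StreamingWordCountSketch")
      .setMaster("local[2]")

    // Assumption: group incoming data into one-second batches.
    val ssc = new StreamingContext(conf, Seconds(1))

    // Assumption: a plain text socket stands in for a live source.
    val lines = ssc.socketTextStream("localhost", 9999)

    // Analyze each batch: split lines into words and count occurrences.
    val counts = lines
      .flatMap(_.split("\\s+"))
      .filter(_.nonEmpty)
      .map(word => (word, 1))
      .reduceByKey(_ + _)

    // Print the first few counts of each batch instead of storing them.
    counts.print()

    ssc.start()            // begin receiving and processing data
    ssc.awaitTermination() // run until the application is stopped
  }
}
```

To try the sketch, something must be listening on the assumed port before the application starts, for example a netcat session started with `nc -lk 9999`.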

Using Spark SQL to query data

Spark SQL allows you to execute Spark queries using a variation of the SQL language.
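The idea can be sketched in Scala as follows. This is a minimal sketch under stated assumptions, not the DataStax procedure: the in-memory `people` data, the view name, the `local[*]` master, and the object name `SparkSqlSketch` are hypothetical, and a DSE deployment would normally query existing database tables instead of a temporary view.

```scala
import org.apache.spark.sql.SparkSession

// Hypothetical object name; not taken from the DataStax documentation.
object SparkSqlSketch {
  def main(args: Array[String]): Unit = {
    // Assumption: local mode, so the sketch runs without a cluster.
    val spark = SparkSession.builder()
      .appName("SparkSqlSketch")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Assumption: a small in-memory data set stands in for a database table.
    val people = Seq(("Alice", 34), ("Bob", 19), ("Carol", 27)).toDF("name", "age")

    // Register the DataFrame under a name that SQL statements can refer to.
    people.createOrReplaceTempView("people")

    // Run a SQL query; the result is itself a DataFrame.
    val adults = spark.sql(
      "SELECT name, age FROM people WHERE age >= 21 ORDER BY name")
    adults.show()

    spark.stop()
  }
}
```

Because `spark.sql` returns a DataFrame, the result of a query can be filtered, joined, or saved with the same API as any other DataFrame.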

Using Apache SparkR™ with DataStax Enterprise

Apache SparkR is an R front end to Spark for creating analytics applications. DataStax Enterprise integrates SparkR so that you can create data frames from DSE data.

© 2024 DataStax | Privacy policy | Terms of use

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries. Kubernetes is the registered trademark of the Linux Foundation.
