DataStax Enterprise (DSE) integrates real-time and batch operational analytics capabilities with an enhanced version of Apache Spark™. With DSE Analytics you can easily generate ad-hoc reports, target customers with personalization, and process real-time streams of data. The analytics toolset lets you write code once and then use it for both real-time and batch workloads.
- About DSE Analytics
Use DSE Analytics to analyze huge databases. DSE Analytics includes integration with Apache Spark.
- Setting the replication factor for analytics keyspaces
Guidelines and steps to set the replication factor for keyspaces on DSE Analytics nodes.
- DSE Analytics and Search integration
DSE SearchAnalytics clusters can use DSE Search queries within DSE Analytics jobs.
- About DSE Analytics Solo
DSE Analytics Solo datacenters provide analytics processing with Spark and distributed storage using DSEFS without storing transactional database data.
- Analyzing data using Spark
Spark is the default mode when you start an analytics node in a packaged installation. Spark runs locally on each node.
- DSEFS (DataStax Enterprise file system)
DSEFS (DataStax Enterprise file system) is the default distributed file system on DSE Analytics nodes.
- About the Cassandra File System (CFS) - deprecated
Analytics jobs often require a distributed file system. DataStax Enterprise provides a replacement for the Hadoop Distributed File System (HDFS) called the Cassandra File System (CFS).