Getting started with DataStax Enterprise 6.8

Information about developing applications for DataStax Enterprise.

Information about developing applications for DataStax Enterprise.

This topic provides basic information and a roadmap to documentation for developers new to DataStax Enterprise (DSE).

Which product?

To help you choose which DataStax products best fit your requirements, see Products on the DataStax website. DataStax Enterprise (DSE) provides all the capabilities of Apache Cassandra® plus advanced functionality (detailed below).


Before diving into queries and coding, it is important to learn the basics of DataStax Enterprise first.

Querying data
Like any database, you need to be able to get information into and out of the database. The basic way of doing this is the Cassandra Query Language (CQL). The CQL data model is based on the queries you want to perform, unlike modeling entities and relationships used in relational databases.
Loading data
DataStax provides tools for getting data into and out of the database. Described below.
DataStax highly recommends taking 7 minutes to read Architecture in brief. This document provides basic information about key concepts and terminology for understanding the database.
Learning resources
Note: Save yourself some time and frustration by spending a few moments looking at DataStax Doc and Search tips. These short topics talk about navigation and bookmarking aids that will make your journey through the docs more efficient and productive.


Before starting development, you need to deploy a DataStax cluster. DataStax offers a variety of ways to set up a cluster. Select the method below that best suits your environment.

On premises


DataStax provides tools for getting data into and out of the database:


Getting data into and out of the database.

CQL Guide
The CQL guide provides information for querying data, command syntax, inserting and updating data, using the Cassandra Query Language (CQL), plus using and customizing the cqlsh command line utility.
Data model
A data model is a process that involves identifying the entities (items to be stored) and the relationships between entities. Data modeling in DataStax Enterprise uses a query-driven approach, in which specific queries are the key to organizing the data. In contrast, relational databases normalize data based on the tables and relationships design, and then writes the queries that will be made.
DataStax Studio
DataStax Studio an interactive developer tool for CQL (Cassandra Query Language), Spark SQL, and DSE Graph. Developers and analysts collaborate by mixing code, documentation, query results, and visualizations in self-documenting notebooks.

Connect (drivers)

How to connect client applications to the database.

Important: As of January 2020, you can use the same DataStax driver for Apache Cassandra® (OSS) and DataStax Enterprise (DSE). DataStax unified the DSE and OSS drivers to avoid confusion and enhance the OSS drivers with some of the features in the DSE drivers. For more information, see the Better Drivers for Cassandra blog. DSE drivers are supported until January 1st, 2022. However, all new features and functionality will take place only in the unified DataStax drivers.
Developing applications with DataStax drivers
The Developing applications with DataStax drivers guide contains information about best practices, connecting to DSE clusters, submitting queries with drivers, and handling client and server errors.
Driver Quickstarts
Simple CRUD application demos to quickly help you quickly get up and running with the DataStax drivers.

For details, see DataStax drivers.


In addition to driver APIs, DataStax provides the following:
DSE GraphFrames API

The DseGraphFrame framework allows you create applications that use the Spark API for analytics operations on DSE Graph.

DataStax Spark Cassandra Connector API
The DataStax Spark Cassandra Connector API lets you expose tables as Spark Resilient Distributed Datasets (RDDs), write Spark RDDs to tables, and execute arbitrary CQL queries in your Spark applications.
DSE OpsCenter API
The DSE OpsCenter API provides the ability to make RESTful requests for programmatically performs the same set of operations as the OpsCenter GUI.

Advanced Functionality

In addition to all the capabilities of Apache Cassandra, DataStax Enterprise offers the following capabilities:

DSE Analytics
Built on a production-certified version of Apache Spark, with enhanced capabilities like AlwaysOn SQL for process streaming and historical data at cloud scale.
DSE Graph
DSE Graph is optimized for storing billions of items and their relationships to enable you to identify and analyze hidden relationships between connected data and build powerful modern applications for real-time use cases: fraud detection, customer 360, social networks, IoT, and recommendation systems. The DSE Graph Quick Start is a great place to get started.
DSE Search
Provides powerful search and indexing capabilities, including support for full-text, relevancy, sub-string, and fuzzy queries over large data sets, aggregation, and geospatial matchups.
DSE OpsCenter
Provides visual management and monitoring for DataStax Enterprise, including automatic backups, reduced manual operations, automatic failover, patch release upgrades, and secure management of DSE clusters on-premises, in the cloud, or in hybrid environments that span multiple data centers.
Lifecycle Manager
A visual provisioning and monitoring tool for DataStax Enterprise clusters. LCM allows you to define the cluster configuration including datacenter, node topology, and security. LCM monitoring helps you troubleshoot installation, configuration, and upgrade jobs.
DSE Advanced Security
Provides fine-grained user and access controls to keep applications data protected and compliance with regulatory standards like PCI, SOX, HIPAA, and the European Union’s General Data Protection Regulation (GDPR).
DSE Metrics Collector
Aggregates DSE metrics and integrates with existing monitoring solutions to facilitate problem resolution and remediation.
DSE Management Services
DSE Management Services automatically handle administration and maintenance tasks and assist with overall database cluster management.
NodeSync service
Continuous background repair that virtually eliminates manual efforts to run repair operations in a DataStax cluster.
Advanced Replication
Advanced Replication allows a single cluster to have a primary hub with multiple spokes. This allows configurable, bi-directional distributed data replication to and from source and destination clusters.