Get started with DataStax Enterprise 6.8

DataStax Enterprise (DSE) provides all the capabilities of Apache Cassandra® plus advanced functionality.

Learn

Before diving into administration tasks, learn a few basics. Doing so can save considerable time when you set up and operate DSE in a production environment:

Differences between Cassandra, DSE, and relational databases

Cassandra and DSE databases are very different from relational databases: the data model is based on the queries you need to run, not on modeling entities and relationships. DataStax highly recommends that you read Architecture in brief, which covers the key concepts and terminology for understanding the database.
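
To make the query-first idea concrete, here is a minimal sketch in plain Python. It is only an analogy and does not use any DSE API: the dictionaries stand in for tables, the dictionary keys stand in for partition keys, and the sample data and all names (EVENTS, build_query_tables, videos_by_user, users_by_video) are hypothetical. The same events are written twice, once per query that needs answering, instead of being normalized into entities and joined at read time.

```python
from collections import defaultdict

# Hypothetical sample data: each event is (user_id, day, video_id).
EVENTS = [
    ("alice", "2024-01-01", "v1"),
    ("bob", "2024-01-01", "v1"),
    ("alice", "2024-01-02", "v2"),
]


def build_query_tables(events):
    """Store each event once per query, keyed by that query's lookup key."""
    videos_by_user = defaultdict(list)  # answers: which videos did a user watch?
    users_by_video = defaultdict(list)  # answers: which users watched a video?
    for user_id, day, video_id in events:
        videos_by_user[user_id].append((day, video_id))
        users_by_video[video_id].append((day, user_id))
    return videos_by_user, users_by_video


videos_by_user, users_by_video = build_query_tables(EVENTS)
print(videos_by_user["alice"])
print(users_by_video["v1"])
```

Each lookup reads a single key and needs no join. The trade-off is duplicated data on write, which is the usual cost of designing tables around queries.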

DSE OpsCenter and Lifecycle Manager (LCM)

DSE OpsCenter and LCM automate and simplify many administrative tasks.

Learning resources

DataStax sample code and examples

While not specific to administrators, these topics provide more database details:

Cassandra Query Language (CQL) is the query language for DSE.
Cassandra drivers are available in several programming languages to connect client applications to your DSE databases.
APIs are available to interface with DSE OpsCenter, DseGraphFrame, Apache Cassandra Spark Connector, and Cassandra drivers.

Plan

The Planning and testing guide contains guidelines for capacity planning and hardware selection in production environments.

Install

DataStax offers a variety of ways to set up a cluster:

Cloud

On premises

Use Mission Control to provision and deploy a DSE cluster
Use DSE OpsCenter LCM to provision and deploy a DSE cluster
Packages for Yum- and Debian-based platforms
Docker images
Binary tarball
Deployment per workload type

For help with choosing an install type, see Choose an installation method.

Secure

DSE Advanced Security provides detailed user access controls to keep application data protected and compliant with regulatory standards like PCI, SOX, HIPAA, and the European Union’s General Data Protection Regulation (GDPR). Key topics include:

The DSE database includes the default role cassandra with the password cassandra. This is a superuser login with full access to the database. DataStax recommends using the cassandra role only once, during the initial Role Based Access Control (RBAC) setup, to establish your own root account, and then disabling the cassandra role. See Adding a superuser login.
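
The two steps in that recommendation can be sketched as a small Python helper that builds the CQL statements involved. This is an illustrative sketch, not the documented procedure: the role name dse_admin, the passwords, and the helper names (cql_string, rbac_bootstrap_statements) are assumptions, and Adding a superuser login remains the authoritative reference. The helper only produces text; you would run the statements yourself, for example in cqlsh.

```python
def cql_string(value):
    """Quote a Python string as a CQL string literal (single quotes are doubled)."""
    return "'" + value.replace("'", "''") + "'"


def rbac_bootstrap_statements(new_role, new_password, scrambled_password):
    """Return CQL that creates a new superuser and then disables the default role.

    new_role is a hypothetical role name and must be a plain CQL identifier.
    """
    # Step 1: run while logged in as the default cassandra role.
    create_root = (
        f"CREATE ROLE {new_role} WITH SUPERUSER = true AND LOGIN = true "
        f"AND PASSWORD = {cql_string(new_password)};"
    )
    # Step 2: run after logging in again as the new superuser, because a
    # role is not allowed to change its own superuser status.
    disable_default = (
        "ALTER ROLE cassandra WITH SUPERUSER = false AND LOGIN = false "
        f"AND PASSWORD = {cql_string(scrambled_password)};"
    )
    return [create_root, disable_default]


for statement in rbac_bootstrap_statements("dse_admin", "it's-secret", "long-random-value"):
    print(statement)
```

Generating the statements instead of executing them keeps the sketch independent of any driver or running cluster. Changing the default password in the second statement is an extra precaution in case the cassandra login is ever re-enabled.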

Tune

Important topics for optimizing the performance of the database include:

Recommended production settings.
Tuning the Java Virtual Machine.
Enabling the NodeSync Service, which provides continuous background repair.
Load test your cluster before deployment.

Operations

The most commonly used operations include:

Starting and stopping DSE per workload type.
Backup and recovery.
Adding or removing nodes, datacenters, or clusters.
Moving a node from one rack to another.

Load data

These are the primary tools for moving data into and out of the database:

CQL statements
DataStax Bulk Loader (DSBulk)
Streaming tools, such as the DataStax Apache Kafka Connector
Data migration tools

Advanced functionality

DSE Analytics: Built on a production-certified version of Apache Spark™, with enhanced capabilities like AlwaysOn SQL for processing streaming and historical data at cloud scale.
DSE Graph: DSE Graph is optimized for storing billions of items and their relationships. This enables you to identify and analyze hidden relationships between connected data and build powerful modern applications for real-time use cases: fraud detection, customer 360, social networks, IoT, and recommendation systems. The DSE Graph Quick Start is a great place to get started.
DSE Search: Provides powerful search and indexing capabilities, including support for full-text, relevancy, sub-string, and fuzzy queries over large datasets, aggregation, and geospatial matchups.
DSE OpsCenter: Provides visual management and monitoring for DSE, including automatic backups, reduced manual operations, automatic failover, patch release upgrades, and secure management of DSE clusters on-premises, in the cloud, or in hybrid environments that span multiple datacenters.
LCM: A visual provisioning and monitoring tool for DSE clusters. LCM allows you to define the cluster configuration including datacenter, node topology, and security. LCM monitoring helps you troubleshoot installation, configuration, and upgrade jobs.
DSE Advanced Security: Provides fine-grained user and access controls to keep application data protected and compliant with regulatory standards like PCI, SOX, HIPAA, and the European Union’s General Data Protection Regulation (GDPR).
DSE Metrics Collector: Aggregates DSE metrics and integrates with existing monitoring solutions to facilitate problem resolution and remediation.
DSE Management Services: DSE Management Services automatically handle administration and maintenance tasks and assist with overall database cluster management.
NodeSync Service: Continuous background repair that virtually eliminates manual efforts to run repair operations in a DataStax cluster.
Advanced Replication: Advanced Replication allows a single cluster to have a primary hub with multiple spokes. This allows configurable, bi-directional distributed data replication to and from source and destination clusters.
