About DataStax Enterprise 5.0

DataStax Enterprise is a database platform built on proven open source technology that delivers the functionality, tooling, certification, and support required by today’s enterprises to power cloud applications.

DataStax Enterprise is a database platform built on proven open source technology that delivers the functionality, tooling, certification, and support required by today’s enterprises to power cloud applications.

How to use the DataStax Enterprise documentation

To ensure that you get the best experience in using this document, take a moment to look at the Tips for using DataStax documentation.

The landing pages provide information about supported platforms, product compatibility, planning and testing cluster deployments, recommended production settings, troubleshooting, third-party software, resources for additional information, administrator and developer topics, and earlier documentation.

Overview of DataStax Enterprise 5.0

DataStax Enterprise is an integrated always-on multi-model database system with real-time and batch analytics using Apache Spark, in-memory technology, continuously available search, and graph database computing. With advanced tools for development and production system operations, flexible features such as tiered storage meet short- and long-term data access, multi-tenancy to run several database clusters within the same system, and advanced security meet enterprise requirements.

How does DataStax Enterprise work?

DataStax provides enterprise-ready products and features for DataStax Enterprise:

DataStax Enterprises uses certified Apache Cassandra™ 3.0 The DataStax Enterprise secure, operationally simple platform is built on a production-certified DataStax Distribution of Apache Cassandra™. Cassandra is a massively scalable open source NoSQL database. DataStax provides documentation for Cassandra.
Uses Cassandra Query Language (CQL) to communicate with the database The Cassandra Query Language (CQL 3.3) is the primary language for communicating with the Cassandra database. The most basic way to interact with Cassandra is using the CQL shell, cqlsh. Use cqlsh to create, insert, and query keyspaces and tables.
DataStax DevCenter and DataStax Studio provide visual IDE tools The DataStax DevCenter tool provides a visual schema and query integrated development environment (IDE) that runs CQL statements against the Cassandra database and DataStax Enterprise. DataStax Studio makes graph development and administration a visual experience.
Application writing is simple with DataStax drivers DataStax drivers are enhanced to ease the development of applications powered by DataStax Enterprise. They provide full compatibility with Apache Cassandra 3.0 and support for the new functionality introduced in DataStax Enterprise 5.0, including DSE Graph, unified authentication, and geospatial types.
Easy operational maintenance with OpsCenter The DataStax OpsCenter visual management and monitoring solution for DataStax Enterprise simplifies administration tasks and ensures optimal cluster health.
DSE Analytics integrates Apache Spark™ DSE Analytics provides real-time, streaming, and batch analytics with built-in integration with Apache Spark, a distributed, parallel data processing engine.
DSE Search integrates Apache Solr™ DSE Search uses Apache Solr to provide continuously available search.
What is the DataStax Enterprise architecture?
DataStax Enterprise is powered by the Cassandra database. Cassandra is designed to handle big data workloads across multiple nodes with no single point of failure. See Cassandra architecture in brief.
What is Apache Cassandra and how is DataStax Enterprise different?
Apache Cassandra is an open source database project that rapidly advances the state of the art for distributed databases. DataStax Enterprise uses a production-certified version of Apache Cassandra to ensure enterprise stability and performance. DataStax Enterprise incorporates analytics with DSE Analytics, search with DSE Search, and enterprise security.
What is Apache Spark and how is DSE Analytics different?
Apache Spark is an open source analytics project that provides a fast and general engine for large-scale data processing. DataStax Enterprise integrates Apache Spark real-time and batch analytics processing to more easily manage both database and analytics with a single operational system.
Apache Solr is an open source search project that produces a highly reliable, scalable, and distributed search system that provides search for databases. DSE Search integrates Solr to manage search indexes with a persistent store. DSE Search provide enterprises with the ability to perform text search and text analysis.
How is DataStax Enterprise different from relational databases?
DataStax Enterprise is a distributed and highly available database that uses peer-to-peer communication. Data modeling in DataStax Enterprise is similar to relational databases while differing in key areas to provide blazingly fast interaction. Relational databases use joins between tables for relationships. DataStax Enterprise uses denormalization to achieve more robust querying.
What is NoSQL?
The NoSQL term originally referred to a new generation of databases that shunned SQL for other interfaces. The term NoSQL is now a catch-all term for post-relational "not-only SQL" databases that use a method of storage different from a relational, or SQL, database. There are many different types of NoSQL databases, so a direct comparison of even the most used types is not useful.
How do I interact with DataStax Enterprise?
The Cassandra Query Language (CQL) is used to create, update, alter, and delete data in DataStax Enterprise using the CQL shell cqlsh. The Spark Cassandra Connector provides Cassandra integration for Spark and access to the Spark shell. There is also SQL access with Spark's ODBC/JDBC support.
How do I move data to and from DataStax Enterprise?
DataStax Enterprise has several solutions such as copying CSV files and using a Cassandra-specific bulk loader. See Migrating data to DataStax Enterprise.
What tools are included with DataStax Enterprise?
What drivers work with DataStax Enterprise?
DataStax drivers come in two types: DataStax drivers for Apache Cassandra™ and DataStax drivers for DataStax Enterprise 5.0 and later. The DataStax 5.0 drivers are enhanced to ease the development of applications powered by DataStax Enterprise. These drivers support the functionality for DataStax Enterprise 5.0, including DSE Graph, unified authentication, and geospatial types. DataStax drivers for DataStax Enterprise 5.0 support many languages: Java, Python, node.js, C/C++, C#, and Ruby.
What kind of hardware do I need to run DataStax Enterprise?
See Planning and testing cluster deployments for hardware requirements. Commodity hardware in-house or cloud instances are appropriate for building highly scalable DataStax Enterprise clusters. The distributed nature of DataStax Enterprise can actively utilize multiple datacenters across several geographic regions, supporting highly available data under even the most trying circumstances.
How do I install DataStax Enterprise?
You can install DataStax Enterprise in several ways, depending on the purpose of the installation, the type of operating system, and the available permissions.
How do I configure DataStax Enterprise?
DataStax Enterprise installation also installs the production-certified version of the Cassandra database. Configure Cassandra-specific options and then configure DataStax Enterprise.
How do I upgrade DataStax Enterprise?
DataStax provides instructions for upgrading Datastax Enterprise in the DataStax Upgrade Guide. Instructions are also provided to upgrade from DataStax Community and open source Apache Cassandra.
How do I use the DataStax documentation?
The DataStax documentation landing pages provide information on:
  • Product compatibility
  • Cluster planning
  • Hardware selection
  • Testing for production
  • Supported platforms
  • Administrator and developer topics
  • Resources for additional information
  • Guidance for providing document feedback