New features

DataStax Enterprise 5.1 introduces the following new features and enhancements:

Feature Description

DSE Analytics

Support for Apache Spark™ 2.0 including SparkR. Improvements include operational improvements, performance improvements, structured streaming, DSE GraphFrames, Spark SQL, and geospatial types.

DSEFS

DataStax Enterprise File System provides a distributed file system for storing very large data sets, such as Spark Streaming data and analytic processing. DSEFS replaces the CFS (Cassandra File System).

DSE Graph

Improvements includes new fuzzy search; advanced configuration of search indexing; better data loading performance improved geospatial querying; Kerberos support for Graph Loader; ability to customize graph visualizations and stability improvements.

DSE Security

New security features include granular access control that allows permissions on table rows and search indexes; DSE Proxy management for web services; and JMX authentication integrated with DSE Unified Authentication (LDAP or internal). For details, see New security features in DSE 5.1.

DSE Search

Faster and simplified DSE Search using Apache Solr™ 6.0. New index management CQL and cqlsh commands to streamline operations and development. Improved query and indexing performance. Enhanced error handling and improvements in hardening and stability.

DSE Advanced Replication

Re-designed to use CDC (Change Data Capture) logs, CDC is ideal for configurable replication between sources and destinations. Suitable for environments where sporadic connectivity that can occur, such as a network of microservices clusters that report data to a central analytics cluster.

Drivers

Support for new DSE 5.1 functionality, such as Unified Authentication, proxy login and execution, and the new data types. Additionally, the Java and Python drivers contain a DSE Graph fluent API that use the Gremlin Traversal API for programmatically building Gremlin queries.

DataStax Studio

Added support for CQL (Cassandra Query Language). This new features provides the ability to visually navigate database objects, create and tune CQL queries. Studio features an intelligent CQL editor providing syntax highlighting, validation, intelligent code completion, configuration options, and query profiling. Improvements in DSE Studio for DSE Graph include better usability; more complete profiling for graph queries; and new customization capabilities for graph visualization, including coloring and sizing vertices by label or property value, and custom shapes and icons. Schema visualization is also improved.

DSE In-Memory

MemoryOnlyStrategy now works with compression.

Operations

Performance improvements including faster server startup. Improved help for CQL and cqlsh commands. Tab completion to cqlsh for DSE custom compaction strategies. Improvements to dsetool and dse client-tool.

New database features

cassandra-stress -graph option

Results can be automatically graphed for data visualization.

Clustering columns improvement

Clustering columns without a secondary index can be used in a WHERE clause, provided the ALLOW FILTERING clause is also used.

CREATE TABLE WITH ID

If a table is accidentally dropped, recreate it with its ID and replay the commitlog to regain data.

JBOD compaction and backup

Updated SSTable partitioning by token range has improved JBOD compaction and backup. The nodetool relocatesstables command supports the improvement. For details, see the Improving JBOD blog.

jvm.options file

Garbage collection (GC) and other JVM options moved to the jvm.options file from the cassandra-env.sh file.

nodetool updates
PER PARTITION LIMIT

A query can be limited to return results from each partition, such as a Top 3 listing.

sstableloader -ap option

sstableloader (Bulkloader) can use third-party authentication.

Static columns improvements

Static columns can be indexed (experimental).

Static columns can be used with SASI indexes (experimental).

Timestamp in sub-second precision

Timestamp defaults include sub-second precision.

TTL for COPY FROM

Specify a TTL value when copying from CSV files.

user-defined type (UDT) improvements

If a UDT has only non-collection fields, an individual field value can be updated or deleted.

For more details, see the DataStax Enterprise 5.1 release notes. To upgrade to DataStax Enterprise 5.1, see the DataStax Upgrade Guide.

Was this helpful?

Give Feedback

How can we improve the documentation?

© 2024 DataStax | Privacy policy | Terms of use

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries. Kubernetes is the registered trademark of the Linux Foundation.

General Inquiries: +1 (650) 389-6000, info@datastax.com