Hyper-Converged Database (HCD) Frequently Asked Questions (FAQs)

Databases

How do I install HCD?

You can provision and deploy an HCD cluster using the Mission Control UI, manually using the CLI, or with Docker to explore standalone container development in a local, non-prod environment. DataStax recommends using Mission Control to manage your clusters in production environments. For more information, see Install HCD.

The Docker image can be deployed as a standalone container for development environments.

How is HCD different from Astra DB?

HCD runs on premises and in the cloud, and uses Mission Control for provisioning and management. HCD is intended for companies that must manage their database resources on cloud providers, such as AWS, Google Cloud, and Microsoft Azure.

Astra DB allows you to create databases in minutes, with reduced complexity, fewer operational concerns, and an approachable entry point for developing your applications. Astra DB is a true database-as-a-service (DBaaS) offering that provides simplified database management with flexible pricing.

How is HCD different from DataStax Enterprise (DSE)?

HCD is the future of self-managed data, designed for modern data centers and Hyper-Converged Infrastructure (HCI) to support data workloads and AI systems at scale. It supports powerful on-premises enterprise data systems built to enable AI data and is ideal for enterprise operators and architects who want to take advantage of data center modernization investments to reduce operating costs and operational overhead.

HCD runs on premises and in the cloud, and uses Mission Control for provisioning and management. HCD is intended for companies that must manage their database resources on cloud providers, such as AWS, Google Cloud, and Microsoft Azure.

While you can run DSE on premises and in the cloud, HCD allows you to manage your database resources with modern architecture and a cloud-native approach.

DSE includes Advanced Workloads (DSE Search, Analytics, and Graph), which are not currently available in HCD. HCD will add more features and capabilities over time.

Which DataStax drivers work with HCD?

The following DataStax drivers are compatible with HCD:

What is the practical limit for the amount of data stored in an HCD database?

There is no practical limit to the number of bytes an HCD database can store.

Should I use Storage Attached Indexing (SAI)?

SAI is a highly-scalable, globally-distributed index for Apache Cassandra® that is available for HCD databases.

Explore this deep dive on Storage Attached Indexing.

Which ports does HCD use?

If you need to make firewall exceptions, HCD uses the following ports:

Ports
Port Description

7000

Cassandra inter-node communication

7001

Cassandra SSL inter-node communication

7199

JMX monitoring port

9042

CQL native transport port; also works for SSL

Can I change a namespace or keyspace name?

Yes, you can alter the namespace or keyspace name. You can also create a new namespace or keyspace with a different name.

I created my table. When I run a SELECT statement I get an error. What do I do?

This is the common error:

Invalid Request: Error from server: code=2200
[Invalid query] message="table TABLE_NAME does not exist"

When you create a table, the table name, keyspace name, or column name is case-sensitive. You can access case-sensitive characters using double quotes.

For example, see the SELECT statement below:

token@cqlsh:test> desc tables;
token@cqlsh:test> select * from "**Case_Sensitive**";

Architecture

What is NoSQL?

The NoSQL term originally referred to a new generation of databases that shunned SQL for other interfaces. The term NoSQL has recently become a catch-all term for post-relational "not-only SQL" databases that use a method of storage different from a relational, or SQL, database.

How is HCD different from relational databases?

HCD is a distributed and highly available database that uses peer-to-peer communication. Data modeling in HCD is similar to relational databases while differing in key areas to provide blazing fast interaction. Relational databases use joins between tables for relationships. HCD uses denormalization to achieve more robust querying.

What kind of hardware do I need to run HCD?

The Capacity Planning Guide provides recommendations for choosing hardware including number of CPUs, amount of RAM and type of disks appropriate for the workload and environment.

What operational tools are included with HCD?

Mission Control is the next generation operations platform for deploying, managing, and maintaining HCD, DataStax Enterprise (DSE), and Cassandra clusters. It provides a web-based interface that simplifies management and operation of HCD clusters.

How do I interact with HCD?

HCD architecture allows any authorized user to connect to any node in any datacenter and access data using the Cassandra Query Language (CQL 3.4.5 protocol v5). For ease of use, CQL uses a similar syntax to SQL. The most basic way to interact with HCD is using the CQL shell, cqlsh. Using cqlsh, you can create keyspaces and tables, insert records and query tables, and much more. Other ways to interact with HCD are:

  • Clients that interact with the Data API, a schema-less, document-based, modern API that provides easy and intuitive access to both structured and unstructured data. It leverages the scalability, performance, and real-time indexing capabilities of Cassandra to support GenAI application development. There are clients for Python, TypeScript, and Java.

  • Drivers that interact with CQL for developers who want to take advantage of specific Cassandra features. There are a number of drivers in various programming languages. See DataStax drivers.

How do I move data to and from HCD?

HCD provides several solutions for migrating data from other databases: DataStax Bulk Loader (DSBulk) is an open-source utility for loading or unloading data in CSV or JSON format into and out of HCD databases.

  • The cqlsh COPY FROM and COPY TO commands mirror what the PostgreSQL RDBMS uses for file import/export.

  • The sstableloader utility allows you to bulk load SSTables into HCD from the snapshots of another Cassandra cluster.

  • For more sophisticated use cases that require transformation and manipulation of the source data, extract-transform-load (ETL) solutions that support Cassandra are available from vendors including Talend, Informatica, and Streamsets.

Security

What measures are taken to protect data in HCD?

DataStax is committed to data privacy and security. This commitment includes robust security measures and compliance standards, such as HIPAA, SOC2 Type 2, and ISO 27001.

For more information, see the DataStax Trust Center.

Tunable data consistency and consistency levels

How are consistent read and write operations handled?

HCD extends eventual consistency with tunable consistency to vary the consistency of reads and writes. For more information, see How are consistent read and write operations handled?.

How do HCD transactions differ from RDBMS transactions?

HCD does not use RDBMS ACID transactions; instead HCD offers atomic, isolated, and durable transactions with eventual/tunable consistency. For more information, see How do Hyper-Converged Database (HCD) transactions differ from RDBMS transactions?.

How do I discover consistency level performance?

Use tracing to discover what the consistency level is set to, and how it affects performance. For more information, see How do I discover consistency level performance?.

How do I configure consistency levels?

Consistency levels in HCD can be configured to manage availability versus data accuracy. For more information, see How is the consistency level configured?.

How do I configure serial consistency levels?

Serial consistency levels control lightweight transaction (LWT) isolation. For more information, see How is the serial consistency level configured?.

How are read requests accomplished?

A coordinator node can send three types of read requests to a replica. For more information, see How are read requests accomplished?.

How are write requests accomplished?

The write consistency level determines how many replica nodes must respond with a success acknowledgment in order for the write to be considered successful. For more information, see How are write requests accomplished?.

Support

How do I get help with HCD?

How do I contact DataStax Support?

Only accounts with paid plans can submit support tickets.

If you identify an alert or incident, you can submit a support ticket for help. Go to Support Onboarding to learn how to submit a ticket.

Was this helpful?

Give Feedback

How can we improve the documentation?

© 2024 DataStax | Privacy policy | Terms of use

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries. Kubernetes is the registered trademark of the Linux Foundation.

General Inquiries: +1 (650) 389-6000, info@datastax.com