A brief overview of the Apache Cassandra data model.
Apache Cassandra™ is a partitioned row store. It is an open-source, distributed database system designed for storing and managing large amounts of data across commodity servers.
- You design the data model around the queries you want to perform, not around entities and relationships as you would for a relational database.
- A keyspace is the outermost grouping of data, similar to a schema in a relational database. All tables go inside a keyspace, and replication is defined at the keyspace level.
- A table stores data based on a primary key, which consists of a partition key and optional clustering columns.
- A partition key defines the node on which the data is stored.
- Clustering columns define the order in which rows are stored within a partition.
- The primary key is used to access the data in the table.

Note: In earlier versions of Apache Cassandra, a column family was, in many respects, synonymous with a table.
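
The roles of the partition key and clustering columns can be illustrated with a small toy model. This is only a sketch under stated assumptions, not how Cassandra is implemented: the node names, the `node_for` helper, and the `ToyTable` class are hypothetical, and the hash-modulo placement stands in for Cassandra's real partitioner (Murmur3Partitioner by default) and token ring with replication.

```python
import hashlib
from collections import defaultdict

# Hypothetical three-node cluster, for illustration only.
NODES = ["node-a", "node-b", "node-c"]


def node_for(partition_key: str) -> str:
    """Pick a node by hashing the partition key.

    Simplified stand-in for a partitioner: the same key always
    hashes to the same node.
    """
    digest = hashlib.md5(partition_key.encode("utf-8")).digest()
    token = int.from_bytes(digest[:8], "big")
    return NODES[token % len(NODES)]


class ToyTable:
    """Toy table whose primary key is (partition key, clustering value)."""

    def __init__(self):
        # partition key -> {clustering value -> row data}
        self.partitions = defaultdict(dict)

    def insert(self, partition_key, clustering, value):
        # Writing the same primary key again overwrites the row.
        self.partitions[partition_key][clustering] = value

    def read_partition(self, partition_key):
        # Rows come back ordered by the clustering value.
        rows = self.partitions.get(partition_key, {})
        return sorted(rows.items())


table = ToyTable()
table.insert("sensor-1", "2024-01-02", 21.5)
table.insert("sensor-1", "2024-01-01", 20.0)
table.insert("sensor-2", "2024-01-01", 18.0)

for key in ("sensor-1", "sensor-2"):
    print(key, "on", node_for(key), table.read_partition(key))
```

In this sketch, a query that supplies the partition key touches a single partition and gets rows back already sorted, which is why the table layout is chosen to match the queries you plan to run.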
More information on data modeling:
- Data modeling concepts: an in-depth discussion in the CQL 3.3 documentation for Apache Cassandra.
- Using CQL in the CQL 3.3 documentation.
- Getting Started with Time Series Data Modeling white paper
- Getting Started with User Profile Data Modeling white paper
- Become a Super Modeler webinar
- The Data Model is Dead, Long Live the Data Model webinar
- C* Summit 2013: The World's Next Top Data Model webinar