Monitor database metrics
You can view health metrics for your databases, such as latency and throughput. These metrics provide insights into database performance and workload distribution.
View metrics in the Astra Portal
-
In the Astra Portal, click Managed clusters, and then click the name of your Astra DB Classic database.
-
Click the Health tab to explore database metrics in the embedded Grafana viewer.
To view granular metrics for time-series graphs, hover over a specific time in the graph.
Gaps in read/write metrics indicate periods when there were no requests.
For more information about specific widgets or metrics, see Astra DB Classic metrics widgets and [metrics-definitions].
-
To sort and filter the data, do the following:
-
For multi-region databases, use the Region menu to inspect data for a different region.
-
To change the reporting period, use the schedule Time Frame Section menu. The default time range is the last 30 minutes. For more information about time controls, see the Grafana documentation.
-
You can click Refresh to refresh the dashboard. The default refresh interval is one minute.
-
-
To drill down into a widget’s data, hover over the widget, click More, and then select an option:
-
Inspect: Inspect timestamped data or raw JSON data. You can also export the data as a CSV.
-
View: View the widget in full screen mode. To navigate back to the dashboard view, use the breadcrumbs in the header of the Grafana viewer.
-
-
To view the dashboard in a full window, click Cycle View Mode, press Esc, click Share, and then open the URL in a new browser tab or window.
-
To set up alerts or switch dashboards, click Toggle menu, and then click Dashboards or Alerting.
Astra DB Classic metrics widgets
The Health tab includes the following sections and widgets.
Cluster Overview section
This section reports the following individual metrics:
-
Nodes Up
-
Nodes Down
-
Compactions per five minutes
-
Requests per five minutes
-
Dropped Messages per five minutes
-
Clients Connected
-
Client Timeouts per five minutes
-
Total Hints
-
Live Data Size
Coordinator Metrics section
This section contains time series graphs for the following metrics or groups of metrics:
-
Connected Clients
-
Client Timeouts per five minutes
-
Requests Served per five minutes: This widget reports the number of requests served by the coordinator over a five minute period.
-
Coordinator 0.95 Latency per cluster per five minutes: This widget reports the 95th percentile latency metrics for the coordinator nodes. You can use these metrics to help identify coordinator specific issues, such as inter-node latency.
-
Compactions All.all per cluster: This widget reports Bytes Compacted, Completed Compaction, and Pending Compactions. You can use these metrics to help determine if the compaction workload is impacting your Astra DB Classic database performance.
Compaction is the process of reconciling different data copies stored in different SSTables.
SSTables are immutable data files that Astra databases use for persisting data on disk.
-
Memtable Space All.all per cluster: This widget reports Off Heap, On Heap, Data Size, Flushes, and Pending Flushes for memtables, which are in-memory structures for buffering writes. You can use these metrics to help identify and analyze tables that are frequently written to.
-
Large Partitions
-
Time Since Last Repair
-
Misplaced Operations
Time-series graph legend
The legends for time-series graphs can include the following:
-
Read: A query that retrieves data from your database.
-
Write: An insert/update operation that writes data to your database.
-
ViewWrite: A write operation related to materialized views (MVs).
-
RangeSlice: Includes reads that span a range of partitions or secondary indexes as well as queries that don’t use a
WHERE
clause. -
CASRead: Compare and set (CAS) reads.
-
CASWrite: Compare and set (CAS) writes.
Compare and set (CAS) is equivalent to Lightweight Transactions (LWTs), where the server returns some specialized results indicating the outcome and optional state preceding a transaction. LWTs are a suboptimal pattern that can negatively impact performance. |