Metrics

Mission Control collects metrics across all components and aggregates them across projects and clusters. Review this unified observability data in the centralized user interface. Mission Control installs and configures metrics components at the same time as the Mission Control control plane and scales those components independently.

These components enable you to monitor metrics from many sources within Mission Control, including:

Platform services
Operators
Observability components
Reaper
Database instances

Mission Control only deploys observability components to Platform instances.

Metrics are read from the database as the scraping occurs, providing the most up-to-date metrics. Metrics are scraped every 30 seconds and logs are collected as they are written to disk. This data is then pushed to aggregator instances. The aggregator handles applying configured transforms and sinks to the data stream.

See collected metrics with Mission Control’s graphical metrics view.

You can use Mission Control to push metrics to existing monitoring stacks. Manipulate and send observability data externally by adding custom transforms and sinks in the Mission Control configuration.

Prerequisites

You must provide an AWS S3 or S3-compatible, Google Cloud Storage, or an Azure Blob Storage object store during installation and configuration. All metrics are stored within an object store, providing long-term storage for metrics.

When cloud-based metrics storage is a concern, for example if you don’t use a cloud provider, you can use an S3 API to store objects. For environments without an S3 endpoint, you can use MinIO to provide an S3-compatible object store within the Mission Control platform.

Installation methods

KOTS installation
Helm installation

For KOTS-based installations, you configure metrics during the KOTS installation process through the web-based configuration interface.

For Helm-based installations, you manage metrics configuration through the values.yaml file. The key components for metrics collection are:

aggregator: Configures the Vector aggregator that processes and forwards metrics
mimir: Configures the metrics storage backend
agent: Configures the Vector agent that collects metrics from nodes

For detailed configuration options, see Install Mission Control with Helm.

Metrics collection

Vector is an observability pipeline framework from Datadog that collects metrics and logs from various Mission Control services. Vector transforms and sends the metrics to destinations like Loki and Mimir. The mission-control-aggregator ConfigMap in the mission-control namespace stores configuration data for Vector, including the vector.yaml file. The vector.yaml file defines Vector’s behavior, such as configured transforms and sinks.

Each database instance within a control or data plane includes a server-system-logger sidecar container. The server-system-logger collects metrics and logs generated by the local database instance.

This configuration is part of a larger Vector configuration that defines other components, such as sources, transforms, and other sinks.

KOTS configuration
Helm configuration

vector.toml

[sinks.vector_aggregator]
type = "vector"
inputs = ["cassandra_metrics", "enrich_host_metrics", "add_source_to_systemlog", "gclog_parser"]
address = "mission-control-aggregator.mission-control.svc:6000"

[sinks.console_log]
type = "console"
inputs = ["systemlog"]
target = "stdout"
encoding.codec = "text"

For Helm installations, Vector configuration is managed through the aggregator.customConfig section in your values.yaml file:

aggregator:
  enabled: true
  customConfig:
    sinks:
      vector_aggregator:
        type: "vector"
        inputs: ["cassandra_metrics", "enrich_host_metrics", "add_source_to_systemlog", "gclog_parser"]
        address: "mission-control-aggregator.mission-control.svc:6000"
      console_log:
        type: "console"
        inputs: ["systemlog"]
        target: "stdout"
        encoding:
          codec: "text"

This TOML configuration defines two sinks within a Vector configuration. In this case, the configuration specifies how to send data to the vector_aggregator and the console.

This configuration instructs Vector to:

Collect data from the specified input sources.
- cassandra_metrics
- enrich_host_metrics
- add_source_to_systemlog
- gclog_parser
Forward the collected data to the vector_aggregator service at the specified address.
Forward the systemlog data to the console for immediate inspection.

The following is a breakdown of the configuration:

[sinks.vector_aggregator]: Defines the vector_aggregator sink. The aggregator forwards direct sources, enriched sources, and parsed logs to the defined address.
- type = "vector": Specifies that this sink is of type "vector", indicating that it forwards data to another Vector instance.
- inputs = ["cassandra_metrics", "enrich_host_metrics", "add_source_to_systemlog", "gclog_parser"]: Specifies the input sources that are forwarded to the vector_aggregator. These sources represent different types of metrics or logs collected by Vector.
- address = "mission-control-aggregator.mission-control.svc:6000": Sets the address of the vector_aggregator service. In this example, the mission-control-aggregator service in the mission-control namespace listening on port 6000.
[sinks.console_log]: Defines the console_log sink.
- type = "console": Specifies that this sink is of type "console", indicating that it will output data to the console.
- inputs = ["systemlog"]: Specifies that the systemlog input source will be forwarded to the console.
- target = "stdout": Sets the target output stream to the standard output (stdout). encoding.codec = "text": Specifies that the output data must be encoded in text format.

View metrics

Use the Mission Control UI to view metrics.

In the Mission Control UI, go to Home, and then select your target cluster’s project.
Click Observability.
In the Health Metrics tab, hold the pointer over on any part of a chart to view more details. Review details for a specific time by moving your cursor along the horizontal time line.
Optional: In the Filter list, select the datacenter to monitor.
Optional: In the Frequency list, select the duration in which to refresh the metrics.
Optional: In the Time Period list, select the monitoring time period.

What metrics can I see?

Various Mission Control views reveal real-time and historical performance status about clusters, datacenters, nodes, tables, data, and storage tiers.

Overview view
Node view
Observability view

In the Mission Control UI, go to Home, and then select your target cluster’s project.
In the Overview tab, the Mission Control Overview view reveals datacenter and node information.

In the Mission Control UI, go to Home, and then select your target cluster’s project.
In the Nodes section of the Overview tab, in the Name column, click on a node.
Monitor node specifics such as:
- Availability of nodes - the status is next to the node name
- Type of database - HCD, DSE, or Cassandra
- Storage Capacity - largely measured in gigabytes (GB)
- Load
- Memory Usage - with details about System, Heap, and In Memory usage
- Gossip activity
- Pending Tasks
- Number of Native clients
- Days of Uptime
- Running Tasks - with Type, SSTable, and Progress
- Incoming Streams - with Operation, Peer, and Progress
- Outgoing Streams - with Operation, Peer, and Progress
- Thread Pool Stats - with Name, Active, Pending, Completed, Blocked, and Total Blocked

In the Mission Control UI, go to Home, and then select your target cluster’s project.
Click Observability.
In Examine metrics and logs, monitor datacenter activity in the cluster for a specific Frequency and Time Period:
- Read/Write Throughput
- Read/Write Latencies
- Other Latencies
- Errors
- CPU Utilization
- Unix Load
- Garbage Collection Time
- Disk Read Throughput
- Disk Write Throughput
- Network IO - with Receive (RX) and Transmit (TX) values