• Glossary
  • Support
  • Downloads
  • DataStax Home
Get Live Help
Expand All
Collapse All

DataStax Streaming Home

Streaming Learning

    • Processing Data
      • Change Data Capture
        • Overview
        • Table schema evolution
        • Consuming change data
        • Questions and patterns
      • Data Pipelines
    • Migrating to Apache Pulsar
      • Starlight for Kafka
      • Starlight for RabbitMQ
      • Starlight for JMS
    • APIs & References
      • Connectors
        • Sources
          • Data Generator
          • Debezium MongoDB
          • Debezium MySQL
          • Debezium Oracle
          • Debezium PostgreSQL
          • Debezium SQL Server
          • Kafka
          • Kinesis
        • Sinks
          • Astra DB (Cassandra Enhanced)
          • Cloud Storage
          • Elasticsearch
          • Google BigQuery
          • JDBC PostgreSQL
          • JDBC MariaDB
          • JDBC Clickhouse
          • JDBC SQLite
          • Kafka
          • Kinesis
          • Snowflake
      • Pulsar Functions
      • Transformation Functions
        • Overview
        • Cast
        • Compute
        • Drop
        • Flatten
        • Merge KeyValue
        • Unwrap KeyValue
        • Deploy transform function in sink
        • Drop fields
      • Topic Subscriptions
        • Overview
        • Exclusive
        • Failover
        • Shared
        • Key Shared
  • Streaming Learning
  • DataStax Streaming Learning
Edit this Page

DataStax Streaming Learning

The learning site is a collection of guides and articles about DataStax streaming products. Each product has its own area dedicated to providing specifics to get started, its features, and other details. The learning site helps you get the most out of each product and get to production using best practices.

Processing Data

Change Data Capture (CDC)

Change Data Capture (CDC) is a design pattern used in software development to capture and propagate changes made to data in a system. The CDC pattern is commonly used in real-time data streaming applications to enable near-real-time processing of data changes.

Real-time data pipeline

A real-time data pipeline is a series of steps that takes data from its original source and moves it to a destination such as a data warehouse or data lake. The pipeline is a set of steps that are executed in a specific order to transform the data and make it available for analysis.

Migrating to Apache Pulsar

Starlight for JMS

Starlight for JMS allows enterprises to take advantage of the scalability and resiliency of a modern streaming platform to run their existing JMS applications.

Get started now | Configuring | Source Code↗

Starlight for Kafka

Starlight for Kafka brings native Apache Kafka® protocol support to Apache Pulsar by introducing a Kafka protocol handler on Pulsar brokers.

Get started now | Configuring | Source Code↗

Starlight for RabbitMQ

Starlight for RabbitMQ combines the AMQP 0.9.1 API with Pulsar, providing a powerful way to modernize your RabbitMQ infrastructure, improve performance, and reduce costs.

Get started now | Configuring | Source Code↗

APIs & References

Connectors

A connector is a function that moves data between Apache Pulsar and external systems. Source are used to push data to Pulsar from external systems such as databases, message queues, and storage systems. Sinks are used to pull data from a Pulsar topic to an external system like a database, data warehouse, or storage system.
Sources | Sinks | Experimental

Pulsar Functions

Functions are lightweight compute processes that enable you to process each message received on a topic. You can apply custom logic to that message, transforming or enriching it, and then output it to a different topic.
Learn more

Transformation Functions

Transform functions are a low-code implementation of common Pulsar functions. They are used to transform messages from one format to another. Use them to transform a message, enrich messages with additional data, or filter messages based on their content.
Cast | Compute | Drop | Flatten | Merge | Unwrap

Topic Subscriptions

Subscriptions in Pulsar describe which consumers are consuming data from a topic and how they want to consume that data.
Exclusive | Shared | Failover | Key Shared

Overview

General Inquiries: +1 (650) 389-6000 info@datastax.com

© DataStax | Privacy policy | Terms of use

DataStax, Titan, and TitanDB are registered trademarks of DataStax, Inc. and its subsidiaries in the United States and/or other countries.

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.

Kubernetes is the registered trademark of the Linux Foundation.

landing_page landingpage