DataStax Streaming Learning

The learning site is a collection of guides and articles about DataStax streaming products. Each product has its own area dedicated to providing specifics to get started, its features, and other details. The learning site helps you get the most out of each product and get to production using best practices.

Processing Data

Change Data Capture (CDC)

Change Data Capture (CDC) is a design pattern used in software development to capture and propagate changes made to data in a system. The CDC pattern is commonly used in real-time data streaming applications to enable near-real-time processing of data changes.

Real-time data pipeline

A real-time data pipeline is a series of steps that takes data from its original source and moves it to a destination such as a data warehouse or data lake. The pipeline is a set of steps that are executed in a specific order to transform the data and make it available for analysis.

Migrating to Apache Pulsar™

Starlight for JMS

Starlight for JMS allows enterprises to take advantage of the scalability and resiliency of a modern streaming platform to run their existing JMS applications.

Get started now | Configuring | Source Code

Starlight for Kafka

Starlight for Kafka brings native Apache Pulsar protocol support by introducing a Kafka protocol handler on Pulsar brokers.

Get started now | Configuring | Source Code

Starlight for RabbitMQ

Starlight for RabbitMQ combines the AMQP 0.9.1 API with Pulsar, providing a powerful way to modernize your RabbitMQ infrastructure, improve performance, and reduce costs.

Get started now | Configuring | Source Code

APIs & References

Connectors

A connector is a function that moves data between Apache Pulsar and external systems. Source are used to push data to Pulsar from external systems such as databases, message queues, and storage systems. Sinks are used to pull data from a Pulsar topic to an external system like a database, data warehouse, or storage system.
Sources | Sinks | Experimental

Pulsar Functions

Functions are lightweight compute processes that enable you to process each message received on a topic. You can apply custom logic to that message, transforming or enriching it, and then output it to a different topic.
Learn more

Transformation Functions

Transform functions are a low-code implementation of common Pulsar functions. They are used to transform messages from one format to another. Use them to transform a message, enrich messages with additional data, or filter messages based on their content.
Cast | Compute | Drop | Flatten | Merge | Unwrap

Topic Subscriptions

Subscriptions in Pulsar describe which consumers are consuming data from a topic and how they want to consume that data.
Exclusive | Shared | Failover | Key Shared

DataStax Streaming Learning

Processing Data

Migrating to Apache Pulsar™

APIs & References

Connectors

Pulsar Functions

Transformation Functions

Topic Subscriptions

Was this helpful?

Give Feedback