Configuring parallelism

Adjusting the number of tasks, simultaneous writes, and batch size.

In order for workers to split DataStax Apache Kafka connector tasks, the workers must have the same group.id in the connect-distributed.properties file. The parallelism is limited by the partitions for the given Kafka topic.

Example - Kafka topic named my-topic has 10 partitions and the connector group is configured to have 20 tasks. The maximum unit of parallelism in this environment would be 10 because each partitions cannot be subdivided among the tasks.
Note: Use additional Apache Kafka Connect workers to scale the connector if greater throughput is needed.