• Glossary
  • Support
  • Downloads
  • DataStax Home
Get Live Help
Expand All
Collapse All

DataStax Bulk Loader

    • About DataStax Bulk Loader
    • Release notes
    • Architecture
    • Installing
    • Getting Started
      • Loading data
      • Unloading data
      • Counting data in tables
      • Creating configuration files
      • Loading tables that contain static and non-static columns
      • Using SSL with dsbulk
      • Printing cluster information
    • Kerberos client authentication
    • Reference
      • dsbulk
        • Loading data examples
        • Unloading data examples
        • Counting data example
        • Exit codes
      • Common options
      • Connector options
      • Count options
      • Schema options
      • Batch options
      • Codec options
      • Driver options
      • Engine options
      • Executor options
      • Logging options
      • Monitoring options
  • DataStax Bulk Loader
  • Reference
  • Executor options

Executor options

Specify executor options for the dsbulk command. These options define how DataStax Bulk Loader executes queries, and allow users to control throughput, throttling, concurrency, and query method execution.

--executor.maxPerSecond, --dsbulk.executor.maxPerSecond number

The maximum number of concurrent operations per second. This fixed value setting acts as a safeguard to prevent more requests than the cluster can handle. To set a variable throughput limit, use the --dsbulk.executor.maxInFlight number option.

Batch statements are counted by the number of statements included. Reduce this setting when the latencies get too high and a remote cluster cannot keep up with throughput, as dsbulk requests will eventually time out. This setting applies to all operations. When writing to the database, it applies to the number of statements executed. When reading from the database, it applies to the number of rows retrieved.

For important related information, see --dsbulk.engine.maxConcurrentQueries.

Default: -1 [Starting in 1.6.0, disabled by default -1. Zero 0 or any negative value also disables this option.]

When connecting to DataStax Astra DB clusters, a rate limit is always enforced. By default, the limit is 4,096 operations per second and per coordinator. Therefore, when --executor.maxPerSecond is left to its default setting, and the cluster is an Astra DB cluster, DataStax Bulk Loader automatically applies an appropriate limit and logs a message. Increasing this setting to load data at a higher rate can trigger Astra DB rate limit errors. See Get support for Astra DB for details about how to get support and to raise the rate limit.

--executor.maxInFlight, --dsbulk.executor.maxInFlight number

The maximum number of "in-flight" requests, or maximum number of concurrent requests waiting for a response from the server. This acts as a safeguard to prevent more requests than the cluster can handle. Batch statements count as one request. Reduce this value if you are facing out-of-memory errors.

For important related information, see --dsbulk.engine.maxConcurrentQueries.

Default: -1 [Starting in 1.6.0, disabled by default]

--executor.continuousPaging.enabled, --dsbulk.executor.continuousPaging.enabled {true | false}

Enable or disable continuous paging. If the target cluster does not support continuous paging, or if --datastax-java-driver.basic.request.consistency is not ONE or LOCAL_ONE, traditional paging is used regardless of this setting. Can be used with unload and count operations. Not applicable for load.

Default: true

--executor.continuousPaging.maxConcurrentQueries, --dsbulk.executor.continuousPaging.maxConcurrentQueries number

Deprecated. The correct option to use is --engine.maxConcurrentQueries.

--executor.continuousPaging.maxPages, --dsbulk.executor.continuousPaging.maxPages number

Deprecated. The correct option to use is --datastax-java-driver.advanced.continuous-paging.max-pages.

--executor.continuousPaging.maxPagesPerSecond, --dsbulk.executor.continuousPaging.maxPagesPerSecond number

Deprecated. The correct option to use is --datastax-java-driver.advanced.continuous-paging.max-pages-per-second.

--executor.continuousPaging.pageSize, --dsbulk.executor.continuousPaging.pageSize number

Deprecated. The correct option to use is --datastax-java-driver.advanced.continuous-paging.page-size.

--executor.continuousPaging.pageUnit, --dsbulk.executor.continuousPaging.pageUnit {ROWS | BYTES}

Deprecated. The correct option to use is --datastax-java-driver.advanced.continuous-paging.page-size-in-bytes.

Engine options Logging options

General Inquiries: +1 (650) 389-6000 info@datastax.com

© DataStax | Privacy policy | Terms of use

DataStax, Titan, and TitanDB are registered trademarks of DataStax, Inc. and its subsidiaries in the United States and/or other countries.

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries.

Kubernetes is the registered trademark of the Linux Foundation.

landing_page landingpage