DataStax Bulk Loader release notes

Release notes for DataStax Bulk Loader.

Release notes for DataStax Bulk Loader.

DataStax Bulk Loader can migrate data in CSV or JSON format into DSE from another DSE or Apache CassandraTM cluster.
  • Can unload data from any Cassandra 2.1 or later data source
  • Can load data to DSE 5.0 or later

DataStax Bulk Loader 1.2.0 release notes

Changes, enhancements, and resolved issues for DataStax Bulk Loader 1.2.0.

1 August 2018

DataStax Bulk Loader 1.2.0 release notes include:

1.2.0 Changes and enhancements

After upgrade to 1.2.0, be sure to review and adjust scripts to use changed settings.

  • Improve range split algorithm in multi-DC and vnodes environments. (DAT-252)
  • Support simplified notation for JSON arrays and objects in collection fields. (DAT-317)

1.2.0 Resolved issues:

  • CSVWriter trims leading/trailing whitespace in values. (DAT-302)
  • CSV connector fails when the number of columns in a record is greater than 512. (DAT-311)
  • Bulk Loader fails when mapping contains a primary key column mapped to a function. (DAT-326)

DataStax Bulk Loader 1.1.0 release notes

Changes, enhancements, and resolved issues for DataStax Bulk Loader 1.1.0.

18 June 2018

DataStax Bulk Loader 1.1.0 release notes include:

1.1.0 Changes and enhancements

After upgrade to 1.1.0, be sure to review and adjust scripts to use changed settings.

  • Combine batch.mode and batch.enabled into a single setting: batch.mode. If you are using the batch.enabled setting in scripts, change to batch.mode with value DISABLED. (DAT-287)
  • Improve handling of Univocity exceptions. (DAT-286)
  • Logging improvements. (DAT-290)
    • Log messages are logged only to operation.log. Logging does not print to stdout.
    • Configurable logging levels with the log.verbosity setting.
    • The setting log.ansiEnabled is changed to log.ansiMode.
  • New count workflow. (DAT-291, DAT-299)
    • Supports counting rows in a table.
    • Configurable counting mode.
    • When mode = partitions, configurable number of partitions to count. Support to count the number of rows for the N biggest partitions in a table.
  • Counter tables are supported for load and unload. (DAT-292)
  • Improve validation to include user-supplied queries and mappings. (DAT-294)
  • The codec.timestamp CQL_DATE_TIME setting is renamed to CQL_TIMESTAMP. Adjust scripts to use the new setting. (DAT-298)

1.1.0 Resolved issues:

  • Generated query does not contain all token ranges when a range wraps around the ring. (DAT-295)
  • Empty map values do not work when loading using dsbulk. (DAT-297)
  • DSBulk cannot handle columns of type list<timestamp>. (DAT-288)
  • Generated queries do not respect indexed mapping order. (DAT-289)
  • DSBulk fails to start with Java 10+. (DAT-300)

DataStax Bulk Loader 1.0.2 release notes

Release notes for DataStax Bulk Loader 1.0.2.

5 June 2018

DataStax Bulk Loader 1.0.2 release notes include:

1.0.2 Changes and enhancements

  • DataStax Bulk Loader 1.0.2 is bundled with DSE 6.0.1. (DSP-16206)
  • Configure whether to use ANSI colors and other escape sequences in log messages printed to standard output and standard error. (DAT-249)

DataStax Bulk Loader 1.0.1 release notes

Release notes for DataStax Bulk Loader 1.0.1.

17 April 2018

DataStax Bulk Loader 1.0.1 release notes include:

1.0.1 Changes and enhancements

  • DataStax Bulk Loader (dsbulk) version 1.0.1 is automatically installed with DataStax Enterprise, and can also be installed as a standalone tool. DataStax Bulk Loader 1.0.1 is supported for use with DSE 5.0 and later. (DSP-13999, DSP-15623)
  • Support to manage special characters on the command line and in the configuration file. (DAT-229)
  • Improve error messages for incorrect mapping. (DAT-235)
  • Improved monitoring options. (DAT-238)
  • Detect console width on Windows. (DAT-240)
  • Null words are supported by all connectors. The schema.nullStrings is changed to codec.nullWords. Renamed the convertTo and convertFrom methods. See Codec options and Schema options. (DAT-241)
  • Use Logback to improve filtering to make stack traces more readable and useful. On ANSI-compatible terminals, the date prints in green, the hour in cyan, the level is blue (INFO) or red (WARN), and the message prints in black. (DAT-242)
  • Improved messaging for completion with errors. (DAT-243)
  • Settings schema.allowExtraFields and schema.allowMissingFields are added to reference.conf. (DAT-244)
  • Support is dropped for using :port to specify the port to connect to. Specify the port for all hosts only with driver.port. (DAT-245)

1.0.1 Resolved issues

  • Numeric overflows should display the original input that caused the overflow. (DAT-237)
  • Null words are not supported by all connectors. (DAT-241)
  • Addresses might not be properly translated when cluster has custom native port. (DAT-245)