Loading data

DSE Graph Loader can load data from a variety of input formats and sources. Pick the option that most closely matches your data source:

Supported DSE Graph Loader data sources

| Type | Description | Instructions |
|------|-------------|--------------|
| CSV | Strict format, with the first line of the file identifying the property keys used in the graph. | Loading CSV data |
| Text | Delimited text data of any format. | Loading TEXT data |
| Text with regular expressions | Delimited text data parsed using regular expressions (regex). | Loading TEXT data using regular expressions (regex) |
| JSON | Data stored in JSON (JavaScript Object Notation) format. | Loading JSON data |
| JDBC-compatible database | Data stored in a JDBC-compatible database. | Loading data from a JDBC compatible database. |
| HDFS file | Data file stored in a Hadoop Distributed File System (HDFS) of any format. | Loading data from Hadoop (HDFS) |
| AWS S3 file | Data file stored in AWS S3 storage of any format. | Loading data from AWS S3 |
| Gryo | Data stored in a binary Gryo format. | Loading Gryo data |
| GraphSON | Data stored in GraphSON format. | Loading GraphSON data |
| GraphML | Data stored in GraphML format. | Loading GraphML data |

DSE Graph Loader prunes fields in text and CSV files that are empty or contain NULL or null. Use a transform if different behavior is desired.
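
To make the pruning behavior concrete, here is a minimal Python sketch of what dropping such fields from a record looks like. It illustrates the rule described above and is not DSE Graph Loader code: the function names, the sample data, and the choice to match the strings exactly are all assumptions made for the example.

```python
import csv
import io

# Values the text says are pruned. Exact string matching is an assumption.
PRUNED_VALUES = {"", "NULL", "null"}


def prune_record(record):
    """Return a copy of the record that keeps only non-pruned string fields."""
    return {
        key: value
        for key, value in record.items()
        if isinstance(value, str) and value not in PRUNED_VALUES
    }


def read_pruned(text, delimiter=","):
    """Parse delimited text whose first line is a header, pruning each record."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    return [prune_record(row) for row in reader]


# Hypothetical pipe-delimited input, used only for illustration.
sample = "name|gender|age\nalice|F|NULL\nbob||null\n"
records = read_pruned(sample, delimiter="|")
for record in records:
    print(record)
```

In this sketch the first record keeps only name and gender, and the second keeps only name. A record that loses a field this way carries no value for that property.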

When loading vertices with custom vertex ids, DSE Graph Loader bypasses its vertex cache to allow faster write throughput. The client must ensure that vertices are unique, because no logic validates whether a vertex with a given custom id already exists. For the fastest performance, set the DSE Graph configuration option external_vertex_verify to false.
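
Because uniqueness is left to the client, one option is to check the input for repeated ids before loading. The following Python sketch shows one way to do that for delimited data with a header line. It is a pre-load check run outside DSE Graph Loader; the function name, the person_id field, and the sample data are hypothetical.

```python
import csv
import io
from collections import Counter


def find_duplicate_ids(text, id_field, delimiter=","):
    """Return the id values that occur more than once, in sorted order."""
    reader = csv.DictReader(io.StringIO(text), delimiter=delimiter)
    counts = Counter(row[id_field] for row in reader)
    return sorted(value for value, count in counts.items() if count > 1)


# Hypothetical input in which the custom id 1 appears twice.
sample = "person_id,name\n1,alice\n2,bob\n1,carol\n"
duplicates = find_duplicate_ids(sample, "person_id")
if duplicates:
    print("duplicate custom ids:", duplicates)
```

For a real file, the same function can be given the file's contents as a string. Very large inputs would call for a streaming or database-side check instead of holding every id in memory.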

DSE Graph Loader also supports loading several files of the same format from a single directory. Example mapping scripts are shown for CSV and JSON, but the same approach works for all formats.

Loading CSV data

How to use the DSE Graph Loader to load CSV data.

Loading JSON data

How to use the DSE Graph Loader to load JSON data.

Loading TEXT data

How to use the DSE Graph Loader to load delimited text data.

Loading TEXT data using regular expressions (regex)

How to use the DSE Graph Loader to load text data using regex.

Loading data from a JDBC compatible database.

How to use the DSE Graph Loader to load data from a JDBC compatible database.

Loading data from Hadoop (HDFS)

How to use the DSE Graph Loader to load data from Hadoop (HDFS).

Loading data from AWS S3

How to use the DSE Graph Loader to load data from AWS S3.

Loading Gryo data

How to use the DSE Graph Loader to load Gryo data.

Loading GraphSON data

How to use the DSE Graph Loader to load GraphSON data.

Loading GraphML data

How to use the DSE Graph Loader to load GraphML data.
