Accessing the Apache Spark™ SQL Thrift Server with the Simba JDBC driver

The Simba JDBC Driver for Spark provides a standard JDBC interface to the information stored in DataStax Enterprise with the Spark SQL Thrift Server running.

Your DSE license includes a license to use the Simba drivers.

Procedure

  1. Download the Simba JDBC Driver for Apache Spark from the Download DataStax page.

  2. Expand the ZIP file containing the driver.

  3. In your JDBC application, configure the following details:

    1. Add SparkJDBC41.jar and the rest of the JAR files included in the ZIP file in your classpath.

    2. The JDBC driver class is com.simba.spark.jdbc41.Driver and the JDBC data source is com.simba.spark.jdbc41.DataSource.

    3. Set the connection URL to jdbc:spark://<hostname>:<port> where <hostname> is the hostname of the node on which the Spark SQL Thrift Server is running, and <port> is the port number on which the Spark SQL Thrift Server is listening.

      jdbc:spark://node1.example.com:10000

  4. For more details, see Installing Simba JDBC Driver for Apache Spark.

Was this helpful?

Give Feedback

How can we improve the documentation?

© 2024 DataStax | Privacy policy | Terms of use

Apache, Apache Cassandra, Cassandra, Apache Tomcat, Tomcat, Apache Lucene, Apache Solr, Apache Hadoop, Hadoop, Apache Pulsar, Pulsar, Apache Spark, Spark, Apache TinkerPop, TinkerPop, Apache Kafka and Kafka are either registered trademarks or trademarks of the Apache Software Foundation or its subsidiaries in Canada, the United States and/or other countries. Kubernetes is the registered trademark of the Linux Foundation.

General Inquiries: +1 (650) 389-6000, info@datastax.com