Accessing the Spark SQL Thrift Server with the Simba JDBC driver

The Simba JDBC driver for Spark provides a standard JDBC interface to the information stored in DataStax Enterprise with the Spark SQL Thrift Server running.

Your DSE 5.1 license includes a license to use the Simba drivers.

Make sure you have a running DSE Analytics cluster with Spark enabled where one node in the cluster is running the Spark SQL Thrift Server.
Contact IBM Support to download the Simba JDBC driver for Apache Spark.
Expand the ZIP file containing the driver.
In your JDBC application, configure the following:
1. Add SparkJDBC41.jar and the rest of the JAR files included in the ZIP file in your classpath.
2. The JDBC driver class is com.simba.spark.jdbc41.Driver and the JDBC data source is com.simba.spark.jdbc41.DataSource.
3. Set the connection URL to jdbc:spark://<hostname>:<port>, such as jdbc:spark://node1.example.com:10000. <hostname> is the hostname of the node on which the Spark SQL Thrift Server is running. <port> is the port number on which the Spark SQL Thrift Server is listening.

For more information about using this driver, contact IBM Support.

Accessing the Spark SQL Thrift Server with the Simba JDBC driver

Was this helpful?

Give Feedback