Accessing the Apache Spark™ SQL Thrift Server with the Simba JDBC driver

The Simba JDBC Driver for Spark provides a standard JDBC interface to the information stored in DataStax Enterprise with the Spark SQL Thrift Server running.

Your DSE 5.1 license includes a license to use the Simba drivers.

Prerequisites

A running DSE Analytics cluster with Spark enabled
One node in the cluster running the Spark SQL Thrift Server

Procedure

Download the Simba JDBC Driver for Apache Spark.
Expand the ZIP file containing the driver.
In your JDBC application, configure the following:
1. Add SparkJDBC41.jar and the rest of the JAR files included in the ZIP file in your classpath.
2. The JDBC driver class is com.simba.spark.jdbc41.Driver and the JDBC data source is com.simba.spark.jdbc41.DataSource.
3. Set the connection URL to jdbc:spark://<hostname>:<port>, such as jdbc:spark://node1.example.com:10000. <hostname> is the hostname of the node on which the Spark SQL Thrift Server is running. <port> is the port number on which the Spark SQL Thrift Server is listening.

Accessing the Apache Spark™ SQL Thrift Server with the Simba JDBC driver

Prerequisites

Procedure

See also

Was this helpful?

Give Feedback