Accessing the Apache Spark™ SQL Thrift Server with the Simba JDBC driver
The Simba JDBC Driver for Spark provides a standard JDBC interface to the data stored in DataStax Enterprise when the Spark SQL Thrift Server is running.
Your DSE 5.1 license includes a license to use the Simba drivers.
Prerequisites
- A running DSE Analytics cluster with Spark enabled
- One node in the cluster running the Spark SQL Thrift Server
Procedure
- Expand the ZIP file containing the driver.
- In your JDBC application, configure the following:
  - Add SparkJDBC41.jar and the rest of the JAR files included in the ZIP file to your classpath.
  - The JDBC driver class is com.simba.spark.jdbc41.Driver and the JDBC data source is com.simba.spark.jdbc41.DataSource.
  - Set the connection URL to jdbc:spark://<hostname>:<port>, such as jdbc:spark://node1.example.com:10000. <hostname> is the hostname of the node on which the Spark SQL Thrift Server is running. <port> is the port number on which the Spark SQL Thrift Server is listening.
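The configuration steps above can be sketched in a small Java program. This is a minimal illustration, not an official sample: the class name SparkThriftExample, the helper methods buildUrl and listDatabases, and the SHOW DATABASES query are assumptions chosen for the example. It also assumes the Thrift Server accepts connections without credentials; a secured cluster would likely need additional connection properties.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

// Hypothetical example class; only the driver class name and URL format come from the procedure.
final class SparkThriftExample {
    // Driver class name as given in the procedure.
    static final String DRIVER_CLASS = "com.simba.spark.jdbc41.Driver";

    // Builds a URL of the form jdbc:spark://<hostname>:<port>.
    static String buildUrl(String hostname, int port) {
        return "jdbc:spark://" + hostname + ":" + port;
    }

    // Opens a connection and prints the first column of each row returned by SHOW DATABASES.
    static void listDatabases(String url) throws SQLException {
        try (Connection conn = DriverManager.getConnection(url);
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SHOW DATABASES")) {
            while (rs.next()) {
                System.out.println(rs.getString(1));
            }
        }
    }

    public static void main(String[] args) {
        // Defaults mirror the example URL in the procedure; override with command-line arguments.
        String host = args.length > 0 ? args[0] : "node1.example.com";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 10000;
        String url = buildUrl(host, port);
        try {
            // Fails with ClassNotFoundException if the driver JARs are not on the classpath.
            Class.forName(DRIVER_CLASS);
            listDatabases(url);
        } catch (ClassNotFoundException e) {
            System.err.println("Driver not on classpath: " + DRIVER_CLASS);
        } catch (SQLException e) {
            System.err.println("Could not query " + url + ": " + e.getMessage());
        }
    }
}
```

To try it, compile the class and run it with SparkJDBC41.jar and the other JAR files from the ZIP file on the classpath, passing the Thrift Server hostname and port as arguments.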