Accessing the Apache Spark™ SQL Thrift Server with the Simba JDBC driver
The Simba JDBC Driver for Spark provides a standard JDBC interface to the data stored in DataStax Enterprise through a running Spark SQL Thrift Server.
Your DSE 5.1 license includes a license to use the Simba drivers.
- Make sure you have a running DSE Analytics cluster with Spark enabled, where one node in the cluster is running the Spark SQL Thrift Server.
- Contact DataStax Support to download the Simba JDBC Driver for Apache Spark.
- Expand the ZIP file containing the driver.
- In your JDBC application, configure the following:
  - Add SparkJDBC41.jar and the rest of the JAR files included in the ZIP file to your classpath.
  - The JDBC driver class is com.simba.spark.jdbc41.Driver and the JDBC data source is com.simba.spark.jdbc41.DataSource.
  - Set the connection URL to jdbc:spark://<hostname>:<port>, such as jdbc:spark://node1.example.com:10000. <hostname> is the hostname of the node on which the Spark SQL Thrift Server is running. <port> is the port number on which the Spark SQL Thrift Server is listening.
- For more information about using this driver, contact DataStax Support.
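
The configuration steps above can be sketched in Java as follows. This is a minimal illustration, not a definitive client: the class name SparkThriftExample and the helper buildUrl are hypothetical, the host and port defaults are the example values from the connection URL step, and the SHOW TABLES query is only a convenient smoke test. The sketch assumes the driver JAR files are on the classpath and that the server accepts connections without extra authentication properties, which may not hold for your cluster.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

// Hypothetical example class; not part of the Simba driver.
class SparkThriftExample {
    // Driver class name as given in the configuration steps.
    static final String DRIVER_CLASS = "com.simba.spark.jdbc41.Driver";

    // Builds a connection URL of the form jdbc:spark://<hostname>:<port>.
    static String buildUrl(String hostname, int port) {
        if (hostname == null || hostname.isEmpty()) {
            throw new IllegalArgumentException("hostname must not be empty");
        }
        if (port < 1 || port > 65535) {
            throw new IllegalArgumentException("port out of range: " + port);
        }
        return "jdbc:spark://" + hostname + ":" + port;
    }

    public static void main(String[] args) throws Exception {
        // Defaults mirror the example URL; override them on the command line.
        String host = args.length > 0 ? args[0] : "node1.example.com";
        int port = args.length > 1 ? Integer.parseInt(args[1]) : 10000;
        String url = buildUrl(host, port);

        // Throws ClassNotFoundException if the driver JARs are not on the classpath.
        Class.forName(DRIVER_CLASS);

        // Assumption: no user name, password, or other properties are required.
        try (Connection conn = DriverManager.getConnection(url);
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SHOW TABLES")) {
            while (rs.next()) {
                // The meaning of the first column depends on the Spark version.
                System.out.println(rs.getString(1));
            }
        }
    }
}
```

To try it, compile the class and run it with SparkJDBC41.jar and the other JAR files from the ZIP file on the classpath, passing the Thrift Server hostname and port as arguments.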