Configuring the Apache Spark™ ODBC Driver (Windows)
Configure an ODBC data source for ODBC applications, including business intelligence (BI) tools like Tableau or Microsoft Excel.
Procedure
-
Choose either the 32 bit or 64 bit ODBC driver.
-
For the 32-bit driver, click Start > Program Files > Simba Spark ODBC Driver > 32 bit ODBC Data Source Administrator.
-
For the 64-bit driver, click Start > Program Files > Simba Spark ODBC Driver > 64 bit ODBC Data Source Administrator.
-
-
Click the Drivers tab to verify that the Simba Spark ODBC Driver is present.
-
Create either a User or System DSN (data source name) for your ODBC tool connection.
-
a. Click the User DSN or System DSN tab.
-
b. Click Add > Simba Spark ODBC Driver > Finish.
-
In Simba Spark ODBC Driver DSN Setup, enter the following:
Field Input Data Source Name
The name for your DSN.
Description
Optional longer description of your DSN.
Spark Server Type
SparkThriftServer (Spark 1.1 and later)
Host(s)
IP or hostname of your Spark SQL Thrift Server.
Port
Listening port for the Spark SQL Thrift Server (default 10000)
Database
Specify
default
to load all tables into the default database. Or pick a specific keyspace.Auth Mechanism
User Name
User Name
leave blank
-
Click Test.
The test results should indicate a successful connection.
-
-
For advanced configuration options, refer to the Installing Simba ODBC Driver for Apache Spark.
What’s next
Use the newly created data source in ODBC applications like Tableau and Microsoft Excel.
After the ODBC query is transmitted to the Spark SQL Thrift server, the appropriate Spark jobs are executed, then the data is returned via ODBC to the application.
To troubleshoot or understand the queries being executed at the Spark SQL Thrift server, open a web browser to the Spark Master web interface (http://node name:4040
) on the DSE cluster, click on the Thrift server application, then view the SQL tab.