Configuring the Spark ODBC Driver (Windows)

Add a Simba ODBC Driver for Apache Spark data source on Windows.

Configure an ODBC data source for use by ODBC applications, including business intelligence (BI) tools like Tableau or Microsoft Excel.

Procedure

  1. Open the ODBC Data Source Administrator for either the 32-bit or the 64-bit ODBC driver.
    1. For the 32-bit driver, click Start > Program Files > Simba Spark ODBC Driver > 32 bit ODBC Data Source Administrator.
    2. For the 64-bit driver, click Start > Program Files > Simba Spark ODBC Driver > 64 bit ODBC Data Source Administrator.
  2. Click the Drivers tab to verify that the Simba Spark ODBC Driver is present.
  3. Create either a User or System DSN (data source name) for your ODBC tool connection.
    1. Click the User DSN or System DSN tab.
    2. Click Add > Simba Spark ODBC Driver > Finish.
    3. In Simba Spark ODBC Driver DSN Setup, enter the following:
      Data Source Name: The name for your DSN.
      Description: An optional longer description of your DSN.
      Spark Server Type: SparkThriftServer (Spark 1.1 and later).
      Host(s): The IP address or hostname of your Spark SQL Thrift Server.
      Port: The listening port for the Spark SQL Thrift Server (default 10000).
      Database: Enter default to load all tables into the default database, or enter a specific keyspace.
      Auth Mechanism: User Name.
      User Name: Leave blank.
    4. Click Test.

      The test results should indicate a successful connection.

  4. For advanced configuration options, refer to the Simba ODBC Driver for Spark Installation Guide, which is installed at Start > Program Files > Simba Spark ODBC Driver.
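
The fields entered in the DSN Setup dialog can also be written as a DSN-less ODBC connection string, which some applications accept in place of a DSN name. The following is a minimal sketch, not the driver's documented interface: the key names (Host, Port, SparkServerType, AuthMech, Schema, UID) and the numeric codes are assumptions that should be checked against the installation guide, and the function name and example host are hypothetical.

```python
# Sketch: map the DSN Setup fields to a DSN-less ODBC connection string.
# ASSUMPTION: the key names and numeric codes below match the installed
# driver version; verify them in the Simba installation guide.

def build_connection_string(host, port=10000, database="default", user=""):
    """Return a semicolon-separated connection string for the given fields."""
    fields = [
        ("Driver", "Simba Spark ODBC Driver"),  # name shown on the Drivers tab
        ("Host", host),                         # Host(s)
        ("Port", str(port)),                    # Port (default 10000)
        ("SparkServerType", "3"),               # assumed code for SparkThriftServer
        ("AuthMech", "2"),                      # assumed code for User Name
        ("Schema", database),                   # Database
    ]
    if user:
        # The procedure leaves User Name blank, so UID is omitted by default.
        fields.append(("UID", user))
    return ";".join(f"{key}={value}" for key, value in fields)


# Hypothetical host address, for illustration only.
print(build_connection_string("10.0.0.5"))
```

Keeping the values in one function makes it easier to compare a failing connection string with the settings that passed the Test button in the dialog.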

What's next

Use the newly created data source in ODBC applications like Tableau and Microsoft Excel.

When the application sends an ODBC query to the Spark SQL Thrift server, the server runs the appropriate Spark jobs and returns the data to the application over ODBC.
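
The same round trip can be exercised from a script, which is a convenient way to confirm that the data source works outside a BI tool. The following is a minimal sketch that assumes the third-party pyodbc package is installed and that its bitness matches the DSN (32-bit or 64-bit). The function names are hypothetical, and autocommit is enabled on the assumption that the driver does not support transactions.

```python
# Sketch: run a query through an existing DSN with pyodbc.
# ASSUMPTION: pyodbc is installed (pip install pyodbc) and the Python
# interpreter has the same bitness as the DSN being used.

def dsn_connection_string(dsn_name):
    """Build the connection string for a DSN created in the procedure above."""
    return f"DSN={dsn_name}"


def fetch_rows(dsn_name, sql):
    """Send one SQL statement to the Thrift server and return all rows."""
    import pyodbc  # imported here so the helper above works without pyodbc

    # autocommit=True avoids transaction calls, assumed unsupported here.
    conn = pyodbc.connect(dsn_connection_string(dsn_name), autocommit=True)
    try:
        cursor = conn.cursor()
        cursor.execute(sql)
        return cursor.fetchall()
    finally:
        conn.close()
```

For example, calling fetch_rows("MySparkDSN", "SHOW TABLES"), where MySparkDSN stands for the Data Source Name you entered, should list the tables in the configured database.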

To troubleshoot or inspect the queries executed by the Spark SQL Thrift server, open the Spark Master web interface (http://node name:4040) on the DSE cluster in a web browser, click the Thrift server application, and then view the SQL tab.
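
If the web interface does not load or the DSN test fails, it can help to first rule out basic network problems. The following is a minimal sketch that only checks whether a TCP port accepts connections; it does not verify that the Thrift server or the web interface is healthy. The function name and the example host are hypothetical; the ports are the ones mentioned in this topic.

```python
import socket

# Sketch: check whether a TCP port on a node accepts connections.
# This confirms reachability only, not that the service behind it works.

def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name resolution failures.
        return False


if __name__ == "__main__":
    node = "node1.example.com"  # hypothetical node name; replace with your own
    for port in (10000, 4040):  # Thrift server port and web interface port
        print(port, "open" if port_is_open(node, port) else "unreachable")
```

A port reported as unreachable usually points to a firewall rule, a wrong host name, or a service that is not running, rather than to a problem in the DSN settings.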