Using Spark SQL in DataStax Studio
Writing, testing, and running Spark SQL queries against DSE clusters.
Analyze data stored in DSE clusters with Spark SQL relational queries. Spark SQL is a unified relational query language for querying distributed collections of data. It supports a variant of the SQL used in relational databases.
- Interactively perform Spark SQL queries against a DSE cluster
- Schema-aware content assist
- Syntax validations to facilitate faster prototyping
- The DSE cluster must be configured for the AlwaysOn SQL service.
- Be familiar with the supported syntax of Spark SQL.
- In DataStax Studio, Spark SQL statements must end in a semicolon (;).
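As a minimal sketch of a statement that could be run in a Spark SQL notebook cell, the query below counts rows per customer and ends with the required semicolon. The keyspace `demo_ks`, the table `orders`, and its columns are hypothetical names used only for illustration; substitute a keyspace and table that exist in your cluster.

```sql
-- Hypothetical keyspace and table (demo_ks.orders); replace with your own.
-- The statement ends with a semicolon, as Studio requires.
SELECT customer_id, COUNT(*) AS order_count
FROM demo_ks.orders
GROUP BY customer_id
ORDER BY order_count DESC
LIMIT 10;
```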
The notebook tutorial Working with SparkSQL is installed with Studio. The tutorial provides hands-on steps to create data and execute Spark SQL code in a notebook. It shows how to explore the SQL schema in schema view, how to use content assist for syntax and domain validation, and how to view results in table view and in different styles of charts.