Apache Spark SQL


Overview/Description
Target Audience
Prerequisites
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description
In this course, you will be introduced to Apache Spark SQL, Datasets, and DataFrames.

Target Audience
Programmers and Developers wishing to perform big data development using Apache Spark 2.2.

Prerequisites
None

Expected Duration (hours)
1.2

Lesson Objectives

Apache Spark SQL

  • start the course
  • describe Apache Spark SQL
  • create a SparkSession
  • create DataFrames with Spark SQL
  • use aggregations with the built-in DataFrames functions
  • run SQL queries programmatically
  • create a global temporary view
  • create Datasets with Spark SQL
  • use JSON Datasets with Spark SQL
  • use Load/Save functions
  • manually specify a data source
  • run SQL directly on files
  • use SaveMode to handle save operations
  • write parquet files with Spark SQL
  • use Spark SQL to save a DataFrame as a persistent table
  • use partitioning when saving persistent tables
  • use Spark SQL to create Datasets and DataFrames
  • Course Number:
    df_apsk_a02_it_enus

    Expertise Level
    Intermediate