Interactive Queries with Spark SQL and Interactive Hive


Overview/Description
Target Audience
Prerequisites
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description
In this course you will learn about implementing interactive queries with Spark SQL and Interactive Hive. It is one in a series of courses that prepares learners for exam 70-775: Perform Data Engineering on Microsoft Azure HDInsight.

Target Audience
IT professionals who implement and work with big data analytics and engineering workflows and use open-source technologies; IT professionals preparing for Microsoft exam 70-775

Prerequisites
None

Expected Duration (hours)
0.9

Lesson Objectives

Interactive Queries with Spark SQL and Interactive Hive

  • start the course
  • describe Spark SQL in HDInsight
  • run queries using Spark SQL
  • cache Spark DataFrames
  • read and write Parquet files in Spark
  • describe how to use BI tools with Apache Spark on HDInsight
  • optimize Spark SQL join types
  • manage Spark Thrift server
  • describe the different storage types for interactive queries
  • describe Interactive Hive
  • describe Hive LLAP and how to enable it through Hive settings
  • connect Interactive Hive clusters to BI tools
  • run Spark SQL queries
  • Course Number:
    df_mahd_a06_it_enus

    Expertise Level
    Intermediate