Managing HDInsight Data, Jobs, and Security



Overview/Description
In this course, you will learn how to manage HDInsight data, jobs, and security. The course is one in a series that prepares learners for exam 70-775: Perform Data Engineering on Microsoft Azure HDInsight.

Target Audience
IT professionals who implement and work with big data analytics and engineering workflows and use open-source technologies; IT professionals preparing for Microsoft exam 70-775

Prerequisites
None

Expected Duration (hours)
1.9

Lesson Objectives

Managing HDInsight Data, Jobs, and Security

  • start the course
  • copy data from the cloud to on-premises systems and vice versa
  • describe Azure Data Lake and how to store data
  • describe Azure Blob storage and how to store data
  • describe how to ingest data in Apache Hive and Apache Spark using Apache Sqoop
  • ingest data in Apache Hive and Apache Spark using Azure Data Factory (ADF)
  • describe how to ingest data in Apache Hive and Apache Spark using AzCopy
  • describe how to ingest data in Apache Hive and Apache Spark using AdlCopy
  • describe what Apache Hadoop YARN is and how it works
  • describe how the YARN ResourceManager UI works
  • use the YARN CLI to kill an HDInsight job
  • view logs for different types of HDInsight jobs
  • debug HDInsight jobs, such as Hadoop and Spark jobs
  • describe the Azure Operations Management Suite (OMS)
  • manage HDInsight users, groups, and permissions
  • configure Kerberos on HDInsight
  • configure service accounts on HDInsight
  • describe how to implement SSH tunneling
  • identify how to restrict access to HDInsight data
  • manage HDInsight users

Course Number
df_mahd_a03_it_enus

Expertise Level
Intermediate