Managing HDInsight Data, Jobs, and Security
Overview/Description
Target Audience
Prerequisites
Expected Duration
Lesson Objectives
Course Number
Expertise Level
Overview/Description
In this course, you will learn how to manage HDInsight data, jobs, and security. The course is one in a series that prepares learners for exam 70-775: Perform Data Engineering on Microsoft Azure HDInsight.
Target Audience
IT professionals who implement and work with big data analytics and engineering workflows and use open-source technologies; IT professionals preparing for Microsoft exam 70-775
Prerequisites
None
Expected Duration (hours)
1.9
Lesson Objectives Managing HDInsight Data, Jobs, and Security
start the course
copy data from cloud to on-premise and vice versa
describe Azure Data Lake and how to store data
describe Azure Blob storage and how to store data
describe how to ingest data in Apache Hive and Apache Spark using Apache Sqoop
ingest data in Apache Hive and Apache Spark using Application Development Framework (ADF)
describe how to ingest data in Apache Hive and Apache Spark using AzCopy
describe how to ingest data in Apache Hive and Apache Spark using AdlCopy
describe what Apache Hadoop YARN is and how it works
describe how YARN ResourceManager UI works
use the YARN CLI to kill an HDInsight job
view logs for different types of HDInsight jobs
debug jobs such as Hadoop and Spark jobs
describe the Azure Operations Management Suite (OMS)
manage HDInsight users, groups, and permissions
configure Kerberos on HDInsight
configure service accounts on HDInsight
describe how to implement SSH tunneling
identify how to restrict access to HDInsight data
manage HDInsight users
Course Number: df_mahd_a03_it_enus
Expertise Level
Intermediate