Hadoop Maintenance and Distributions


Overview/Description
Target Audience
Prerequisites
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description
Distributions provide performance and functionality enhancements over the base open source code Apache provides. In this course, you'll learn about the various distributions available and common maintenance tasks in a Hadoop environment.

Target Audience
Individuals who wish to understand key concepts and features of Hadoop and its tools

Prerequisites
None

Expected Duration (hours)
0.7

Lesson Objectives

Hadoop Maintenance and Distributions

  • start the course
  • demonstrate how to perform metadata and data backups
  • create and delete snapshots
  • list common problems for Hadoop administrators
  • use the filesystem balancer tool to keep filesystem datanodes evenly balanced
  • remove a node from a Hadoop cluster
  • describe the benefits of distributions
  • list the components of a Cloudera distribution, including Impala, Crunch, Kite, and Cloudera Manager
  • name the components of a Hortonworks distribution, including Tez, Falcon, and Ambari
  • recall the benefits of the MapR distribution
  • perform Hadoop snapshot operations
  • Course Number:
    df_hdra_a05_it_enus

    Expertise Level
    Intermediate