Hadoop Distributed File System


Overview/Description
Target Audience
Prerequisites
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description
This course covers the HDFS architecture and its main building blocks. In addition, subjects such as data replication, communication protocols, and accessibility are introduced.

Target Audience
Individuals who wish to understand key concepts and features of Hadoop and its tools

Prerequisites
None

Expected Duration (hours)
0.6

Lesson Objectives

Hadoop Distributed File System

  • start the course
  • provide an overview of the HDFS architecture and its main building blocks
  • list considerations for the HDFS architecture, such as hardware failure, large data sets, and the coherency model
  • describe NameNode and DataNodes in HDFS
  • describe the file system namespace
  • provide an overview of data replication
  • list considerations relating to robustness
  • describe the various HDFS communication protocols
  • describe data organization considerations such as data blocks and replication pipelining
  • list accessibility features such as FS Shell, DFSAdmin, and Browser Interface
  • describe space reclamation considerations such as file deletes and replication factors
  • work with the HDFS architecture
  • Course Number:
    df_hdra_a01_it_enus

    Expertise Level
    Intermediate