Hadoop Distributed File System
Overview/Description
Target Audience
Prerequisites
Expected Duration
Lesson Objectives
Course Number
Expertise Level
Overview/Description
This course covers the HDFS architecture and its main building blocks. In addition, subjects such as data replication, communication protocols, and accessibility are introduced.
Target Audience
Individuals who wish to understand key concepts and features of Hadoop and its tools
Prerequisites
None
Expected Duration (hours)
0.6
Lesson Objectives Hadoop Distributed File System
start the course
provide an overview of the HDFS architecture and its main building blocks
list considerations for the HDFS architecture, such as hardware failure, large data sets, and the coherency model
describe NameNode and DataNodes in HDFS
describe the file system namespace
provide an overview of data replication
list considerations relating to robustness
describe the various HDFS communication protocols
describe data organization considerations such as data blocks and replication pipelining
list accessibility features such as FS Shell, DFSAdmin, and Browser Interface
describe space reclamation considerations such as file deletes and replication factors
work with the HDFS architecture
Course Number: df_hdra_a01_it_enus
Expertise Level
Intermediate