Hadoop Clusters


Overview/Description
Target Audience
Prerequisites
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description
Clusters are used to store and analyze large volumes of data in a distributed computer environment. This course outlines the best practices to follow when implementing clusters in Hadoop.

Target Audience
Individuals who wish to understand key concepts and features of Hadoop and its tools

Prerequisites
None

Expected Duration (hours)
0.9

Lesson Objectives

Hadoop Clusters

  • start the course
  • configure an Ubuntu server for ssh and Java for Hadoop
  • set up Hadoop on a single node
  • set up Hadoop on four nodes
  • describe the different cluster configurations, including single-rack deployments, three-rack deployments, and large-scale deployments
  • add a new node to an existing Hadoop cluster
  • format HDFS and configure common options
  • run an example mapreduce job to perform a word count
  • start a Hadoop cluster and run a mapreduce job
  • Course Number:
    df_hdra_a02_it_enus

    Expertise Level
    Intermediate