Google Cloud Dataproc


Overview/Description
Target Audience
Prerequisites
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description
GCP provides fully managed cloud services for running Apache Spark and Hadoop. This course will introduce you to the concepts of cluster management with Dataproc, including machine types and workers.

Target Audience
Data professionals who are responsible for provisioning and optimizing big data solutions, and data enthusiasts getting started with Google Cloud Platform

Prerequisites
None

Expected Duration (hours)
0.7

Lesson Objectives

Google Cloud Dataproc

  • start the course
  • recognize big data concepts and solutions using GCP
  • define Cloud Dataproc and its benefits
  • recall the various ways to access Dataproc
  • describe the various areas of the dashboard and create a project
  • recognize the process for creating a cluster in Dataproc
  • recall the process for deleting a cluster using Dataproc
  • define master and worker nodes in Dataproc
  • describe custom machine types and preemptible worker nodes
  • define the processes for identity and access management with permissions and IAM roles
  • recognize the basic concepts of cluster management in Dataproc
  • Course Number:
    cl_gcde_a05_it_enus

    Expertise Level
    Intermediate