Data Silos, Lakes, & Streams: Sources, Visualizations, & ETL Operations


Overview/Description
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description

This course discusses the transition of data warehousing to cloud-based solutions using the AWS (Amazon Web Services) cloud platform. You will explore Amazon Redshift, a fully managed petabyte-scale data warehouse service which forms part of the larger AWS cloud-computing platform. The 12-video course demonstrates how to create and configure an Amazon Redshift cluster; to load data into it from an S3 (simple storage service) bucket; and configure a Glue crawler for stored data. This course examines how to visualize the data stored in the data lake and how to perform ETL (extract, transform, load) operations on the data using Glue scripts. You will work with the DynamoDB, a NoSQL database service that supports key-value and document data structures. You will learn how to use AWS QuickSight, a high-performance business intelligence service which integrates seamlessly with Glue tables by using the Amazon Athena Query Service. Finally, you will configure jobs to run extract, transform, and load operations on data stored in our data lake.



Expected Duration (hours)
1.5

Lesson Objectives

Data Silos, Lakes, & Streams: Sources, Visualizations, & ETL Operations

  • Course Overview
  • configure a Redshift cluster to store data
  • load data into a Redshift cluster from S3 buckets
  • configure a JDBC connection on Glue to the Redshift cluster
  • crawl data on a Redshift cluster using a Glue crawler
  • crawl data stored in a DynamoDB table
  • configure the Amazon QuickSight business intelligence tool to visualize data
  • build charts and dashboards in QuickSight
  • define a job in Glue to perform ETL operations
  • run ETL scripts using Glue
  • perform ETL operations in Glue to backup data originally stored in Redshift
  • perform ETL operations in Glue to backup data originally stored in DynamoDB
  • recall how to use AWS services for visualizations and ETL
  • Course Number:
    it_dsdslsdj_03_enus

    Expertise Level
    Expert