Raw Data to Insights: Data Ingestion & Statistical Analysis


Overview/Description
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description

Explore how statistical analysis can turn raw data into insights, and then examine how to use the data to improve business intelligence, in this 10-video course. Learn how to scrutinize and perform analytics on the collected data. The course explores several approaches for identifying values and insights from data by using various standard and intuitive principles, including data exploration and data ingestion, along with the practical implementation by using R. First, you will learn how to detect outliers by using R, and how to compare simple linear regression models, with and without outliers, to improve the quality of the data. Because today's data are available in diversified formats, with large volume and high velocity, this course next demonstrates how to use a variety of technologies: Apache Kafka, Apache NiFi, Apache Sqoop, and Wavefront (a program for simulating two-dimensional acoustic systems) to ingest data. Finally, you will learn how these tools can help users in data extraction, scalability, integration support, and security.



Expected Duration (hours)
0.9

Lesson Objectives

Raw Data to Insights: Data Ingestion & Statistical Analysis

  • Course Overview
  • describe how we can use statistical analysis to add value to data
  • recorgnize the concept of data correction along with the various essential approaches of implementing data correction which includes data detection localization, imputation and correction
  • demonstrate how we can facilitate outlier detection using R
  • describe the layered architecture of data from the perspective of data ingestion, prcoessing, and visualization
  • list and compare the various essential data ingestion tools that we can use to ingest data
  • set up Kafka and Apache NiFi to ingest data
  • demonstrate the steps involved in ingesting data from databases to Hadoop clusters using Sqoop
  • demonstrate how we can ingest data using WaveFront
  • detect outliers using R and ingest data using Apache NiFi and WaveFront
  • Course Number:
    it_dsrdindj_01_enus

    Expertise Level
    Intermediate