Cloud Data Science: Data Cleanup with Azure Machine Learning Studio



Overview/Description

Discover how to clean up your data in Azure Machine Learning Studio using filters, SQL transformations, and the features for handling missing and duplicate data.



Expected Duration (hours)
1.2

Lesson Objectives

  • use the filter modules of Azure Machine Learning Studio to transform your data
  • use the count modules of Azure Machine Learning Studio to summarize the important information in your data set
  • use the Apply SQL Transformation module of Azure Machine Learning Studio to specify a SQL query on your input dataset
  • use the Clean Missing Data module of Azure Machine Learning Studio to remove or replace missing values
  • use the Remove Duplicate Rows module of Azure Machine Learning Studio to remove duplicates from your dataset
  • use the SMOTE module of Azure Machine Learning Studio to increase the number of underrepresented cases in your dataset
  • use the Clip Values module of Azure Machine Learning Studio to detect outliers and clip or replace their values
  • use the Group Data into Bins module of Azure Machine Learning Studio to change the distribution of continuous data
  • use the Normalize Data module of Azure Machine Learning Studio to transform a dataset through normalization
  • apply cleanup modules to your datasets in Azure Machine Learning Studio
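
For orientation, here is a minimal sketch of what a few of the cleanup steps listed above do to a dataset, written in plain Python rather than with the Studio modules themselves. The function names, the sample rows, and the `id` and `age` columns are all hypothetical and chosen only for illustration. The sketch is not the Studio implementation and covers only duplicate removal, mean replacement of missing values, clipping to fixed thresholds, and min-max normalization.

```python
from statistics import mean

# All names below are hypothetical; this only mimics the general idea of
# four cleanup steps and is not how Azure Machine Learning Studio works internally.

def remove_duplicate_rows(rows, key_columns):
    """Keep the first row seen for each combination of key column values."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row[c] for c in key_columns)
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique

def fill_missing_with_mean(rows, column):
    """Replace None in a column with the mean of the values that are present."""
    present = [r[column] for r in rows if r[column] is not None]
    fill = mean(present)
    return [{**r, column: fill if r[column] is None else r[column]} for r in rows]

def clip_values(rows, column, lower, upper):
    """Force values outside [lower, upper] back to the nearest threshold."""
    return [{**r, column: min(max(r[column], lower), upper)} for r in rows]

def min_max_normalize(rows, column):
    """Rescale a numeric column linearly onto the range [0, 1]."""
    values = [r[column] for r in rows]
    lo, hi = min(values), max(values)
    span = hi - lo
    return [{**r, column: 0.0 if span == 0 else (r[column] - lo) / span} for r in rows]

def clean(rows):
    """Apply the four steps in a fixed order to the assumed 'id' and 'age' columns."""
    rows = remove_duplicate_rows(rows, ["id"])
    rows = fill_missing_with_mean(rows, "age")
    rows = clip_values(rows, "age", 0, 100)
    return min_max_normalize(rows, "age")

sample = [
    {"id": 1, "age": 20},
    {"id": 1, "age": 20},    # duplicate row
    {"id": 2, "age": None},  # missing value
    {"id": 3, "age": 250},   # outlier
    {"id": 4, "age": 60},
]
cleaned = clean(sample)
```

The order of the steps matters in this sketch: because missing values are filled before clipping, the outlier inflates the replacement mean. Clipping first would give a different fill value, which is the kind of choice the cleanup modules leave to you.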

Course Number
it_dfpdsm_06_enus

Expertise Level
Expert