Analyzing Data Using Python: Cleaning & Analyzing Data in Pandas


Overview/Description
Expected Duration
Lesson Objectives
Course Number
Expertise Level



Overview/Description

For data analysis to be useful and accurate, the analyzed data needs to be cleaned and curated. There are copious methods to achieve this in pandas. In this course, you'll learn how to identify and eliminate duplicates in pandas.

You'll start by using the pandas cut method to discretize data into bins, using bins to plot histograms and identify outliers using box-and-whisker plots. You'll parse and work with datetime objects read in from strings and convert string columns to datetime using the dateutils python library.

Moving on, you'll master different pandas methods for aggregating data - including the groupby, pivot, and pivot_table methods. Lastly, you'll perform various joins - inner, left outer, right outer, and full outer - using both the merge and join methods.



Expected Duration (hours)
1.9

Lesson Objectives

Analyzing Data Using Python: Cleaning & Analyzing Data in Pandas

  • discover the key concepts covered in this course
  • identify and deal with duplicate records
  • summarize records into bins or categories
  • compute aggregations on data
  • perform common grouping and aggregation operations
  • use pivot tables to explore data
  • use pivot tables to summarize data
  • combine and merge records
  • perform inner join operations using the merge() method
  • perform left and right join operations using the merge() method
  • implement joins using the join() method
  • manipulate and analyze time series data
  • summarize the key concepts covered in this course
  • Course Number:
    it_daavlpdj_04_enus

    Expertise Level
    Intermediate