Classification, regression, and clustering are some of the most commonly used machine learning (ML) techniques and there are various algorithms available for these tasks. In this 10-video course, learners can explore their application in Pandas ML. First, examine how to load data from a CSV (comma-separated values) file into a Pandas data frame and prepare the data for training a classification model. Then use the scikit-learn library to build and train a LinearSVC classification model and evaluate its performance with available model evaluation functions. You will explore how to install Pandas ML and define and configure a ModelFrame, then compare training and evaluation in Pandas ML with equivalent tasks in scikit-learn. Learn how to build a linear regression model by using Pandas ML. Then evaluate a regression model by using metrics such as r-square and mean squared error, and visualize its performance with Matplotlib. Work with ModelFrames for feature extraction and encoding, and configure and build a clustering model with the K-Means algorithm, analyzing data clusters to determine unique characteristics. Finally, complete an exercise on regression, classification, and clustering.
Machine & Deep Learning Algorithms: Data Preperation in Pandas ML
Course Overview
load data from a CSV file into a Pandas dataframe and prepare the data for training a classification model
use the scikit-learn library to build and train a LinearSVC classification model and then evaluate its performance using the available model evaluation functions
install Pandas ML and then define and configure a ModelFrame
compare training and evaluation in Pandas ML with the equivalent tasks in scikit-learn
use Pandas for feature extraction and one-hot encoding, load its contents into a ModelFrame, and initialize and train a linear regression model
evaluate a regression model using metrics such as r-square and mean squared error and visualize its performance using Matplotlib
work with ModelFrames for feature extraction and label encoding
configure and build a clustering model using the K-Means algorithm and analyze data clusters to determine characteristics that are unique to them
distinguish between the use of scikit-learn and Pandas ML when training a model and identify some of the metrics used to evaluate a model