When you enroll in this course, you'll also be asked to select a specific program.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 6 modules in this course
This course introduces you to one of the main types of modeling families of supervised Machine Learning: Classification. You will learn how to train predictive models to classify categorical outcomes and how to use error metrics to compare across different models. The hands-on section of this course focuses on using best practices for classification, including train and test splits, and handling data sets with unbalanced classes.
By the end of this course you should be able to:
-Differentiate uses and applications of classification and classification ensembles
-Describe and use logistic regression models
-Describe and use decision tree and tree-ensemble models
-Describe and use other ensemble methods for classification
-Use a variety of error metrics to compare and select the classification model that best suits your data
-Use oversampling and undersampling as techniques to handle unbalanced classes in a data set
Who should take this course?
This course targets aspiring data scientists interested in acquiring hands-on experience with Supervised Machine Learning Classification techniques in a business setting.
What skills should you have?
To make the most out of this course, you should have familiarity with programming on a Python development environment, as well as fundamental understanding of Data Cleaning, Exploratory Data Analysis, Calculus, Linear Algebra, Probability, and Statistics.
Logistic regression is one of the most studied and widely used classification algorithms, probably due to its popularity in regulated industries and financial settings. Although more modern classifiers might likely output models with higher accuracy, logistic regressions are great baseline models due to their high interpretability and parametric nature. This module will walk you through extending a linear regression example into a logistic regression, as well as the most common error metrics that you might want to use to compare several classifiers and select that best suits your business problem.
What's included
12 videos4 readings3 assignments2 app items
Show info about module content
12 videos•Total 86 minutes
Welcome•1 minute
Introduction: What is Classification?•6 minutes
Introduction to Logistic Regression•3 minutes
Classification with Logistic Regression•6 minutes
Logistic Regression with Multi-Classes•2 minutes
Implementing Logistic Regression Models•4 minutes
Confusion Matrix, Accuracy, Specificity, Precision, and Recall•7 minutes
Classification Error Metrics: ROC and Precision-Recall Curves•7 minutes
Implementing the Calculation of ROC and Precision-Recall Curves•4 minutes
[Optional] Logistic Regression Lab - Part 1•14 minutes
[Optional] Logistic Regression Lab - Part 2•17 minutes
[Optional] Logistic Regression Lab - Part 3•14 minutes
4 readings•Total 20 minutes
About this course•3 minutes
Optional: Download data assets•3 minutes
[Optional] Download Assets for Demo Lab: Logistic Regression •10 minutes
K Nearest Neighbors is a popular classification method because they are easy computation and easy to interpret. This module walks you through the theory behind k nearest neighbors as well as a demo for you to practice building k nearest neighbors models with sklearn.
What's included
8 videos1 reading3 assignments2 app items
Show info about module content
8 videos•Total 50 minutes
K Nearest Neighbors for Classification•5 minutes
K Nearest Neighbors Decision Boundary•4 minutes
K Nearest Neighbors Distance Measurement•6 minutes
K Nearest Neighbors Pros and Cons•3 minutes
K Nearest Neighbors with Feature Scaling•6 minutes
[Optional] K Nearest Neighbors Notebook - Part 1•10 minutes
[Optional] K Nearest Neighbors Notebook - Part 2•6 minutes
[Optional] K Nearest Neighbors Notebook - Part 3•11 minutes
1 reading•Total 1 minute
Summary/Review•1 minute
3 assignments•Total 32 minutes
Module 2 Graded Quiz - KNN•24 minutes
K Nearest Neighbors•3 minutes
K Nearest Neighbors Labs•5 minutes
2 app items•Total 60 minutes
Demo Lab: K Nearest Neighbors•30 minutes
Practice Lab: K Nearest Neighbors•30 minutes
Support Vector Machines
Module 3•3 hours to complete
Module details
This module will walk you through the main idea of how support vector machines construct hyperplanes to map your data into regions that concentrate a majority of data points of a certain class. Although support vector machines are widely used for regression, outlier detection, and classification, this module will focus on the latter.
What's included
12 videos1 reading4 assignments2 app items
Show info about module content
12 videos•Total 67 minutes
Introduction to Support Vector Machines•4 minutes
Classification with Support Vector Machines•3 minutes
The Support Vector Machines Cost Function•5 minutes
Regularization in Support Vector Machines•7 minutes
Introduction to Support Vector Machines Gaussian Kernels•2 minutes
Support Vector Machines Gaussian Kernels - Part 1•4 minutes
Support Vector Machines Gaussian Kernels - Part 2•4 minutes
Support Vector Machines Workflow•5 minutes
Implementing Support Vector Machines Kernal Models•4 minutes
[Optional] Support Vector Machines Notebook - Part 1•9 minutes
[Optional] Support Vector Machines Notebook - Part 2•9 minutes
[Optional] Support Vector Machines Notebook - Part 3•11 minutes
1 reading•Total 2 minutes
Summary/Review•2 minutes
4 assignments•Total 41 minutes
Module 3 Graded Quiz: Support Vector Machines•30 minutes
Support Vector Machines•5 minutes
Support Vector Machines Kernels•3 minutes
Support Vector Machines Labs•3 minutes
2 app items•Total 50 minutes
Demo Lab: Support Vector Machines•20 minutes
Practice Lab: Support Vector Machines•30 minutes
Decision Trees
Module 4•3 hours to complete
Module details
Decision tree methods are a common baseline model for classification tasks due to their visual appeal and high interpretability. This module walks you through the theory behind decision trees and a few hands-on examples of building decision tree models for classification. You will realize the main pros and cons of these techniques. This background will be useful when you are presented with decision tree ensembles in the next module.
What's included
9 videos2 readings3 assignments2 app items
Show info about module content
9 videos•Total 60 minutes
Overview of Classifiers•3 minutes
Introduction to Decision Trees•6 minutes
Building a Decision Tree•7 minutes
Entropy-based Splitting•3 minutes
Other Decision Tree Splitting Criteria•5 minutes
Pros and Cons of Decision Trees•6 minutes
[Optional] Decision Trees Notebook - Part 1•7 minutes
[Optional] Decision Trees Notebook - Part 2•9 minutes
[Optional] Decision Trees Notebook - Part 3•16 minutes
2 readings•Total 13 minutes
[Optional] Download Assets for Demo Lab: Decision Trees •10 minutes
Summary/Review•3 minutes
3 assignments•Total 37 minutes
Module 4 Graded Quiz: Decision Trees•30 minutes
Decision Trees•4 minutes
Decision Trees Labs•3 minutes
2 app items•Total 60 minutes
Demo Lab: Decision Trees•30 minutes
Practice Lab: Decision Trees•30 minutes
Ensemble Models
Module 5•9 hours to complete
Module details
Ensemble models are a very popular technique as they can assist your models be more resistant to outliers and have better chances at generalizing with future data. They also gained popularity after several ensembles helped people win prediction competitions. Recently, stochastic gradient boosting became a go-to candidate model for many data scientists.
This model walks you through the theory behind ensemble models and popular tree-based ensembles.
What's included
15 videos3 readings6 assignments7 app items
Show info about module content
15 videos•Total 93 minutes
Ensemble Based Methods and Bagging - Part 1•2 minutes
Ensemble Based Methods and Bagging - Part 2•2 minutes
Ensemble Based Methods and Bagging - Part 3•3 minutes
Random Forest•7 minutes
[Optional] Bagging Notebook - Part 1•7 minutes
[Optional] Bagging Notebook - Part 2•7 minutes
[Optional] Bagging Notebook - Part 3•10 minutes
Boosting and Stacking•4 minutes
Overview of Boosting•3 minutes
Adaboost and Gradient Boosting Overview•7 minutes
Adaboost and Gradient Boosting Syntax•4 minutes
Stacking•7 minutes
[Optional] Boosting Notebook - Part 1•7 minutes
[Optional] Boosting Notebook - Part 2•16 minutes
[Optional] Boosting Notebook - Part 3•5 minutes
3 readings•Total 16 minutes
[Optional] Download Assets for Demo Lab: Bagging •3 minutes
[Optional] Download Assets for Demo Lab: Boosting and Stacking •3 minutes
Summary/Review•10 minutes
6 assignments•Total 51 minutes
Module 5 Graded Quiz•30 minutes
Bagging•5 minutes
Random Forest•3 minutes
Bagging Labs•3 minutes
Boosting and Stacking•5 minutes
Boosting and Stacking Labs•5 minutes
7 app items•Total 360 minutes
Practice Lab: Random Forest•45 minutes
Demo Lab: Bagging•30 minutes
Practice Lab: Bagging•45 minutes
Demo Lab: Boosting and Stacking•45 minutes
Practice Lab: Ada Boost•45 minutes
Practice Lab: Stacking For Classification with Python•45 minutes
Practice Lab: (Optional) Gradient Boosting•105 minutes
Modeling Unbalanced Classes
Module 6•4 hours to complete
Module details
Some classification models are better suited than others to outliers, low occurrence of a class, or rare events. The most common methods to add robustness to a classifier are related to stratified sampling to re-balance the training data. This module will walk you through both stratified sampling methods and more novel approaches to model data sets with unbalanced classes.
At IBM, we know how rapidly tech evolves and recognize the crucial need for businesses and professionals to build job-ready, hands-on skills quickly. As a market-leading tech innovator, we’re committed to helping you thrive in this dynamic landscape. Through IBM Skills Network, our expertly designed training programs in AI, software development, cybersecurity, data science, business management, and more, provide the essential skills you need to secure your first job, advance your career, or drive business success. Whether you’re upskilling yourself or your team, our courses, Specializations, and Professional Certificates build the technical expertise that ensures you, and your organization, excel in a competitive world.
"To be able to take courses at my own pace and rhythm has been an amazing experience. I can learn whenever it fits my schedule and mood."
Jennifer J.
Learner since 2020
"I directly applied the concepts and skills I learned from my courses to an exciting new project at work."
Larry W.
Learner since 2021
"When I need courses on topics that my university doesn't offer, Coursera is one of the best places to go."
Chaitanya A.
"Learning isn't just about being better at your job: it's so much more than that. Coursera allows me to learn without limits."
Learner reviews
4.8
462 reviews
5 stars
84.41%
4 stars
11.68%
3 stars
1.08%
2 stars
0.86%
1 star
1.94%
Showing 3 of 462
J
JK
5·
Reviewed on Sep 14, 2022
The course is well designed and easy to follow. (communication and feedback mechanism with Coursera could be improved).
A
AF
5·
Reviewed on Feb 5, 2023
Well-structured learning path. If you dont have previous python experience you can catch up after a couple of weeks as the workflow is similar regardless of the algorithmn you are using
A
AF
5·
Reviewed on Nov 7, 2020
Great course and very well structured. I'm really impressed with the instructor who give thorough walkthrough to the code.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Certificate?
When you enroll in the course, you get access to all of the courses in the Certificate, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.