What is a machine learning workflow in this course?

In this course, a machine learning workflow means turning raw data into usable model results through a repeatable sequence of preparation, modeling, and evaluation. The emphasis is on core foundations like prediction, pattern discovery, feature preparation, and time-based forecasting so you can see how the pieces fit together.

When would you use a machine learning workflow?

You would use a machine learning workflow when you need a structured way to move from raw data to a prediction, grouping, anomaly-finding, or forecast. In this course, it is used for problems where choosing a method and checking its results matters more than relying on intuition alone.

How does a machine learning workflow fit into a broader data workflow?

It sits between collecting data and using model outputs, giving you a clear process for preparing inputs, training methods, and judging results. The course treats it as the link between data preparation and applied tasks like prediction, pattern discovery, and forecasting.

How is a machine learning workflow different from traditional data analysis?

Traditional data analysis is mainly about describing what is already in the data, while a machine learning workflow is about learning patterns that can be applied to new cases. In this course, that means going beyond summaries and charts to train, test, and interpret models.

Do you need any prerequisites before learning a machine learning workflow?

A basic understanding of data analysis and Python-based work is helpful, because the course focuses on applying machine learning methods rather than only defining them. What matters most is being able to work with tabular data, follow a modeling process, and interpret results.

What tools, platforms, or methods are used in this course?

The course uses Python-based tools, especially Pandas for working with data and Scikit-learn for building and evaluating models. It also introduces forecasting-focused libraries for time series work.

What specific tasks will you practice or complete in this course?

You'll practice preparing data, building prediction models, exploring unlabeled data for groups or unusual cases, and creating forecasts from time-based patterns. Across those tasks, the course keeps the focus on following a repeatable machine learning workflow from input data to evaluated output.

Foundations of Machine Learning

This course is part of multiple programs.

Instructor: Professionals from the Industry

8,511 already enrolled

Included with

Learn more

4 modules

Gain insight into a topic and learn the fundamentals.

15 reviews

Intermediate level

Recommended experience

3 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

4 modules

Gain insight into a topic and learn the fundamentals.

15 reviews

Intermediate level

Recommended experience

3 weeks to complete

at 10 hours a week

Flexible schedule

Learn at your own pace

Skills you'll gain

Tools you'll learn

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

20 assignments

Taught in English

91% of learners achieved a positive career outcome

See how employees at top companies are mastering in-demand skills

Learn more about Coursera for Business

logos of Petrobras, TATA, Danone, Capgemini, P&G and L'Oreal

Build your subject-matter expertise

This course is available as part of

When you enroll in this course, you'll also be asked to select a specific program.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 4 modules in this course

Welcome to the Foundations of Machine Learning, your practical guide to fundamental techniques powering data-driven solutions. Master key ML domains—supervised learning (prediction), unsupervised learning (pattern discovery), data preprocessing & feature engineering, and time series forecasting—using Pandas, Scikit-learn, Statsmodels, and Prophet to tackle real-world challenges.

By the end of this course, you'll be able to: - Implement and evaluate key supervised models (e.g., regression, classification, Tree-based models & SVMs) for prediction. - Apply unsupervised methods (e.g., K-Means, Isolation Forest) for segmentation and anomaly detection. - Perform robust data preprocessing: handle missing data, encode categoricals, scale features, and apply dimensionality reduction (PCA). - Build and analyze time series forecasts with ARIMA, Exponential Smoothing, Holt-Winters and Prophet. Through hands-on exercises and a capstone customer purchase prediction project, you'll develop versatile skills to confidently address common machine learning challenges.

Module details

Welcome to supervised learning, the foundation of modern machine learning! In this module, you'll master essential algorithms such as linear regression, logistic regression, decision trees, and support vector machines (SVMs) that form the backbone of predictive analytics. We'll guide you through hands-on implementations using industry-standard tools like Scikit-learn, helping you build models that can predict outcomes with impressive accuracy. By the end of this module, you'll be able to select the right algorithm for different problems, train and evaluate models effectively, and interpret their results to drive data-informed decisions.

What's included

13 videos10 readings6 assignments4 ungraded labs

13 videosTotal 67 minutes

Welcome to the Course3 minutes
Regression in Action: Predicting Sales From Advertising 6 minutes
Classification in Action: Predicting Diabetes From Patient Data5 minutes
Understanding Regression Through a Real-World Example6 minutes
Script-Building and Evaluating a Simple Linear Regression Model6 minutes
Getting Started with Logistic Regression for Binary Classification6 minutes
Evaluating Binary Classification Models with Logistic Regression6 minutes
How Decision Trees Make Predictions in Healthcare4 minutes
Evaluating Decision Tree Performance and Avoiding Overfitting5 minutes
Improving Model Accuracy with Random Forests5 minutes
Using SVMs to Recognize Handwritten Digits5 minutes
How SVMs Make Decisions: Margins and Support Vectors4 minutes
Using the RBF Kernel to Improve Classification5 minutes

10 readingsTotal 85 minutes

What Is Supervised Learning?10 minutes
How Supervised Models Are Trained and Used in Real Life7 minutes
What Is Linear Regression and How Does It Work? 7 minutes
Evaluating a Linear Regression Model10 minutes
What Is Logistic Regression and Why Do We Use It? 10 minutes
How Do We Know If Our Classification Model Works?10 minutes
How Do Decision Trees Work?8 minutes
Decision Trees: Pros, Cons, and an Alternative8 minutes
How Support Vector Machines Make Decisions 7 minutes
Understanding the Kernel Trick in SVMs8 minutes

6 assignmentsTotal 105 minutes

Supervised Learning Mastery30 minutes
Knowledge Check: Supervised Learning Basics15 minutes
Knowledge Check: Linear Regression Key Concepts15 minutes
Knowledge Check: Logistic Regression Key Concepts15 minutes
Knowledge Check: Decision Trees & Random Forests Key Concepts15 minutes
Knowledge Check: SVM Key Concepts15 minutes

4 ungraded labsTotal 240 minutes

Predicting House Prices Using Linear Regression60 minutes
Predicting Loan Approval Using Logistic Regression60 minutes
Attrition Prediction Using Decision Trees & Random Forests60 minutes
Classifying Handwritten Digits Using SVMs60 minutes

What do you do when your data doesn't have labeled examples? In this module, you'll explore unsupervised learning, where algorithms find structure and insights in data all on their own. You'll master clustering techniques like K-Means and hierarchical clustering to group similar customers, products, or behaviors, and learn how to detect anomalies that could represent fraud or unusual events. By the end of this module, you'll be equipped with powerful tools to uncover hidden insights in your data that supervised methods might miss, expanding your toolkit for real-world data science challenges.

What's included

10 videos8 readings5 assignments4 ungraded labs

10 videosTotal 44 minutes

What Makes Unsupervised Learning So Powerful3 minutes
How Netflix & Spotify Use Unsupervised Learning7 minutes
Exploring Unlabeled Data in Python6 minutes
Customer Segmentation: Seeing Natural Clusters in Your Data3 minutes
Clustering with K-Means: From Code to Customer Insights3 minutes
Choosing the Best K with the Elbow Method4 minutes
What Is Hierarchical Clustering and How Do We Visualize It?4 minutes
Hierarchical Clustering in Action: Python Implementation & Insights7 minutes
What Is Anomaly Detection? Exploring Credit Card Fraud Patterns3 minutes
Anomaly Detection with Isolation Forest in Python4 minutes

8 readingsTotal 52 minutes

What Is Unsupervised Learning?7 minutes
Anomaly Detection & Industry Applications7 minutes
How K-Means Clustering Works5 minutes
Choosing K and Limitations of K-Means8 minutes
What Is Hierarchical Clustering?5 minutes
Interpreting Dendrograms & Understanding Trade-offs5 minutes
What Is Anomaly Detection and Why Is It Different?5 minutes
Methods and Challenges in Anomaly Detection10 minutes

5 assignmentsTotal 90 minutes

Unsupervised Learning Mastery30 minutes
Knowledge Check: Unsupervised Learning Fundamentals15 minutes
Knowledge Check: K-Means Clustering Key Concepts15 minutes
Knowledge Check: Hierarchical Clustering Key Concepts15 minutes
Knowledge Check: Anomaly Detection Key Concepts15 minutes

4 ungraded labsTotal 240 minutes

Visualizing Customer Segmentation Data60 minutes
Segmenting Customers Using K-Means Clustering60 minutes
Grouping Airline Customers Using Hierarchical Clustering60 minutes
Detecting Credit Card Fraud with Isolation Forest60 minutes

Did you know that data preparation often determines model success more than algorithm selection? In this essential module, you'll learn the critical skills of data preprocessing and feature engineering that separate novice from professional data scientists. We'll guide you through handling missing data, encoding categorical variables, scaling features, and selecting the most important attributes that will make your models shine. By mastering these techniques, you'll dramatically improve your models' accuracy and reliability, ensuring they perform well on real-world messy data that would otherwise cause less-prepared models to fail.

What's included

11 videos7 readings5 assignments4 ungraded labs

11 videosTotal 45 minutes

Why Data Preprocessing & Feature Engineering Matter So Much3 minutes
Why Missing Data Breaks Models: The Problem in Action4 minutes
How Missing Data Affects Model Accuracy — and What to Do About It5 minutes
Why ML Models Can't Handle Raw Categorical Data5 minutes
Types of Categorical Variables and How to Encode Them3 minutes
Label Encoding and Model Performance Comparison5 minutes
Why Feature Scaling Matters in Machine Learning5 minutes
Scaling Your Data: Normalization with Min-Max Scaler3 minutes
Standardization with Z-Score Scaling + Impact on Model Performance3 minutes
Why Too Many Features Can Hurt Your Model3 minutes
Applying Feature Selection & PCA in Python5 minutes

7 readingsTotal 54 minutes

What Causes Missing Data—and Why It Matters5 minutes
How to Handle Missing Data in ML Pipelines8 minutes
Why We Encode Categorical Data in Machine Learning10 minutes
Choosing the Right Encoding Method for Your Data5 minutes
What Is Feature Scaling and Why It Matters in Machine Learning6 minutes
Why and How We Select the Right Features10 minutes
What Is Feature Extraction and When Should You Use It?10 minutes

5 assignmentsTotal 90 minutes

Data Preprocessing & Feature Engineering Mastery30 minutes
Knowledge Check: Handling Missing Data Key Concepts15 minutes
Knowledge Check: Encoding Categorical Variables Key Concepts15 minutes
Knowledge Check: Feature Scaling Key Concepts15 minutes
Knowledge Check: Feature Selection & PCA Key Concepts15 minutes

4 ungraded labsTotal 240 minutes

Cleaning a Customer Purchase Dataset60 minutes
Transforming Categorical Data for a Salary Prediction Model60 minutes
Scaling Features for a Loan Approval Model60 minutes
Reducing Features for a House Price Prediction Model60 minutes

Let's figure out how to properly make forecasts from time-based data! In this module, you'll learn specialized techniques for working with time-dependent data like stock prices, sales forecasts, and sensor readings that traditional ML approaches can't handle effectively. You'll implement practical forecasting models using tools like ARIMA, Exponential Smoothing, and Facebook Prophet, understanding how to identify trends, seasonality, and other temporal patterns. By the end of this module, you'll be able to build accurate forecasting systems that can predict future values based on historical patterns, adding a powerful and in-demand skill to your machine learning toolkit.

What's included

9 videos5 readings4 assignments1 programming assignment3 ungraded labs

9 videosTotal 43 minutes

Why Time Series Isn't Just Another Dataset11 minutes
What Makes Time Series Special: Trends, Seasonality & More4 minutes
Visualizing a Time Series in Python: Airline Passengers Example3 minutes
Decomposing Time Series into Trend, Seasonality, and Noise5 minutes
Why Regression Fails for Forecasting: A Retail Sales Example5 minutes
What Makes Forecasting Different: Let's Try ARIMA & Exponential Smoothing5 minutes
Getting Started with Facebook Prophet in Python3 minutes
Why Facebook Prophet Makes Forecasting Easy (and Powerful)5 minutes
Ready to Build Your Own ML System?2 minutes