Introduction to Computer Vision

This course is part of Computer Vision Specialization

Instructor: Tom Yeh

7,281 already enrolled

4 modules

Gain insight into a topic and learn the fundamentals.

31 reviews

Beginner level

Recommended experience

Flexible schedule

2 weeks at 10 hours a week

Learn at your own pace

Build toward a degree

What you'll learn

Understand the fundamental principles and algorithms of classical computer vision.
Apply deep learning models to various computer vision tasks.
Evaluate and implement computer vision solutions for real-world applications.

Details to know

Shareable certificate

Add to your LinkedIn profile

Assessments

23 assignments

Taught in English

Build your subject-matter expertise

This course is part of the Computer Vision Specialization

When you enroll in this course, you'll also be enrolled in this Specialization.

Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate

There are 4 modules in this course

Introduction to Computer Vision guides learners through the essential algorithms and methods that help computers 'see' and interpret visual data. You will first learn the core concepts and techniques traditionally used to analyze images. Then, you will learn modern deep learning methods, such as neural networks and specific models designed for image recognition, and how they can be used to perform more complex tasks like object detection and image segmentation. Additionally, you will learn about the creation and impact of AI-generated images and videos, and explore the ethical considerations of such technology.

This course can be taken for academic credit as part of CU Boulder’s MS in Data Science or MS in Computer Science degrees offered on the Coursera platform. These fully accredited graduate degrees offer targeted courses, short 8-week sessions, and pay-as-you-go tuition. Admission is based on performance in three preliminary courses, not academic history. CU degrees on Coursera are ideal for recent graduates or working professionals. Learn more:

MS in Data Science: https://www.coursera.org/degrees/master-of-science-data-science-boulder
MS in Computer Science: https://coursera.org/degrees/ms-computer-science-boulder

Welcome to Introduction to Computer Vision, the first course in the Computer Vision specialization. In this first module, you'll be introduced to how this course operates "by Hand" and "in Excel." Then, you'll build a foundation in image matrices and arrays to explore different image types: binary, grayscale, and RGB. Next, you'll transition into using functions to perform basic image operations such as addition, negation, and masking. You'll then be introduced to the concept of image transformation through linear algebra. Finally, you'll perform translation, scaling, and rotation matrix operations.
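
The operations named in this module can be sketched in a few lines of plain Python. This is a minimal illustration, not the course's workbook: it assumes 8-bit grayscale images stored as lists of rows, and it assumes 3x3 homogeneous matrices so that translation, scaling, and rotation can all be written as matrix multiplications. All function names are hypothetical.

```python
import math

def negate(image, max_value=255):
    """Negate a grayscale image (assumed 8-bit, stored as a list of rows)."""
    return [[max_value - pixel for pixel in row] for row in image]

def apply_mask(image, mask):
    """Keep pixels where the binary mask is 1 and zero out the rest."""
    return [[p * m for p, m in zip(row, mask_row)]
            for row, mask_row in zip(image, mask)]

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def translation(tx, ty):
    return [[1, 0, tx], [0, 1, ty], [0, 0, 1]]

def scaling(sx, sy):
    return [[sx, 0, 0], [0, sy, 0], [0, 0, 1]]

def rotation(theta):
    """Counter-clockwise rotation by theta radians about the origin."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0], [s, c, 0], [0, 0, 1]]

def transform_point(matrix, x, y):
    """Apply a 3x3 homogeneous transform to the point (x, y)."""
    column = matmul(matrix, [[x], [y], [1]])
    return column[0][0], column[1][0]

image = [[0, 128], [255, 64]]
mask = [[1, 0], [0, 1]]
negated = negate(image)           # [[255, 127], [0, 191]]
masked = apply_mask(image, mask)  # [[0, 0], [0, 64]]

# Pre-multiply the transforms: the rightmost matrix is applied first,
# so this scales by 2 and then translates by (3, 1).
combined = matmul(translation(3, 1), scaling(2, 2))
moved = transform_point(combined, 1, 1)  # (5, 3)
```

Because matrix multiplication is associative, the two transforms can be combined once and the resulting matrix reused for every point.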

What's included

34 videos, 9 readings, 8 assignments

34 videos Total 136 minutes

Meet Your Instructor 3 minutes
Image Overview 2 minutes
Image Array & Matrix 2 minutes
Binary Image & Byte Array 2 minutes
Double Image 3 minutes
RGB Image 5 minutes
LED Display 4 minutes
Byte image 32x32 3 minutes
Greyscale 4 minutes
RGB Image 32x32x3 3 minutes
LED Display 32x32 4 minutes
2D Image Function 3 minutes
Add Images 3 minutes
Solid Square 2 minutes
Add, Negate, and Multiply 3 minutes
Flip Axes 3 minutes
Linear Combination 3 minutes
Masking 3 minutes
Absolute Reference 12 minutes
L1 & L2 Function Examples 2 minutes
2D Gaussian 5 minutes
Array Formula 9 minutes
Pixels vs. Function vs. Points 6 minutes
Translate and Scale by Linear Combination 5 minutes
Matrix Multiplication 10 minutes
Translate and Scale Matrix 5 minutes
Multiple Transformations 3 minutes
Rotation Matrix 4 minutes
Matrix Multiplication Associativity 3 minutes
Matrix Multiplication in Excel 4 minutes
Linear Transformation 3 minutes
Scale and Translate in Excel 4 minutes
Rotate and Multiple Transformations 4 minutes
Pre-multiplied Transformation Matrix 3 minutes

9 readings Total 57 minutes

Course Updates and Accessibility Support 1 minute
Earn Academic Credit for your Work! 10 minutes
Course Support 10 minutes
Inside the Course 10 minutes
Assessment Expectations 10 minutes
AI Citation and Acknowledgement 10 minutes
Get the Workbook: Image 2 minutes
Get the Workbook: Function 2 minutes
Get the Workbook: Transform 2 minutes

8 assignments Total 155 minutes

Image by Hand 15 minutes
Image in Excel 15 minutes
Function by Hand 15 minutes
Function in Excel 15 minutes
Transform by Hand 15 minutes
Transform in Excel 15 minutes
AI Policy Quiz 5 minutes
Image, Function, and Transform 60 minutes

This module dives into feature extraction—quantitative measures that describe image content. Students compute features such as image mass, center, and statistical moments to describe the shape and structure of images. These are implemented both manually and in Excel. The module also explores how to compare images using distance metrics and similarity measures, offering insight into how visual data can be analyzed, categorized, and classified.
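
As a rough illustration of these features and comparisons, the sketch below assumes that image mass means the sum of all pixel values and that image center means the intensity-weighted centroid; the course may define them differently. It also shows L1 distance, L2 distance, and cosine similarity on flat feature vectors. The function names are hypothetical.

```python
import math

def image_mass(image):
    """Sum of all pixel values (assumed definition of image mass)."""
    return sum(sum(row) for row in image)

def image_center(image):
    """Intensity-weighted centroid as (row, column); requires nonzero mass."""
    mass = image_mass(image)
    row_center = sum(i * p for i, row in enumerate(image) for p in row) / mass
    col_center = sum(j * p for row in image for j, p in enumerate(row)) / mass
    return row_center, col_center

def l1_distance(a, b):
    """Sum of absolute differences between two vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))

def l2_distance(a, b):
    """Euclidean distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cosine_similarity(a, b):
    """Cosine of the angle between two nonzero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

image = [[0, 1, 0],
         [0, 2, 0],
         [0, 1, 0]]
mass = image_mass(image)      # 4
center = image_center(image)  # (1.0, 1.0)
```

Distance and similarity point in opposite directions: identical vectors have distance 0 but cosine similarity 1.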

What's included

23 videos, 2 readings, 5 assignments

23 videos Total 104 minutes

Image Mass 2 minutes
Image Center 5 minutes
First Moment 2 minutes
Second Moment 4 minutes
Image Gradients 8 minutes
Image Histogram 6 minutes
Image Batch, Mass, and Center 8 minutes
First Moment in Excel 4 minutes
Second Moment in Excel 4 minutes
Parameterized Moment Calculation 5 minutes
Image Gradient in Excel 8 minutes
Image Histogram in Excel 5 minutes
Histogram of Gradients (HOG) 5 minutes
Similarity vs. Distance 6 minutes
L1 and L2 Distance 2 minutes
L2 Normalization 3 minutes
Cosine Similarity 2 minutes
Cross Entropy 3 minutes
L1 and L2 Distance in Excel 2 minutes
L2 Normalization in Excel 2 minutes
L1 and L2 Distance Map 6 minutes
Cosine Similarity and Cross Entropy in Excel 4 minutes
Comparing Two Groups 10 minutes

2 readings Total 4 minutes

Get the Workbook: Feature 2 minutes
Get the Workbook: Compare 2 minutes

5 assignments Total 90 minutes

Feature by Hand 15 minutes
Feature in Excel 15 minutes
Compare by Hand 15 minutes
Compare in Excel 15 minutes
Feature and Compare 30 minutes

Filtering techniques are central to detecting patterns in images. This module introduces learners to 1D and 2D filters, covering foundational concepts like convolution, cross-correlation, and Gaussian smoothing. Through both manual and spreadsheet-based exercises, learners apply various filters (e.g., mean, Laplacian, Sobel) and morphological operations like dilation and erosion. These filtering methods enhance image features, detect edges, and prepare data for further processing.
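
The difference between cross-correlation and convolution, and the effect of dilation and erosion, can be seen on a 1D signal. The sketch below is an assumption-laden illustration rather than the course's method: it uses "valid" sliding windows with no padding for the filters, and for the morphological operations it simply truncates the window at the borders. The function names are hypothetical.

```python
def cross_correlate(signal, kernel):
    """Slide the kernel over the signal without flipping it (valid mode)."""
    steps = len(signal) - len(kernel) + 1
    return [sum(signal[i + k] * kernel[k] for k in range(len(kernel)))
            for i in range(steps)]

def convolve(signal, kernel):
    """Convolution is cross-correlation with the kernel reversed."""
    return cross_correlate(signal, kernel[::-1])

def dilate(binary, width=3):
    """Each output is the maximum over a window (truncated at the borders)."""
    half = width // 2
    return [max(binary[max(0, i - half): i + half + 1])
            for i in range(len(binary))]

def erode(binary, width=3):
    """Each output is the minimum over a window (truncated at the borders)."""
    half = width // 2
    return [min(binary[max(0, i - half): i + half + 1])
            for i in range(len(binary))]

signal = [0, 0, 1, 1, 1, 0, 0]
edge_kernel = [-1, 0, 1]       # asymmetric, so flipping changes the result
laplacian_kernel = [1, -2, 1]  # symmetric, so flipping changes nothing

correlated = cross_correlate(signal, edge_kernel)  # [1, 1, 0, -1, -1]
convolved = convolve(signal, edge_kernel)          # [-1, -1, 0, 1, 1]
laplacian = convolve(signal, laplacian_kernel)     # [1, -1, 0, -1, 1]

dilated = dilate([0, 0, 1, 0, 0])  # [0, 1, 1, 1, 0]
eroded = erode(dilated)            # [0, 0, 1, 0, 0]
```

For a symmetric kernel such as the Laplacian, convolution and cross-correlation give the same output; the two differ only when the kernel is asymmetric.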

What's included

26 videos, 2 readings, 5 assignments

26 videos Total 109 minutes

Overview and Scale 2 minutes
Sliding Window and Cross-Correlation 7 minutes
Convolution by Hand 3 minutes
Laplacian Filter by Hand 5 minutes
Shift Filter by Hand 3 minutes
ReLU & Maxpool by Hand 3 minutes
Scale and Sum Filter 3 minutes
Mean, Laplacian, and Shift Filter 6 minutes
Detection in Excel 3 minutes
Cross-Correlation and Convolution 7 minutes
Gaussian Filter 2 minutes
Parameterized Gaussian Filter 8 minutes
ReLU & Maxpool in Excel 3 minutes
Sliding Window by Hand 3 minutes
Dilate by Hand 4 minutes
Erode by Hand 3 minutes
Cross-Correlation for Filter 2D 5 minutes
Convolution for Filter 2D 4 minutes
Mean Filter for Filter 2D 3 minutes
Sliding Window in Excel 4 minutes
Dilate in Excel 4 minutes
Erode in Excel 3 minutes
Open and Close Filter 2D 8 minutes
Smoothing in Excel 6 minutes
Laplacian Filter in Excel 4 minutes
Sobel Filter in Excel 4 minutes

2 readings Total 4 minutes

Get the Workbook: Filter 1D 2 minutes
Get the Workbook: Filter 2D 2 minutes

5 assignments Total 90 minutes

Filter 1D by Hand 15 minutes
Filter 1D in Excel 15 minutes
Filter 2D by Hand 15 minutes
Filter 2D in Excel 15 minutes
Filter 1D & 2D 30 minutes

This module delves into key concepts of camera models and their role in computer vision and photogrammetry. You will learn about the Extrinsic Matrix, exploring how it defines the position and orientation of a camera in 3D space. Understand the Pinhole Camera Model, a simplified optical system that forms the basis for many computer vision applications, alongside the Intrinsic Matrix, which captures the internal parameters of the camera. Epipolar geometry is examined, with a focus on its significance in 3D reconstruction and stereo vision. The module covers the motivation behind epipolar geometry, breaking down its basic components, and explaining the Essential Matrix, which encapsulates the geometric relationship between camera views, as well as the Fundamental Matrix, a core component in epipolar geometry that represents the relationship between two cameras in stereo vision.
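
A small numeric sketch may help connect these matrices. It assumes the common convention X_cam = R X_world + t for the extrinsic step, a pinhole intrinsic matrix K with focal length 100 and principal point (50, 50), and the essential matrix E = [t]x R, so that x2^T E x1 = 0 for corresponding points expressed in each camera's frame. All values and function names are hypothetical, and the course may use different conventions.

```python
def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def world_to_camera(rotation, translation, point_world):
    """Extrinsic step: X_cam = R X_world + t (assumed convention)."""
    rotated = matvec(rotation, point_world)
    return [r + t for r, t in zip(rotated, translation)]

def camera_to_pixel(intrinsic, point_cam):
    """Intrinsic step: apply K, then divide by depth."""
    u, v, w = matvec(intrinsic, point_cam)
    return u / w, v / w

def skew(t):
    """Matrix [t]x such that [t]x v equals the cross product t x v."""
    tx, ty, tz = t
    return [[0, -tz, ty], [tz, 0, -tx], [-ty, tx, 0]]

def essential_matrix(rotation, translation):
    """E = [t]x R for a second camera with X2 = R X1 + t."""
    return matmul(skew(translation), rotation)

def epipolar_residual(essential, x1, x2):
    """x2^T E x1, which is 0 for a true correspondence."""
    return sum(a * b for a, b in zip(x2, matvec(essential, x1)))

IDENTITY = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
K = [[100, 0, 50], [0, 100, 50], [0, 0, 1]]  # hypothetical intrinsics
point_world = [1, 2, 10]

# Camera 1 sits at the world origin; camera 2 is shifted one unit along x.
cam1 = world_to_camera(IDENTITY, [0, 0, 0], point_world)  # [1, 2, 10]
t = [-1, 0, 0]
cam2 = world_to_camera(IDENTITY, t, point_world)          # [0, 2, 10]

pixel1 = camera_to_pixel(K, cam1)  # (60.0, 70.0)
pixel2 = camera_to_pixel(K, cam2)  # (50.0, 70.0)

E = essential_matrix(IDENTITY, t)
residual = epipolar_residual(E, cam1, cam2)  # 0
```

Both pixels land on the same image row, which is what the epipolar constraint predicts for a purely horizontal camera shift.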

What's included

15 videos, 3 readings, 5 assignments

15 videos Total 119 minutes

Orthographic Projection 9 minutes
World to Camera 11 minutes
Camera (3D) to Pixel (2D) 11 minutes
Extrinsic & Intrinsic Matrix 6 minutes
Motivation for Epipolar Geometry 8 minutes
Basic Components of Epipolar Geometry 12 minutes
Epipolar Constraints 7 minutes
Derive the Epipolar Constraint Equation 9 minutes
Object in the World 3 minutes
Two Camera System 11 minutes
Pixel to World 10 minutes
Epipolar Line 4 minutes
Pixels to Epipolar Lines 3 minutes
Epipolar Constraints (Camera) 8 minutes
Essential and Fundamental Matrix 7 minutes

3 readings Total 6 minutes

Get the Workbook: Camera 2 minutes
Get the Workbook: Epipolar Part 1 2 minutes
Get the Workbook: Epipolar Part 2 & 3 2 minutes

5 assignments Total 90 minutes

Camera 15 minutes
Epipolar Part 1 15 minutes
Epipolar Part 2 15 minutes
Epipolar Part 3 15 minutes
Camera and Epipolar 30 minutes

Earn a career certificate

Add this credential to your LinkedIn profile, resume, or CV. Share it on social media and in your performance review.

Build toward a degree

This course is part of the following degree program(s) offered by University of Colorado Boulder. If you are admitted and enroll, your completed coursework may count toward your degree learning and your progress can transfer with you.

Instructor

Instructor ratings

(6 ratings)

Tom Yeh

University of Colorado Boulder

4 Courses 18,967 learners

Offered by

University of Colorado Boulder

Learner reviews

5 stars: 75%
4 stars: 15.62%
3 stars: 6.25%
2 stars: 3.12%
1 star: 0%

Showing 3 of 31

Reviewed on Feb 21, 2026

The course was nice and easy until the last module where some lectures were presented in a very confused way.

Frequently asked questions

To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.