When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 5 modules in this course
Introduction to Deep Learning provides a rigorous, concept-driven introduction to the models that power modern AI systems—from image recognition to large language models. You’ll build neural networks from first principles, understanding how forward passes, loss functions, and backpropagation enable learning. As the course progresses, you’ll train and regularize deep models, design convolutional networks for vision, model sequences with RNNs, LSTMs, and attention, and apply transformer-based architectures such as BERT, GPT, and Vision Transformers. You will also look at the latest trends in contrastive learning and CLIP. By combining mathematical foundations with practical application, this course equips you to understand, train, and use deep learning models with confidence.
This course can be taken for academic credit as part of CU Boulder’s Masters of Science in Computer Science (MS-CS), Master of Science in Artificial Intelligence (MS-AI), and Master of Science in Data Science (MS-DS) degrees offered on the Coursera platform. These fully accredited graduate degrees offer targeted courses, short 8-week sessions, and pay-as-you-go tuition. Admission is based on performance in three preliminary courses, not academic history. CU degrees on Coursera are ideal for recent graduates or working professionals. Learn more:
MS in Artificial Intelligence: https://www.coursera.org/degrees/ms-artificial-intelligence-boulder
MS in Computer Science: https://coursera.org/degrees/ms-computer-science-boulder
MS in Data Science: https://www.coursera.org/degrees/master-of-science-data-science-boulder
Welcome to Introduction to Deep Learning. This module builds the mathematical foundations of neural networks. Starting from linear models, you will learn about the artificial neuron and develop the mathematics of gradient descent and backpropagation. The focus is on understanding how and why neural networks work through the underlying math—covering the forward pass, loss functions, and the chain rule to show how information flows through networks and how they learn from data.
Academic Integrity and AI Use Policy for the Machine Learning Specialization•9 minutes
From Linear Regression to the Artificial Neuron•9 minutes
Activation Functions and Non-Linearity: The Mathematical Notation and Problem Setup•5 minutes
Activation Functions and Non-Linearity: Why Non-Linearity is Important•6 minutes
Activation Functions and Non-Linearity: Sigmoid Activation and its Gradient•10 minutes
Activation Functions and Non-Linearity: Rectified Linear Unit Activation and its Gradient•4 minutes
Activation Functions and Non-Linearity: Other Activations and How to Choose Among Them•4 minutes
Layers, Depth, and Forward Propagation•10 minutes
Matrix Notation and Dimensions•9 minutes
Loss Functions: MSE and Cross-Entropy•7 minutes
Gradient Descent: The Math of Optimization•8 minutes
The Chain Rule and Backpropagation•9 minutes
Backpropagation Through a Network•10 minutes
6 readings•Total 96 minutes
Course Updates and Accessibility Support•1 minute
Earn Academic Credit for Your Work! •10 minutes
Course Support•10 minutes
Assessment Expectations•5 minutes
Download the Recommended Reading for This Course•10 minutes
From Linear Models to Neural Networks - Recommended Reading•60 minutes
2 assignments•Total 35 minutes
AI Policy Quiz•5 minutes
Neural Network Foundations•30 minutes
1 programming assignment•Total 60 minutes
Lab 1: Building and Training Your First Neural Network in Keras•60 minutes
Training and Regularizing Neural Networks
Module 2•4 hours to complete
Module details
This module focuses on training neural networks effectively. Topics include optimization algorithms, hyperparameter tuning, and regularization techniques to prevent overfitting and achieve good generalization. You will compare different optimizers like SGD, momentum, and Adam, understand how learning rate and batch size affect training dynamics, and apply weight decay, dropout, early stopping, and batch normalization.
Training and Regularizing Neural Networks•30 minutes
1 programming assignment•Total 60 minutes
Lab 2: Applying Regularization to Improve Model Generalization•60 minutes
Convolutional Neural Networks for Image Recognition
Module 3•4 hours to complete
Module details
This module introduces you to convolutional neural networks (CNNs), the foundation of modern computer vision. Topics include how convolutional and pooling layers work, CNN architecture design, and practical techniques like data augmentation and transfer learning. The module covers classic architectures like VGG and ResNet and explains why CNNs outperform fully-connected networks on image data.
Introduction to CNNs - Recommended Reading•45 minutes
Training CNNs in Practice - Recommended Reading•30 minutes
1 assignment•Total 30 minutes
Convolutional Neural Networks for Image Recognition•30 minutes
1 programming assignment•Total 60 minutes
Lab 3: Training a CNN for Image Classification with Augmentation•60 minutes
Sequence Modeling – RNNs, LSTMs, and the Attention Mechanism
Module 4•3 hours to complete
Module details
This module covers sequence modeling, starting with recurrent neural networks (RNNs) and long short-term memory networks (LSTMs), then progressing to the attention mechanism—the key innovation that led to transformers. Topics include how RNNs maintain hidden states across time steps, why the vanishing gradient problem motivated LSTMs, and how attention allows models to focus on relevant parts of their input.
Sequence Modeling – RNNs, LSTMs, and the Attention Mechanism•30 minutes
1 programming assignment•Total 60 minutes
Lab 4: Building a Sequence Model with Attention•60 minutes
Transformers, Vision Transformers, and CLIP
Module 5•3 hours to complete
Module details
This final module covers the transformer architecture, which has revolutionized deep learning across domains. Topics include BERT and GPT as encoder-only and decoder-only variants, Vision Transformers (ViT) that apply attention to images, and CLIP for multimodal learning connecting vision and language. The module emphasizes applying pre-trained models to real tasks.
CU Boulder is a dynamic community of scholars and learners on one of the most spectacular college campuses in the country. As one of 34 U.S. public institutions in the prestigious Association of American Universities (AAU), we have a proud tradition of academic excellence, with five Nobel laureates and more than 50 members of prestigious academic academies.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.