When you enroll in this course, you'll also be enrolled in this Specialization.
Learn new concepts from industry experts
Gain a foundational understanding of a subject or tool
Develop job-relevant skills with hands-on projects
Earn a shareable career certificate
There are 3 modules in this course
As large language models revolutionize business operations, sophisticated attackers exploit AI systems through prompt injection, jailbreaking, and content manipulation—vulnerabilities that traditional security tools cannot detect. This intensive course empowers AI developers, cybersecurity professionals, and IT managers to systematically identify and mitigate LLM-specific threats before deployment. Master red-teaming methodologies using industry-standard tools like PyRIT, NVIDIA Garak, and Promptfoo to uncover hidden vulnerabilities through adversarial testing. Learn to design and implement multi-layered content-safety filters that block sophisticated bypass attempts while maintaining system functionality. Through hands-on labs, you'll establish resilience baselines, implement continuous monitoring systems, and create adaptive defenses that strengthen over time.
This course is designed for AI engineers, security professionals, data scientists, and developers interested in ensuring the safety and robustness of AI models. It’s also ideal for technology leaders seeking to implement secure, responsible AI frameworks within their organizations.
Learners should have a basic understanding of machine learning, AI model architecture, and programming concepts. No prior experience with AI red-teaming or safety systems is required.
By end of this course, you'll confidently conduct professional AI security assessments, deploy robust safety mechanisms, and protect LLM applications from evolving attack vectors in production environments.
This module introduces participants to the systematic creation and execution of red-teaming scenarios targeting large language models. Students learn to identify common vulnerability categories including prompt injection, jailbreaking, and data extraction attacks. The module demonstrates how to design realistic adversarial scenarios that mirror real-world attack patterns, using structured methodologies to probe LLM weaknesses. Hands-on demonstrations show how red-teamers simulate malicious user behavior to uncover security gaps before deployment.
What's included
4 videos2 readings1 peer review
Show info about module content
4 videos•Total 27 minutes
Welcome to Secure AI Red-Teaming & Safety Filters•3 minutes
Understanding AI Attack Vectors and Vulnerability Categories•5 minutes
Hands-On Vulnerability Discovery with Automated Tools•13 minutes
2 readings•Total 10 minutes
Welcome to the Course: Course Overview•5 minutes
LLM Red Teaming Guide (Open Source): Systematically Testing Large Language Models for Vulnerabilities•5 minutes
1 peer review•Total 15 minutes
Hands-On-Learning: Red-Team Assessment of ChatAssist Customer Service Bot•15 minutes
Content-Safety Filters: Implementation and Testing
Module 2•1 hour to complete
Module details
This module covers the design, implementation, and evaluation of content-safety filters for LLM applications. Participants explore multi-layered defense strategies including input sanitization, output filtering, and behavioral monitoring systems. The module demonstrates how to configure safety mechanisms that balance security with functionality, and shows practical testing methods to validate filter effectiveness against sophisticated bypass attempts. Real-world examples illustrate the challenges of maintaining robust content filtering while preserving user experience.
Implementing and Configuring Safety Filters for Production•8 minutes
Testing Filter Effectiveness Against Bypass Attempts•10 minutes
1 reading•Total 5 minutes
The Landscape of LLM Guardrails: Intervention Levels and Techniques•5 minutes
1 peer review•Total 20 minutes
Hands-On-Learning: Safety Filter Implementation for SecureChat Enterprise Bot•20 minutes
Testing LLM Resilience and Improving AI Robustness
Module 3•2 hours to complete
Module details
This module focuses on comprehensive resilience testing and systematic improvement of AI system robustness. Students learn to conduct thorough security assessments that measure LLM resistance to adversarial inputs, evaluate defense mechanism effectiveness, and identify areas for improvement. The module demonstrates how to establish baseline security metrics, implement iterative hardening processes, and validate improvements through continuous testing. Participants gain skills in developing robust AI systems that maintain integrity under real-world adversarial conditions.
What's included
4 videos1 reading1 assignment2 peer reviews
Show info about module content
4 videos•Total 31 minutes
Establishing Baseline Security Metrics and Resilience Benchmarks•6 minutes
Continuous Testing and Automated Vulnerability Assessment•7 minutes
Systematic Security Improvement and Adaptive Hardening•15 minutes
Coursera brings together a diverse network of subject matter experts who have demonstrated their expertise through professional industry experience or strong academic backgrounds. These instructors design and teach courses that make practical, career-relevant skills accessible to learners worldwide.
When will I have access to the lectures and assignments?
To access the course materials, assignments and to earn a Certificate, you will need to purchase the Certificate experience when you enroll in a course. You can try a Free Trial instead, or apply for Financial Aid. The course may offer 'Full Course, No Certificate' instead. This option lets you see all course materials, submit required assessments, and get a final grade. This also means that you will not be able to purchase a Certificate experience.
What will I get if I subscribe to this Specialization?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. Your electronic Certificate will be added to your Accomplishments page - from there, you can print your Certificate or add it to your LinkedIn profile.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If fin aid or scholarship is available for your learning program selection, you’ll find a link to apply on the description page.