OWASP GenAI Red Teaming Complete Guide

Rating: 4.2 (14 reviews)

Red Teaming RAG, APIs, and Multimodal Architectures

Created by Edcorner Learning

Last updated 5/2026

English

What you'll learn

Understand the full GenAI threat landscape across security, safety, and trust domains
Differentiate traditional red teaming from generative AI-specific red teaming approaches
Apply OWASP, NIST, and MITRE frameworks for AI threat modeling and risk categorization
Identify and exploit key GenAI attack surfaces (LLMs, agents, RAG pipelines, APIs)
Craft prompt injection, jailbreaks, and adversarial multi-turn exploits
Evaluate model responses for hallucinations, bias, toxicity, and alignment bypasses
Test implementation-level controls including content filters, RBAC, and vector store poisoning
Analyze runtime and agentic risks such as decision hijacking and over-reliance
Use tools like PyRIT and PromptBench to simulate real-world adversarial scenarios
Track and report red team metrics, scenario brittleness, and mitigation effectiveness
Design a cross-functional GenAI red team with defined roles, RACI matrices, and governance
Customize red teaming strategies for regional laws, cultural sensitivities, and industry sectors
Create and execute red team playbooks for scalable, automated evaluation pipelines
Close the loop: document, remediate, and communicate risks to stakeholders

Course content

10 sections • 40 lectures • 1h 23m total length

Introduction to GenAI and LLM Ecosystems • 1:42
Explore how generative AI creates new content using large language models and transformer-based architectures, and examine red teaming to ensure trust, safety, and alignment.
What is GenAI Red Teaming and Why It Matters • 1:34
Explore generative AI red teaming to uncover security, safety, and trust issues by testing model outputs and probing for data leakage, prompt injection, hallucinations, bias, and toxicity across model, system, and runtime layers.
Key Risks in Generative AI Systems • 1:47
Differences Between Traditional and GenAI Red Teaming • 1:42
Compare traditional red teaming with GenAI red teaming, shifting focus to model behavior, content generation, prompt manipulation, and socio-technical risks, while addressing model drift and ethical considerations.

OWASP & NIST Risk Categories (Security, Safety, Trust) • 1:40
Explore how the OWASP and NIST risk pillars (security, safety, and trust) frame vulnerabilities from data leakage and hallucinated facts to agent hijacking, guiding red teams across model behavior and training data.
Threat Modeling for AI Systems (STRIDE, MITRE ATLAS, NIST AI RMF) • 2:41
Attack Surfaces: LLMs, Agents, Multi-modal Inputs • 2:06
Explore how the model itself becomes an attack surface, and learn to trace data flows from input prompts to execution, guarding against manipulation across LLMs, agents, and multimodal inputs.
Risk Mapping with RAG Triad and Socio-technical Layers • 2:17
Assess the RAG triad (relevance, accuracy, and groundedness) to test for hallucinations, ensure data grounding and traceability, and evaluate socio-technical biases across edge cases.

Lifecycle and Blueprint Overview • 2:15
Apply a lifecycle-based red teaming framework for GenAI, aligned with lifecycle stages from acquisition to runtime, featuring four phases: model evaluation, implementation evaluation, system evaluation, and runtime evaluation.
Scoping the Engagement (Use Cases, Regulatory Priorities) • 2:06
Four-Phase Evaluation Model (Model, Implementation, System, Runtime) • 2:06
Red Teaming Metrics, Reporting, and Risk Dispositioning • 2:19
Learn to measure, report, and disposition red team findings with metrics covering prompt injection, data leakage, and hallucinations, and craft modular risk reports and remediation plans.
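
As a rough illustration of how such metrics might be tallied, the sketch below computes a per-category attack success rate from a list of findings. The `Finding` record, its field names, and the sample data are hypothetical and are not taken from the course material or from any specific tool.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Finding:
    """One red team test case result (hypothetical schema)."""
    category: str    # e.g. "prompt_injection", "data_leakage"
    succeeded: bool  # True if the attack achieved its goal


def attack_success_rates(findings):
    """Return {category: fraction of test cases where the attack succeeded}."""
    attempts = defaultdict(int)
    successes = defaultdict(int)
    for f in findings:
        attempts[f.category] += 1
        if f.succeeded:
            successes[f.category] += 1
    return {cat: successes[cat] / attempts[cat] for cat in attempts}


# Illustrative data only; real findings would come from actual test runs.
sample = [
    Finding("prompt_injection", True),
    Finding("prompt_injection", False),
    Finding("prompt_injection", False),
    Finding("prompt_injection", True),
    Finding("data_leakage", False),
    Finding("hallucination", True),
]
rates = attack_success_rates(sample)
```

A report could list these rates per release so that changes after a mitigation are visible; how a "successful" attack is judged is left to the scoring criteria of the engagement.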

Prompt Injection and Jailbreak Techniques • 2:11
Leverage red team testing to identify prompt injection and jailbreaking vulnerabilities, applying layered defense, system prompt hardening, tokenizer-based detection, and rejection-based reinforcement learning to improve refusals.
Adversarial Prompt Engineering & Dataset Design • 2:11
Master adversarial prompt engineering and dataset design to probe alignment, policy filters, and ethical boundaries through red team testing, scoring model responses for risk and leakage.
Multi-Turn Attacks and CoT Reasoning Chains • 2:13
Explore multi-turn attacks that exploit memory in GenAI systems, using context buildup and chain-of-thought reasoning. Red teamers test memory-bound policies, interruption logic, and resets to preserve alignment.
Evaluation Criteria for Prompt Success and Brittleness • 2:11

Testing for Hallucination, Bias, Toxicity • 2:16
Identify and mitigate hallucination, bias, and toxicity by red-teaming AI outputs across domains, scoring factual correctness, bias safety, and toxicity risk with retrieval-augmented generation and transparent documentation.
Data Poisoning, Model Extraction, Alignment Bypass • 2:09
Examine data poisoning, model extraction, and alignment bypass as advanced threats to LLM safety, with red teamers testing defenses across training, inference, and deployment using poisoned data.
Socio-Technical Harm & Cultural Sensitivity Testing • 2:11
Factuality, Grounding, and Response Coherence Tests • 2:12
Explore factuality, grounding, and response coherence tests in GenAI through red team prompts, citation checks, RAG architectures, chain-of-thought prompts, and remediation workflows.

Testing Content Filters and Prompt Firewalls • 2:05
Stress test content filters and prompt firewalls with edge-case prompts and adversarial variations to measure refusals, accuracy, and resilience across obfuscated prompts.
RAG Security and Vector Store Manipulation • 2:11
Role-based Access Control (RBAC), Token Abuse • 2:11
Examine role-based access control and token hygiene to prevent RBAC misconfigurations and token leakage. Red team simulations reveal privilege escalation and strategies for strict scoping and token rotation.
Testing System Prompts, Caching, and Instruction Retention • 2:03
Explore how system prompts shape AI behavior, test instruction retention across sessions and users, assess caching risks that can leak privacy or data, and replay completions in multi-turn tasks.

Code Generation Exploits and Sandbox Escape • 2:07
Understand how code generation can create risk, including sandbox escapes and unsafe commands. Red teamers test isolation, safety, and patterns like directory traversal and command injection to improve defense.
API Injection, Template Attacks, Dependency Risks • 1:58
Explore API injection, template attacks, and dependency risks in LLM systems. Test prompt-to-API mappings, input validation, and API calls to reveal misrouted data, privilege escalation, or RCE.
Monitoring Evasion and Logging Weaknesses • 2:07
Strengthen GenAI observability by auditing prompt, action, and memory logs to detect evasion and enforce real-time, tamper-evident, session-based logging with RBAC.
Testing for System-wide Data Integrity and Downtime • 2:35
Red teamers stress test data pipelines and simulate poisoning and external interface failures to verify whether the system degrades gracefully and to validate failover and safeguards.

Human-AI Trust Manipulation and Over-reliance • 2:07
Assess how AI confidence, tone, and visuals shape user trust and overtrust in enterprise tools. Explore red teaming methods to inject warnings, provenance, and verification prompts to prevent blind acceptance.
Social Engineering via Generative Output • 2:03
Explore how social engineering uses trust, urgency, and emotion, and how generative AI automates these tactics, from phishing emails to impersonation, and how red teams test and defend against them.
Multi-Agent Attack Chains and Decision Hijacking • 2:01
Examine how multi-agent attack chains and decision hijacking unfold in autonomous AI systems, and show red team tests to enforce cryptographic signatures, memory boundaries, and role checking.
Chain-of-Custody and Traceability Failures • 2:06
Develop chain-of-custody and traceability through red team simulations of broken audit trails and memory edits, and enforce immutable logging, prompt versioning, model tagging, and tool-level logs.
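
One common way to make an audit trail tamper-evident is to chain entries by hash, so that editing an earlier entry invalidates everything after it. The sketch below is a minimal illustration of that idea, not the course's method: the function names, the entry layout, and the sample events are all hypothetical, and a real deployment would also need protected storage and signed checkpoints.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder predecessor hash for the first entry


def _digest(prev_hash, event):
    """Hash an event together with the hash of the previous entry."""
    payload = json.dumps({"prev": prev_hash, "event": event}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def append_entry(log, event):
    """Append an event, linking it to the current tail of the log."""
    prev_hash = log[-1]["hash"] if log else GENESIS
    log.append({"event": event, "prev": prev_hash,
                "hash": _digest(prev_hash, event)})


def verify_chain(log):
    """Return True only if every entry still matches its recorded hash."""
    prev_hash = GENESIS
    for entry in log:
        expected = _digest(prev_hash, entry["event"])
        if entry["prev"] != prev_hash or entry["hash"] != expected:
            return False
        prev_hash = entry["hash"]
    return True


# Hypothetical events: a prompt followed by a tool call.
audit_log = []
append_entry(audit_log, {"type": "prompt", "text": "summarize report"})
append_entry(audit_log, {"type": "tool_call", "name": "search"})
intact = verify_chain(audit_log)

# Simulate a red team "memory edit": rewrite the first event after the fact.
tampered = [dict(e) for e in audit_log]
tampered[0]["event"] = {"type": "prompt", "text": "edited later"}
tamper_detected = not verify_chain(tampered)
```

Hash chaining only detects edits; it does not prevent them. An attacker who can rewrite the whole log can also recompute the chain, which is why the latest hash is usually anchored somewhere the attacker cannot reach.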

Open-Source Tools for Model Testing (e.g., PyRIT, PromptBench) • 1:54
Automation of Adversarial Scenarios and Static Datasets • 2:02
Automate adversarial testing with static datasets and dynamic generation to benchmark robustness, detect brittleness, and continuously improve prompt safety and model alignment across CI/CD.
Logging, Monitoring, and Alerting Integrations • 2:03
Sample Red Team Playbooks and Walkthroughs • 2:02
Explore red team playbooks and walkthroughs that simulate prompt injection, injection vectors, and retrieval risks to test models, agents, and workflows across a lifecycle of versioned, risk-tagged remediation.

Requirements

Some exposure to OWASP or NIST frameworks

Description

This comprehensive course, OWASP GenAI Red Teaming Complete Guide, equips learners with practical and strategic expertise to test and secure generative AI systems. The curriculum begins with foundational concepts, introducing learners to the generative AI ecosystem, large language models (LLMs), and the importance of red teaming to uncover security, safety, and trust failures. It contrasts GenAI red teaming with traditional methods, highlighting how risks evolve across model architectures, human interfaces, and real-world deployments. Through an in-depth risk taxonomy, students explore OWASP and NIST risk categories, STRIDE modeling, MITRE ATLAS tactics, and socio-technical frameworks like the RAG Triad. Key attack surfaces across LLMs, agents, and multi-modal inputs are mapped to emerging threat vectors. The course then presents a structured red teaming blueprint, guiding learners through scoping engagements, evaluation lifecycles, and defining metrics for success and brittleness.

Advanced modules dive into prompt injection, jailbreaks, adversarial prompt design, multi-turn exploits, and bias evaluation techniques. Students also assess model vulnerabilities such as hallucinations, cultural insensitivity, and alignment bypasses. Implementation-level risks are analyzed through tests on content filters, prompt firewalls, RAG vector manipulation, and access control abuse. System-level modules examine sandbox escapes, API attacks, logging gaps, and supply chain integrity. Learners are also introduced to runtime and agentic risks like overtrust, social engineering, multi-agent manipulation, and traceability breakdowns.

Practical tooling sessions feature hands-on red teaming with PyRIT, PromptBench, automation workflows, and playbook design. Finally, the course addresses operational maturity—showing how to build cross-functional red teams, align roles with RACI matrices, and apply red teaming within regulatory and cultural boundaries. With case-driven instruction and security-by-design thinking, this course prepares learners to operationalize GenAI red teaming at both the technical and governance levels.

Who this course is for:

AI Security Engineers looking to build red teaming capabilities for LLM systems
Cybersecurity Analysts and SOC teams responsible for detecting GenAI misuse
Red Team Professionals seeking to expand into AI-specific adversarial simulation
Risk, Compliance, and Governance Leads aiming to align GenAI systems with NIST, OWASP, or EU AI Act standards
Product Owners and Engineering Managers deploying GenAI copilots or RAG-based assistants
AI Researchers and Data Scientists focused on model safety, bias mitigation, and interpretability
Ethics, Policy, and Trust & Safety teams developing responsible AI frameworks and testing protocols
Advanced learners and cybersecurity students wanting hands-on exposure to adversarial GenAI evaluation
Organizations adopting LLMs in regulated domains such as finance, healthcare, legal, and government

Course content

Foundations of GenAI Red Teaming • 4 lectures • 7min

Risk Taxonomy and Threat Modeling • 4 lectures • 9min

The GenAI Red Teaming Process • 4 lectures • 9min

Adversarial Techniques and Prompt Attacks • 4 lectures • 9min

Model Evaluation and Exploitation • 4 lectures • 9min

Implementation and Guardrail Bypass • 4 lectures • 9min

System and Supply Chain Testing • 4 lectures • 9min

Runtime Evaluation and Agentic AI Risks • 4 lectures • 8min

Tools, Automation and Playbooks • 4 lectures • 8min

Organizational Maturity and Governance • 4 lectures • 8min
