Securing GenAI Systems: From Prompts to Autonomous Agents

Rating: 5.0 (1 review)

A Hands-On Security & Architecture Course for Building Safe, Trustworthy, and Production-Ready GenAI Applications

Created by Jayant Thakre

Last updated 3/2026

English

What you'll learn

Design secure GenAI architectures
Identify AI-specific vulnerabilities
Prevent prompt injection & data leakage
Secure agents & tool usage
Meet GenAI compliance requirements
Red-team and monitor AI systems

Course content

5 sections • 46 lectures • 10h 8m total length

1.1 Course Overview & Learning Path • 3:20
What this course covers (and what it doesn’t)
How developers should approach AI security
Real-world consequences of insecure GenAI
1.2 How GenAI Systems Actually Work • 19:00
Tokens, context windows, inference
Why LLMs are probabilistic, not logical
Non-determinism explained with examples
1.3 Why Traditional AppSec Fails for AI • 16:30
Comparison: Traditional software vs LLM interface
Why validation, auth, and logic checks break
A new foundation for AI Security
2.1 Common GenAI Architectures • 17:07
Prompt-only apps
RAG systems
Tool-using agents
Autonomous agents
2.2 AI Data & Control Flow • 17:16
Prompt → Model → Tool → Action
Where humans exit the loop
Hidden control paths
2.3 AI Attack Surface Mapping • 18:36
User input vectors
Model behavior vectors
Tool & data vectors

3.1 Prompt Injection Architectural Vulnerability • 17:24
Direct vs indirect injection
Why “ignore previous instructions” works
3.2 The Fragile Instruction Hierarchy • 15:03
System, developer, user, tool messages
How hierarchy collapses
3.3 Prompt Injection Causes Structural Damage • 14:47
System prompt leakage
Policy bypass
Tool override
4.1 RAG Architecture Deep Dive • 16:29
Chunking, embeddings, retrieval
Where security breaks
4.2 RAG Attack Scenarios • 13:33
Data poisoning
Malicious document injection
Context hijacking
4.3 Securing RAG Pipelines • 18:20
Filtering & validation
Metadata enforcement
Context boundaries
5.1 How LLM Tool Calling Works • 14:15
5.2 Tool Abuse & Parameter Injection • 14:45
5.3 Securing Tools & APIs • 19:19
6.1 Agent Autonomy & Emergence • 18:11
Recognize emergent agent behavior
Why agents behave unexpectedly
6.2 Multi-Agent Failure Modes • 18:24
Collusion
Recursive loops
Goal hijacking
6.3 Guardrails for Agentic Systems • 19:19
Action budgets
Execution limits
Kill switches

7.1 AI Data Lifecycle Risks • 16:00
Training vs inference
Memorization risks
7.2 PII Leakage & Privacy Failures • 18:48
Privacy failures
Prompt & output risks
7.3 Compliance for Developers • 19:31
GDPR, CCPA
AI regulations overview
8.1 AI Supply Chain Threats • 21:54
Chain of custody
Fine-tuning risks
8.2 Model Provenance & Trust • 16:14
Backdoors
Model versioning
8.3 Managing AI Vendor Risk • 20:18
Vendor risk vectors
Smoke bomb test
9.1 AI-First Threat Modeling • 21:21
AIA STRIDE framework
Abuse cases vs misuse cases
9.2 Secure Prompt Engineering Patterns • 17:29
Prompt isolation
Deterministic outputs
9.3 Designing Secure AI Features • 15:12
Defense-in-depth for AI

LLM01:2025 Prompt Injection • 6:04
A Prompt Injection Vulnerability occurs when user prompts alter the LLM’s behavior or output in unintended ways. These inputs can affect the model even if they are imperceptible to humans; prompt injections therefore do not need to be human-visible or readable, as long as the content is parsed by the model.
LLM02:2025 Sensitive Information Disclosure • 8:17
Sensitive information can affect both the LLM and its application context. This includes personally identifiable information (PII), financial details, health records, confidential business data, security credentials, and legal documents. Proprietary models may also have unique training methods and source code considered sensitive, especially in closed or foundation models.
LLM03:2025 Supply Chain Vulnerabilities • 9:00
LLM supply chains are susceptible to various vulnerabilities, which can affect the integrity of training data, models, and deployment platforms. These risks can result in biased outputs, security breaches, or system failures. While traditional software vulnerabilities focus on issues like code flaws and dependencies, in ML the risks also extend to third-party pre-trained models and data.
These external elements can be manipulated through tampering or poisoning attacks.
LLM04:2025 Data and Model Poisoning • 8:49
Data poisoning occurs when pre-training, fine-tuning, or embedding data is manipulated to introduce vulnerabilities, backdoors, or biases. This manipulation can compromise model security, performance, or ethical behavior, leading to harmful outputs or impaired capabilities. Common risks include degraded model performance, biased or toxic content, and exploitation of downstream systems.
LLM05:2025 Improper Output Handling • 8:29
Improper Output Handling refers specifically to insufficient validation, sanitization, and handling of the outputs generated by large language models before they are passed downstream to other components and systems. Since LLM-generated content can be controlled by prompt input, this behavior is similar to providing users indirect access to additional functionality. Improper Output Handling differs from Overreliance in that it deals with LLM-generated outputs before they are passed downstream, whereas Overreliance focuses on broader concerns around overdependence on the accuracy and appropriateness of LLM outputs. Successful exploitation of an Improper Output Handling vulnerability can result in XSS and CSRF in web browsers as well as SSRF, privilege escalation, or remote code execution on backend systems.
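As one illustration of treating model output as untrusted before it reaches a browser, here is a minimal Python sketch. The function name `render_model_reply` and the wrapping `<div>` are assumptions made for this example, not part of the course material; the only library call used is the standard library's `html.escape`.

```python
import html


def render_model_reply(model_output: str) -> str:
    """Escape LLM-generated text before embedding it in an HTML page.

    Hypothetical helper: the name and the wrapper markup are illustrative.
    """
    # Treat the model's output like any other untrusted user input:
    # escape <, >, & and quotes so the browser renders it as plain text.
    escaped = html.escape(model_output, quote=True)
    return '<div class="reply">' + escaped + "</div>"


# A reply that an attacker steered into containing script markup.
malicious_reply = '<script>alert("xss")</script>'
safe_html = render_model_reply(malicious_reply)
print(safe_html)
```

Escaping only addresses the browser-rendering case. Output passed to a shell, a SQL engine, or an HTTP client would need its own context-specific handling.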
LLM06:2025 Excessive Agency • 7:45
Excessive Agency is the vulnerability that enables damaging actions to be performed in response to unexpected, ambiguous or manipulated outputs from an LLM, regardless of what is causing the LLM to malfunction. Common triggers include:
hallucination/confabulation caused by poorly-engineered benign prompts, or just a poorly-performing model;
direct/indirect prompt injection from a malicious user, an earlier invocation of a malicious/compromised extension, or (in multi-agent/collaborative systems) a malicious/compromised peer agent.
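A common way to limit the damage from either trigger is to authorize tool calls outside the model. The following is a minimal sketch of a deny-by-default tool policy, assuming hypothetical tool names (`search_docs`, `send_email`) and a hypothetical `authorize` helper; it is not taken from the course labs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ToolPolicy:
    allowed: bool
    needs_human_approval: bool


# Hypothetical tool names, chosen only for illustration.
POLICIES = {
    "search_docs": ToolPolicy(allowed=True, needs_human_approval=False),
    "send_email": ToolPolicy(allowed=True, needs_human_approval=True),
}


def authorize(tool_name: str, approved_by_human: bool = False) -> bool:
    """Decide whether a tool call requested by the model may run."""
    policy = POLICIES.get(tool_name)
    # Deny by default: tools without an explicit policy never execute.
    if policy is None or not policy.allowed:
        return False
    # High-impact tools additionally require a human sign-off.
    if policy.needs_human_approval and not approved_by_human:
        return False
    return True


print(authorize("search_docs"))
print(authorize("send_email"))
print(authorize("delete_database"))
```

Because the check runs in application code, a hallucinated or injected tool request is rejected regardless of what the model was persuaded to output.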
LLM07:2025 System Prompt Leakage • 7:48
The system prompt leakage vulnerability in LLMs refers to the risk that the system prompts or instructions used to steer the behavior of the model can also contain sensitive information that was not intended to be discovered. System prompts are designed to guide the model’s output based on the requirements of the application, but may inadvertently contain secrets. When discovered, this information can be used to facilitate other attacks.
LLM08:2025 Vector and Embedding Weaknesses • 8:24
Vectors and embeddings vulnerabilities present significant security risks in systems utilizing Retrieval Augmented Generation (RAG) with Large Language Models (LLMs). Weaknesses in how vectors and embeddings are generated, stored, or retrieved can be exploited by malicious actions (intentional or unintentional) to inject harmful content, manipulate model outputs, or access sensitive information.
Retrieval Augmented Generation (RAG) is a model adaptation technique that enhances the performance and contextual relevance of responses from LLM applications by combining pre-trained language models with external knowledge sources. Retrieval augmentation uses vector mechanisms and embeddings.
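One way to reduce the risk of retrieval exposing sensitive information is to filter retrieved chunks by access metadata before they enter the prompt. The sketch below assumes a hypothetical `Chunk` type with an `allowed_groups` field and a hypothetical `filter_for_user` helper; real vector stores expose their own metadata filters, which are not shown here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    text: str
    allowed_groups: frozenset  # groups permitted to read this chunk (assumed field)


def filter_for_user(chunks, user_groups):
    """Keep only the chunks the requesting user is entitled to see."""
    groups = set(user_groups)
    # A chunk passes if it shares at least one group with the user.
    return [c for c in chunks if c.allowed_groups & groups]


retrieved = [
    Chunk("Public product FAQ", frozenset({"public"})),
    Chunk("Quarterly payroll summary", frozenset({"finance"})),
]

visible = filter_for_user(retrieved, ["public"])
print([c.text for c in visible])
```

The filter runs after similarity search and before prompt assembly, so a semantically relevant but unauthorized chunk never reaches the model.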
LLM10:2025 Unbounded Consumption • 10:12
Inference is the process by which a Large Language Model (LLM) generates outputs based on input queries or prompts. It is a critical function of LLMs, involving the application of learned patterns and knowledge to produce relevant responses or predictions.
Attacks designed to disrupt service, deplete the target’s financial resources, or even steal intellectual property by cloning a model’s behavior all depend on a common class of security vulnerability in order to succeed. Unbounded Consumption occurs when a Large Language Model (LLM) application allows users to conduct excessive and uncontrolled inferences, leading to risks such as denial of service (DoS), economic losses, model theft, and service degradation. The high computational demands of LLMs, especially in cloud environments, make them vulnerable to resource exploitation and unauthorized usage.
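A typical countermeasure is a per-user usage quota enforced before each inference call. The following is a minimal sketch of a sliding-window token budget; the class name `TokenBudget`, its parameters, and the in-memory storage are assumptions for illustration, and a production system would likely keep this state in a shared store.

```python
import time
from collections import defaultdict
from typing import Optional


class TokenBudget:
    """Sliding-window cap on tokens consumed per user (illustrative only)."""

    def __init__(self, max_tokens: int, window_seconds: float):
        self.max_tokens = max_tokens
        self.window_seconds = window_seconds
        self._usage = defaultdict(list)  # user_id -> [(timestamp, tokens)]

    def try_consume(self, user_id: str, tokens: int,
                    now: Optional[float] = None) -> bool:
        """Record the request and return True only if it fits the budget."""
        now = time.monotonic() if now is None else now
        cutoff = now - self.window_seconds
        # Drop usage records that have aged out of the window.
        recent = [(t, n) for t, n in self._usage[user_id] if t > cutoff]
        used = sum(n for _, n in recent)
        if used + tokens > self.max_tokens:
            self._usage[user_id] = recent
            return False  # reject before any inference cost is incurred
        recent.append((now, tokens))
        self._usage[user_id] = recent
        return True


budget = TokenBudget(max_tokens=100, window_seconds=60)
print(budget.try_consume("alice", 60, now=0.0))
print(budget.try_consume("alice", 50, now=10.0))
```

The explicit `now` argument exists only to make the behavior easy to test; callers would normally omit it.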

Requirements

Basic understanding of APIs, cloud services, and web security
Familiarity with LLMs (prompting, embeddings, RAG)

Description

Generative AI has changed how software is built — but it has also introduced entirely new security failures that traditional AppSec and cloud security models were never designed to handle.

This course is a deep, hands-on journey into the real security risks of modern GenAI systems, from prompt injection and RAG poisoning to tool abuse and autonomous agent failures. It is designed for software engineers, security engineers, architects, and AI practitioners who need to move beyond theory and understand how GenAI systems actually fail in production — and how to secure them properly.

Unlike high-level AI safety courses, this program is practical, adversarial, and systems-focused. You’ll break real GenAI workflows, observe emergent failures, and then implement concrete defenses using industry-aligned patterns.

By the end of this course, you won’t just understand GenAI security — you’ll know how to design, test, and govern AI systems safely at scale.

What You’ll Learn

Core Concepts

Why GenAI security is fundamentally different from traditional AppSec
How non-determinism breaks existing security assumptions
Where trust boundaries actually exist in AI systems
Why “prompt security” alone is insufficient

Hands-On Skills

Exploit prompt injection and instruction hierarchy failures
Poison RAG pipelines and observe real-world impact
Abuse tool calling and function execution
Trigger unintended behavior in multi-agent systems
Implement real mitigations using policies, constraints, and governance

Defensive Architecture

Secure RAG design patterns
Tool and function authorization models
Agent guardrails and bounded autonomy
Policy enforcement outside the model
Safe failure and human-in-the-loop design

What Makes This Course Different

Hands-on labs, not slides
Real failure modes, not hypothetical risks
Agentic AI coverage (rare and critical)
Security-first design mindset
Aligned with OWASP LLM Top 10 & MAESTRO
Built for production engineers, not researchers

Each week includes:

Conceptual video lessons
Attack walkthroughs
Jupyter-based labs
Defensive redesigns
Reflection and threat modeling exercises

Who This Course Is For

Software Engineers building AI-powered applications
Security Engineers responsible for AI risk
AI/ML Engineers deploying LLM systems
Architects designing agent-based workflows
Security leaders evaluating GenAI risk exposure

No prior AI security experience required — but comfort with APIs and basic Python is recommended.

Final Outcome

After completing this course, learners will be able to:

Identify real GenAI security risks
Design secure AI architectures
Prevent prompt, RAG, and tool-based attacks
Safely deploy agentic systems
Evaluate AI products with a security-first lens

Who this course is for:

Software engineers building GenAI features
ML engineers & AI platform teams
Security engineers transitioning to AI security
Technical leaders & architects
Technical Product Managers

Course content

Understand the GenAI Landscape • 6 lectures • 1hr 32min

Master The Threats • 12 lectures • 3hr 20min

Build Security into the Lifecycle • 9 lectures • 2hr 47min

OWASP Top 10 for LLMs (GenAI Security Project) • 9 lectures • 1hr 15min

OWASP Top 10 For Agentic Applications 2026 • 10 lectures • 1hr 15min
