Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

AI Guardrails & Cybersecurity-AI Agents, Red Teaming HandsOn

Name: AI Guardrails & Cybersecurity-AI Agents, Red Teaming HandsOn
Rating: 4.4 (3190 reviews)

Build Responsible AI using OpenSource Models [HuggingFace], Platforms [Amazon Bedrock], Frameworks[Nemo, GuardrailsAI]

Created byFirstlink Consulting

Last updated 7/2026

English

German [Auto],English [Auto],

What you'll learn

Understand the fundamentals of AI Guardrails and their importance in ethical AI development.
Retrieval Augmented Generation: Learn about RAG, Vector store
User Input Guardrails : Learn about prompt injections, user input moderations (hate, violence etc) and ways to detect user input violations
Hallucination: Learn about Hallucination and detecting Hallucination using Open Source model from Hugging Face
Evaluators - Faithfulness Evaluator(LLM-As-A-Judge), SAS Evaluator, Context Relevance Evaluator and RAGAS Evalauator
Haystack Framework: Introduction to Haystack pipeline
Guardrails on AWS Bedrock : Learn to configure, deploy and run Guardrails on AWS Bedrock
Explore Real World Guardrails Models using Huggingface and Colab Notebooks
Learn architecture and gain insight on open source frameworks like GuardrailsAI and NemoGuardrails with real-world AI projects.
Learn to implement AI Guardrails and Nemo Framework in AI projects to prevent bias, ensure privacy, and enhance security.

Course content

29 sections • 128 lectures • 8h 58m total length

Welcome to AI Guardrails4:17
This section covers 10,000 foot view of AI Application and how Guardrails are applied on GenAI Applications. It also highlights what you will learn with the course offerings.
Course Contents2:57
Explore lm fundamentals, guardrails, and cybersecurity for generative ai; learn vector embeddings, retrieval augmented generation, input and foundation model guardrails, and bedrock integration for ai guardrails.

10,000 Foot view on Language Models7:33
10,000 foot view of different models in the current industry. First, we will learn about different model categories and how models have evolved over time eg- BERT, Language Model, LLM. We will also cover different terminology used in the industry for model development eg- Fine tuning, SFT(Supervise Fine Tuning, RLHF(Reinforcement Learning From Human Feedback)
LLM Inference Parameters11:46
We will take a deep dive on different inference parameters that will help regulate and manage response generation. These parameters are temperature, top_k, top_p, response length, stop sequences and penalties
Vector Embedding4:47
Explore embedding models, vector embeddings, and vector search, showing how multidimensional vectors capture semantic meaning and distance-based similarity for tasks like query understanding and clustering.
RAG - Retrieval Augment Generation7:02
Explore retrieval augmented generation (rag) and how it optimizes language models by referencing an external knowledge base, with data collection, chunking, embedding, and vector database interactions.

Introduction to Large Language Models (LLMs)1:56
Explore the fundamentals of large language models and their role in conversational AI, education, and content creation, and examine AI guardrails that govern their responsible use.
LLM Constraints in Modern AI Applications1:34
Explore four major LLM constraints in modern ai applications, including hallucination, bias and ethical concerns, data privacy and security, and output alignment, to guide responsible and innovative deployments.
LLM Constraints - Hallucination3:32
Explore how hallucination in large language models produces false or irrelevant outputs and learn guardrails to detect, mitigate, and safely deploy AI across high-stakes domains.
LLM Constraints - Bias and Ethical Concerns2:05
Explore bias and ethical concerns in LMS, including fairness, equity, privacy, security, transparency, and accountability, and how guardrails safeguard responsible deployment.
LLM Constraints - Data Privacy and Security2:14
Explore data privacy and data security in the use of LLMs, and learn guardrails that protect personal data, prevent unauthorized disclosure, and preserve trust and reputation.
LLM Constraints – Output Alignment3:13
Learn how output alignment keeps large language models aligned with organizational goals and values, preventing misalignment and guiding responsible use through guardrails for generative artificial intelligence.
Understanding Constraints in Large Language Models

Introduction to AI Guardrails3:16
Explore how AI guardrails use filters and guidelines to govern data intake. See how they safeguard outputs by blocking inappropriate prompts and correcting responses.
Applications of AI Guardrails2:48
Explore how guardrails enhance reliability, predictability, ethical use, and public trust across e-commerce, mental health, legal, and travel domains, with input and output checks and Guardrails AI and Namo guardrails.
Understand AI Guardrails

Prompts2:16
Explore prompts as the intermediary between human intent and generative AI, detailing instruction, context, input data, and output format with examples like bar chart summaries.
Prompt Injection3:16
Explore how prompts guide generative AI and how prompt injection can bypass guardrails, highlighting instruction, context, input data, and output format. Learn mitigation with prompt guard.
Prompt Guard3:07
Explore prompt guard, an open source prompt card classifier for llama 3.1 that detects malicious prompts and injected inputs. Fine tune it on your data for precise filtering.
Hands on with prompt-guard model10:06
demonstrates running a prompt guard model with 86 million parameters on google colab using hugging face, checking for jailbreak and prompt injection using tokenization, logits, and probability scores.
Llama Guard Theory1:24
Explore Llama Guard three, a 3.18 billion parameter pre-trained model fine-tuned for content safety classification that outputs safe or unsafe prompts, supports eight languages, and lists 14 moderation categories.
Llama Guard Prompt3:04
Learn how Llama guard prompts extend Llama 2 to Llama 3, adding new categories such as information, election, code interpreter, and abuse, with a multilingual prompt format and safety checks.
Hands on with Llama Guard Model8:10
In this video, we will use Llama Guard 2 model from Meta to moderate contents from malicious users.

Hallucination4:49
While hallucinations are a major source of misinformation, they are not the only cause; biases introduced by the training data and incomplete information can also contribute.
Detect Hallucination3:06
Learn to detect hallucination using faithfulness metrics in a retrieval augmented generation pipeline, and apply open-source models like V3 hallucination judge and the Victory hallucination evaluation model.
Detect Hallucination with phi3-hallucination-judge model9:05
Explore how to detect model hallucinations with the phi3-hallucination-judge model using a Colab notebook, tokens, and Pemf transformers pipelines to assess prompts, knowledge base, and responses.
Detect Hallucination with hallucination-evaluation-model5:47
Detect hallucination using a vector-based Hallucination Evaluation model on Hugging Face, comparing premise and hypothesis with retrieved evidence in a rag retrieval process, and review real-world Colab examples.

Introduction to Microsoft Presidio5:58
Learn how Microsoft Presidio acts as a privacy guardrail to identify, assess, and anonymize PII across data types, preventing data leakage and ensuring regulatory compliance for AI pipelines.
Architecture and Components4:01
Explore Presidio's modular architecture with an analyzer engine that detects PII via pattern matching and NLP recognizers, and an anonymizer engine that masks, replaces, or encrypts identified data.
Introduction to Presidio Analyzer2:11
Learn how the Presidio Analyzer Engine detects PII with pattern matching of credit card numbers, phone numbers, social security numbers, NLP-based recognition, and context analysis, plus extensible custom recognizers.
Analyzer - HandsOn6:07
Detects PII entities in text with a Python based Procedure Analyzer and predefined and custom recognizers, using regex NER and NLP engines to analyze and anonymize data.
Custom Analyzer - Hands On4:25
Create a custom PII recognizer in Presidio, install packages, import the analyze engine and pattern cognizer, and register a title recognizer with a denial list for text analysis.
Introduction to Anonymizer1:57
Explore the anonymizer engine and its privacy-preserving transformations, including redaction, masking, replacement, and hashing, to protect PII while preserving data utility with configurable entity-type controls.
Anonymizer Hands On6:00
In this hands-on lecture, learn to configure the anonymizer engine on Presidio Analyzer results, apply replace, redact, mask, hash, or encrypt options, and run a phone-number masking example.

AWS Bedrock - Guardrails Components8:41
Explore guardrails on AWS Bedrock, including content, sensitive information, and word filters, denied topics, and contextual grounding checks, plus defenses against jailbreaks and prompt injections to prevent hallucinations.
Introduction to Bedrock2:15
Explore Amazon Bedrock guardrails in a hands-on session, use Bedrock Studio playgrounds for chat, text, and image generation, and manage base and custom models with guardrails.
Bedrock-Guardrail - Hands On11:15
Create and test guardrails in AWS bedrock, using content filters, denied topics, profanity and PII controls, and grounding checks for an investment firm scenario.
Multimodal Guardrails- Image3:32
Configure multimodal image guardrails with strong content filters for harmful categories using bedrock's guardrails api to assess prompts and images and block misconduct with high strength.

Introduction to Garak4:19
Garrick scans language models for vulnerabilities, identifying failures and prompt injections. It tests foundation models with probes, generators, and detectors across OpenAI, Hugging Face, and Cohere.
Garak Probes2:04
Garak probes define a number of ways of testing a generator (typically an LLM) for a specific vulnerability or failure mode.
Install Garak2:49
Install Garrick via the CLI, explore probe commands, and run vulnerability tests on models like GPT-2 and ChatGPT 3.5/4.0 from Huggingface.
Detect LLM Vulnerability - Encoding3:48
Probes that try to get a model to generate a specific piece of given text by presenting an encoded version of that text. Attempts to circumvent safeguards on input filtering.
XFilteration5:24
Exfiltration is the unauthorized movement of sensitive information from a secure location, often with malicious intent.
Detect LLM Vulnerability - XFilteration4:31
Explore markdown image exfiltration probes and XSS in Garrick documentation, run tests with GPT 3.5 turbo and GPT 2 on Hugging Face, and review detector results and JSON line reports.
Detect LLM Vulnerability - Profanity2:38
Explore how profanity probes reveal LLM vulnerability to harmful text generation, and how detectors flag terms such as sexual profanity and mental disability during GPT-3.5 turbo testing with YAML configs.

Requirements

For beginners: This course is structured to welcome learners at various levels. If you're new to AI, we'll start with foundational concepts, making it an ideal opportunity to dive into ethical AI development with minimal barriers.
Familiarity with programming, preferably in Python, as it's commonly used for AI development.
Access to a computer with internet connection, capable of running AI development tools and frameworks.
No prior experience with GuardrailsAI or NemoGuardrails is required; this course is designed to introduce these tools from the ground up.

Description

77% of enterprises faced Generative AI breaches last year (IBM 2025). This hands-on course teaches you to deploy production guardrails against prompt injection, hallucinations, and cyber attacks using Llama Guard 3, AWS Bedrock, and CrewAI. Master open-source frameworks like GuardrailsAI, Nemo Guardrails, and Haystack to secure real AI applications.

What You'll Learn:

1. GUARDRAIL FRAMEWORKS

Nemo Guardrails: Production-grade dialog management & intent filtering
GuardrailsAI: RAIL specs, validator policies, output structuring
AWS Bedrock Guardrails: Enterprise content policy configuration
Haystack Evaluators: RAG faithfulness/SAS metrics
Llama Guard 3: Multimodal (vision+text) jailbreak detection

2. SECURITY TESTING TOOLS

Garak: Red Teaming to scan LLM vulnerability (encoding/XFilteration/profanity)
CrewAI + OWASP ZAP: Scan Web Vulnerabilities with AI-powered web penetration testing
Prompt-Guard: Real-time injection attack blocking

3. PLATFORMS & MODELS

AWS Bedrock: Cloud-based guardrail deployment
Hugging Face: Access to phi3/prompt-guard models
Phi-3.5-vision-instruct: Multimodal safety enforcement
phi3-hallucination-judge: Hallucination scoring engine
FastRAG: Secure retrieval-augmented generation pipelines

Below is the course details

1. Input Security Guardrails

Nemo Guardrails: Dialog management for intent-based filtering
Llama Guard 3: Vision-text hybrid moderation (NSFW/jailbreak detection)
Prompt-Guard: Real-time injection blocking

2. Output Validation Systems

phi3-hallucination-judge: Quantify truthfulness scores
GuardrailsAI Validators: Enforce PII/deny-topic policies
LLM-as-Judge Fallbacks: Context relevancy checks

3. Vulnerability Scanning

Garak Probes:
- Encoding attacks
- XFilteration exploits
- Profanity detection

4. AI-Powered Cybersecurity

CrewAI Penetration testing:
- Web vulnerability scanning
- ZAP Proxy automation
- Multi-agent threat hunting

5. Enterprise Platform Guardrails

AWS Bedrock:
- Content policy configuration
- Multimodal image guardrails
Nemo Production Deployment:
- Intent classification workflows
- Custom validator integration

6. RAG Security & Evaluation

Haystack Framework:
- Pipeline construction
- SAS/faithfulness metrics
GuardrailsAI RAIL Specs:
- Output structure validation
- On-fail remediation policies

7. Multimodal Agentic Safety

ReAct Architecture: Multi-hop reasoning
Phi-3.5-vision-instruct:
- Nutritional analysis case study
- Compliance checks

KEY HANDS-ON PROJECTS

- Nemo Intent Firewall: Block restricted queries in production chatbots
- GuardrailsAI HIPAA Enforcer: PII redaction & deny-topic policies
- CrewAI Web Vulnerability Scanner: Automated XSS/SQLi detection
- Multimodal Jailbreak Detector: NSFW/image attack prevention
- RAG Audit Dashboard: SAS scoring for retrieval faithfulness

Who Should Enroll:

This course is ideal for AI developers, data scientists, business leaders, and enthusiasts eager to enhance their understanding of ethical AI practices quickly. Whether you aim to apply ethical considerations to current projects or seek to broaden your knowledge of AI safety measures, this course will equip you with the insights needed for responsible AI development.

Join Us:

Embrace the opportunity to shape the future of AI by embedding ethical considerations and safety measures into the fabric of AI technologies. Enroll in "AI Guardrails: Ensuring Ethical and Safe AI Deployments" and take a significant step towards responsible and safe AI deployment.

Who this course is for:

This course is tailored for a diverse audience interested in the ethical implications and security of artificial intelligence, including: AI and Machine Learning Engineers seeking to integrate ethical guidelines and security measures into their AI models.
Data Scientists and Analysts aiming to understand and apply data privacy and integrity measures in AI applications.
IT Professionals and Cybersecurity Experts looking to expand their knowledge in AI vulnerabilities and protective mechanisms.
Ethics Officers and Compliance Professionals in technology sectors who wish to grasp the technical aspects of AI ethics for better regulation and implementation.
Technology Students and Enthusiasts curious about the intersection of AI, ethics, and security, aiming to build responsible AI solutions. Whether you're a professional aiming to ensure AI implementations in your organization are secure and ethical, or a student venturing into the realm of AI with a desire to champion ethical practices, this course offers valuable insights and skills to help you make a meaningful impact in the field of artificial intelligence.

AI Guardrails & Cybersecurity-AI Agents, Red Teaming HandsOn

What you'll learn

Explore related topics

Course content

Introduction2 lectures • 7min

General Concepts4 lectures • 31min

Large Language Models6 lectures • 15min

AI Guardrails2 lectures • 6min

LLM01 OWASP - Prompt Injection7 lectures • 31min

Multimodel Guardrails2 lectures • 17min

LLM06 OWASP - Misinformation4 lectures • 23min

LLM06 OWASP - Sensitive Information Disclosure (Data Loss Prevention-DLP) [NEW]7 lectures • 31min

Guardrails on AWS Bedrock4 lectures • 26min

Scan LLM Vulnerability - Red Teaming7 lectures • 26min

Requirements

Description

Who this course is for: