Retrieval-Augmented Generation (RAG) for AI Agents

Name: Retrieval-Augmented Generation (RAG) for AI Agents
Rating: 4.1 (13 reviews)

A Hands-On Guide from Simple to Advanced RAG for AI Agents

Created byMichael Leontiev

Last updated 1/2026

English

What you'll learn

Learn about Vector Databases and their advantages for AI Agents
Gain practical knowledge about implementing RAG search
Learn how to build AI agents using LangGraph library
Understand the principles of semantic search

Course content

1 section • 7 lectures • 35m total length

Introduction2:32
Lesson 17:44
Explore VV8, a vector database, and its Python client to store and semantically search text vectors for AI agents, including installation, collection creation, vectorization, and integration.
Lesson 22:55
Lesson 36:19
Lesson 48:14
Explore advanced retrieval augmented generation by token-level embeddings and context weighting, compare to simple rag, and learn milvus setup, colbert encoder, and patent-search techniques.
Lesson 52:24
Lesson 65:05

Requirements

Moderate understanding of Python
Windows Operating System

Description

Unlock the next generation of AI applications by mastering Retrieval-Augmented Generation (RAG) for AI Agents. This comprehensive video series transforms you from an LLM novice into a skilled practitioner capable of building intelligent agents that access real-time knowledge, maintain conversational context, and eliminate hallucinations through semantic information retrieval. Begin with foundational concepts: understand why traditional LLMs fall short without external tools, explore context window limitations, and discover how semantic search fundamentally outperforms keyword matching. Lessons 1-3 provide hands-on implementation using Weaviate vector database and DeBERTa embeddings. You'll learn Docker deployment on Windows, CUDA GPU acceleration, and construct a functional patent-search agent using LangGraph that dynamically retrieves relevant abstracts and descriptions from USPTO data sources. Progress to advanced techniques in Lessons 4-6, where you'll implement token-level RAG using Milvus and ColBERT. This revolutionary approach stores individual token embeddings with contextual nuance, enabling granular control over search relevance. Learn to emphasize critical terms while de-emphasizing generic words, dramatically improving precision through amplitude weighting. Through step-by-step coding demonstrations, you'll master end-to-end vectorization pipelines, multi-stage agent orchestration, and seamless LLM integration with DeepSeek Chat. By course completion, you'll have built both simple chunk-based and sophisticated token-level RAG systems, ready to deploy production-grade customer support bots, research assistants, and domain-specific agents that ground their decisions in your proprietary data with remarkable accuracy.

Who this course is for:

Those with programing background that wish to build AI agents and/or use Vector Databases