Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Advanced RAG: Build & Deploy Production GenAI Apps
Bestseller
Highest Rated
Rating: 4.6 out of 5(48 ratings)
363 students

Advanced RAG: Build & Deploy Production GenAI Apps

Multi-Agent RAG, CrewAI, AutoGen, Microsoft Agent Framework, RAG, Langchain, Deep RAG, Production RAG, RAGWire
Last updated 4/2026
English

What you'll learn

  • Build a production RAG pipeline with BM25 hybrid search, RRF fusion, and Qdrant vector database
  • Build agentic RAG systems with LangChain, LangGraph self-correcting agents, and supervisor workflows
  • Build multi-agent RAG with CrewAI, Microsoft AutoGen, and Microsoft Agent Framework
  • Deploy RAG agents to AWS ECS Fargate, GCP Cloud Run, Azure, Railway, and Render with Docker
  • Build a FastAPI backend with OpenAI-compatible endpoints, SSE streaming, and Postman testing
  • Build a production Chainlit chat UI with authentication, chat history, and document ingestion
  • Configure RAGWire with OpenAI GPT, Groq, Google Gemini, Ollama, and HuggingFace embeddings
  • Implement LLM-driven auto metadata filtering over complex nested document structures in Qdrant

Course content

11 sections114 lectures11h 0m total length
  • Introduction2:36

    Course Introduction!

  • What You Will Learn in This Course!5:43

    Build a chainlit front-end connected to a back-end agentic system and vector database, enabling data uploads, OpenAI-compatible endpoints, and production-grade RAG app deployment.

  • Download Code Files0:01
  • Getting Started with The Course5:16
  • Environment Setup - PIP, UV, Anaconda and Requirements.txt6:00
  • LangSmith Setup: Debug LangChain RAG Pipelines5:35
  • Install Docker and Qdrant Vector DB Locally5:30
  • Ollama Setup: Run Qwen 3.5 and Gemma 4 Locally6:46

Requirements

  • Basic Python programming knowledge (functions, classes, pip)
  • Familiarity with REST APIs and using a terminal or command line
  • Basic understanding of Gen AI and Langchain concepts

Description

Retrieval-Augmented Generation (RAG) is at the core of every serious AI application today. But basic RAG pipelines quickly hit their limits when documents are large, queries are complex, or your application needs to run reliably in production.


In this course, you will build RAGWire — a production-grade RAG toolkit built on LangChain, Qdrant, and LangGraph — from the ground up. You will start with a simple hybrid search pipeline and progressively add advanced retrieval, metadata filtering, agentic RAG, multi-agent frameworks, a full chat UI, and multi-cloud deployment.


By the end of this course you will know how to:

  • Build a hybrid RAG pipeline with BM25 sparse + dense retrieval and Reciprocal Rank Fusion (RRF)

  • Configure RAGWire with OpenAI GPT, Groq, Google Gemini, Ollama, and HuggingFace embeddings

  • Implement LLM-driven auto metadata filtering over complex, nested document structures

  • Build agentic RAG pipelines with LangChain agent tools, memory, and reasoning

  • Build a self-correcting RAG agent that grades its own retrieval and rewrites queries when quality is low

  • Build supervisor multi-agent systems that route queries to specialist agents using LangGraph

  • Build multi-agent document analysts with CrewAI, Microsoft AutoGen, and Microsoft Agent Framework

  • Build a production Chainlit chat UI with authentication, chat history, and document upload

  • Build a FastAPI backend with OpenAI-compatible /v1/chat/completions endpoints and SSE streaming

  • Deploy RAG agents to Render, Railway, AWS ECS Fargate, GCP Cloud Run, and Azure

  • Secure production APIs with API keys and protect credentials with Docker .dockerignore


This is a hands-on, code-first course. Every section produces working, runnable code that you can adapt to your own documents and use cases.

Who this course is for:

  • Python developers who want to build production-grade RAG systems beyond basic tutorials
  • ML engineers looking to deploy LangChain and LangGraph agents to AWS, GCP, or Azure
  • Developers who want hands-on experience with LangGraph, AutoGen, and CrewAI
  • Backend developers who want to build OpenAI-compatible FastAPI endpoints for AI applications
  • AI engineers who want hands-on experience with CrewAI, AutoGen, and multi-agent frameworks
  • Anyone building document search, enterprise AI assistants, or agentic RAG applications