Master RAG: Retrieval-Augmented Generation Systems [NEW]

Name: Master RAG: Retrieval-Augmented Generation Systems [NEW]
Rating: 4.5 (4360 reviews)

Unlock the Power of AI with the RAG Triad: Advanced Techniques in Information Retrieval and Response Generation with RAG

Created by Paulo Dichone | Software Engineer, AWS Cloud Practitioner & Instructor

Last updated 6/2026

English

What you'll learn

Understanding the RAG Triad: Grasp the core components: Retriever, Generator, and Fusion Module.
Advanced Retrieval Techniques: Explore sparse and dense retrieval methods, including Dense Passage Retrieval (DPR).
Coherent Response Generation: Generate fluent and contextually appropriate responses based on retrieved documents.
Query Expansion and Re-Ranking Techniques: Improve document relevance through re-ranking strategies.

Course content

9 sections • 39 lectures • 3h 16m total length

Introduction • 1:54
Explore advanced retrieval-augmented generation techniques to boost RAG pipeline performance through hands-on work, for developers, data scientists, and AI engineers.
Course Structure • 0:49
The course blends theory with hands-on practice: it covers the fundamental concepts and the terminology, and hands-on sessions are occasionally supported by theory for a fuller overview.
Development Environment Setup • 1:23
Set up your development environment by installing Python and choosing a code editor like VS Code. Create an OpenAI account and API key to follow along with hands-on labs.

Introduction to RAG and the RAG Triad - Overview • 3:03
Introduce RAG, its motivation and advantages, and explain the RAG triad (query, response, and context) and how the three interact to keep retrieval-augmented generation systems grounded and relevant.
What is RAG and Naive RAG Overview and Pitfalls • 8:50
Explore how RAG blends a retriever and a generator to produce contextual, relevant responses, and review naive RAG's indexing, embedding, vector store, retrieval, and augmentation steps along with their pitfalls.
Deep Dive into Each Naive RAG Drawback • 6:11
Explore the drawbacks of naive retrieval-augmented generation, from limited contextual understanding and keyword-based relevance to poor retrieval-generation integration, scaling challenges, and robustness issues like hallucination, bias, and toxicity.
Long Context vs RAG - When to Choose Each [NEW] • 9:17
Check in • 0:56
Paulo invites you to leave a review to help others see the course's value, and encourages questions on the discussion board and community engagement.
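The naive RAG steps named in this section (indexing, embedding, retrieval, augmentation) can be sketched in a few lines. This is only an illustration, not the course's code: the `embed` function is a toy bag-of-words stand-in for a real embedding model, the in-memory list stands in for a vector store, and all function names and sample chunks are assumptions.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: a bag-of-words count vector (stand-in for a real model)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(chunks: list[str]) -> list[tuple[str, Counter]]:
    """Indexing step: store each chunk next to its embedding."""
    return [(chunk, embed(chunk)) for chunk in chunks]


def retrieve(index: list[tuple[str, Counter]], query: str, k: int = 2) -> list[str]:
    """Retrieval step: return the k chunks most similar to the query."""
    q = embed(query)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [chunk for chunk, _ in ranked[:k]]


def augment(query: str, context: list[str]) -> str:
    """Augmentation step: place the retrieved chunks in the prompt."""
    joined = "\n".join(context)
    return f"Answer using only this context:\n{joined}\n\nQuestion: {query}"


# Made-up sample chunks, for illustration only.
chunks = [
    "cloud revenue grew during the fiscal year",
    "the company opened a new office",
    "operating expenses increased slightly",
]
index = build_index(chunks)
top = retrieve(index, "how did cloud revenue change", k=1)
prompt = augment("how did cloud revenue change", top)
```

The resulting `prompt` is what would be sent to the generator. Because the toy embedding only counts shared words, it also shows the keyword-based relevance pitfall listed among the drawbacks above.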

Advanced RAG Techniques - Intro to Expansion with Generated Answers • 5:58
Boost retrieval quality with advanced RAG techniques through pre- and post-retrieval enhancements, including query expansion with generated answers from a large language model and vector database reranking.
Hands-on - Expansion with Answers - Splitting Text • 5:08
Apply retrieval-augmented generation by splitting a Microsoft annual report PDF into 410 chunks of 1000 characters each using LangChain tools, then embedding them with ChromaDB.
Embedding the Chunks and Showing Them • 3:03
Split text into 256-token chunks with zero overlap using sentence transformers, generate embeddings via a sentence transformer embedding function, and index them in ChromaDB while noting token size limits.
Adding Documents to the Vector Store and Performing Similarity Search • 2:53
Learn to build a Chroma vector store by creating a collection, attaching an embedding function, indexing token splits, and performing similarity search to retrieve relevant documents.
Generating the Answer & Concatenating the Relevant Documents • 5:16
Create augmented queries with a document query generator: join the original query with a hypothetical answer from a large language model, then retrieve five documents and their embeddings.
Plotting and Projecting the Embedded Results on Graph • 5:24
Project embeddings from the Chroma collection with UMAP and plot original and augmented queries against retrieved documents to show improved alignment in embedding space.
Query Expansion with Generated Answers - Summary • 1:30
Leverage RAG-inspired query expansion with generated answers to improve retrieval, showing how combining the original query and the generated answer helps rank relevant documents.
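As a rough illustration of the two ideas in this section, the sketch below splits text into fixed-size character chunks and joins a query with a hypothetical answer before retrieval. It does not use LangChain or ChromaDB, and it is not the course's code: `split_text`, `expand_with_answer`, and `fake_llm` are hypothetical names, and `fake_llm` returns canned text in place of a real model call.

```python
from typing import Callable


def split_text(text: str, chunk_size: int = 1000, overlap: int = 0) -> list[str]:
    """Split text into fixed-size character chunks with optional overlap."""
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("need chunk_size > 0 and 0 <= overlap < chunk_size")
    step = chunk_size - overlap
    return [text[i:i + chunk_size] for i in range(0, len(text), step)]


def expand_with_answer(query: str, generate: Callable[[str], str]) -> str:
    """Join the original query with a hypothetical answer from an LLM."""
    hypothetical = generate(f"Write a short plausible answer to: {query}")
    return f"{query} {hypothetical}"


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real model call; returns canned text."""
    return "Revenue rose because cloud services grew."


# 2500 characters split into 1000-character chunks with no overlap.
chunks = split_text("a" * 2500, chunk_size=1000)
joint_query = expand_with_answer("What drove revenue growth?", fake_llm)
```

The joined string, rather than the bare question, is what gets embedded and sent to the vector store. The intent is that a plausible answer sits closer in embedding space to the passages that contain the real answer.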

Query Expansion with Multiple Queries - Overview • 3:05
Expand queries with multiple subqueries generated by a large language model to improve retrieval accuracy. Retrieve documents for all subqueries, aggregate results, and generate a final contextual answer.
Getting Generated Augmented Queries • 5:38
Develop generated augmented queries for a retrieval-augmented generation system by extracting and splitting PDF data, embedding with sentence transformers, indexing in ChromaDB, and generating up to five related questions.
Retrieving and Plotting Embeddings in a 2D Graph • 7:01
Concatenate the original query with augmented queries, retrieve and deduplicate results from the vector store, then project embeddings with UMAP to visualize in a 2D graph.
CHALLENGE: Your Turn • 0:34
Explore and refine prompts and queries to see how results vary, recognizing that prompts guide related queries and influence documents from the vector database.
Expansion with Multiple Queries Downsides & Summary • 1:23
Explore expansion with multiple queries in retrieval-augmented generation, concatenating the generated queries with the original to query a vector database, and mitigate downsides like noise or hallucinations with relevance feedback or reranking.
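The retrieve-and-deduplicate step described in this section can be sketched as follows. The example is a simplified assumption rather than the course's code: `keyword_search` is a toy keyword matcher standing in for a vector store query, the related question is written by hand instead of generated by a model, and the corpus is made up.

```python
from typing import Callable

# Made-up corpus, for illustration only.
CORPUS = [
    "cloud revenue increased",
    "linkedin revenue increased",
    "new office opened",
]


def keyword_search(query: str, k: int) -> list[str]:
    """Toy search: rank corpus entries by the number of shared words."""
    terms = set(query.lower().split())
    scored = [(len(terms & set(doc.split())), doc) for doc in CORPUS]
    ranked = sorted(scored, key=lambda pair: pair[0], reverse=True)
    return [doc for score, doc in ranked if score > 0][:k]


def retrieve_many(
    queries: list[str],
    search: Callable[[str, int], list[str]],
    k: int = 3,
) -> list[str]:
    """Run every query and merge the results, keeping first-seen order."""
    seen: set[str] = set()
    merged: list[str] = []
    for query in queries:
        for doc in search(query, k):
            if doc not in seen:
                seen.add(doc)
                merged.append(doc)
    return merged


original = "cloud revenue"
augmented = ["linkedin revenue"]  # would normally come from an LLM
merged = retrieve_many([original] + augmented, keyword_search, k=2)
```

Both queries return the same two documents in different orders, and the merged list contains each one once. More generated queries widen recall but can also pull in off-topic documents, which is the noise downside mentioned above.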

Re-ranking & Cross-encoder and Bi-encoders - Overview • 4:50
Refine and reorder retrieved documents with a cross-encoder reranking model to prioritize top results for search, QA, and legal document retrieval.
Ranking Long-tail Results with Cross-encoder • 7:24
Learn how to rerank long-tail search results using a cross-encoder in a retrieval-augmented generation workflow, including embeddings, vector database, and query expansion to improve relevance.
Final Step - Pass the Ranked Documents through a LLM to Get Relevant Answer • 4:53
Select the five ranked documents, concatenate them as context, and query GPT-3.5 Turbo to reveal the factors driving fiscal year 2023 revenue, such as Microsoft cloud and LinkedIn revenue.
Re-ranking Summary • 0:57
Apply cross-encoder reranking to reorder the initially retrieved documents, feed top results into a large language model, and refine RAG applications by testing with different queries.
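The reranking flow in this section (score each query-document pair, reorder, keep the top results, concatenate them as context) can be sketched as below. A cross-encoder scores the query and a document together as one input, whereas a bi-encoder embeds them separately. Here `overlap_score` is a hypothetical word-overlap stand-in for a real cross-encoder model, and the documents are made up.

```python
from typing import Callable


def overlap_score(query: str, doc: str) -> float:
    """Stand-in for a cross-encoder: fraction of query words found in doc."""
    q = set(query.lower().split())
    d = set(doc.lower().split())
    return len(q & d) / len(q) if q else 0.0


def rerank(
    query: str,
    docs: list[str],
    score_pair: Callable[[str, str], float],
    top_k: int = 5,
) -> list[str]:
    """Score every (query, doc) pair and return the top_k docs, best first."""
    scored = [(score_pair(query, doc), doc) for doc in docs]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in scored[:top_k]]


def build_context(docs: list[str]) -> str:
    """Concatenate the selected documents into one context string."""
    return "\n\n".join(docs)


# Made-up first-stage results, in their initial retrieval order.
retrieved = [
    "office locations worldwide",
    "cloud revenue growth factors",
    "revenue by segment",
]
best = rerank("factors driving revenue growth", retrieved, overlap_score, top_k=2)
context = build_context(best)
```

With a real model, `score_pair` would call the cross-encoder on each pair. Since that costs one model pass per document, reranking is typically applied only to the handful of candidates returned by the first-stage retrieval.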

Dense Passage Retrieval Overview • 2:14
Learn dense passage retrieval (DPR) with dual encoders that map questions and passages to dense vectors, using dot product similarity for open-domain question answering and fast document retrieval.
The DPR technique - Full Hands-on • 4:50
Implement the DPR technique end-to-end with pre-trained question and context encoders, tokenizers, and cosine similarity to retrieve the most relevant passages for a query.
DPR Summary • 0:58
Explore dense passage retrieval (DPR) using a question encoder and a passage encoder to create dense vectors and retrieve results via dot product similarity for RAG applications.
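The retrieval step of DPR, once the two encoders have produced their vectors, reduces to a dot product ranking. The sketch below shows only that step and is not the course's code: the three-dimensional vectors are hand-written placeholders for the outputs of the question and passage encoders, and the function names are assumptions.

```python
def dot(u: list[float], v: list[float]) -> float:
    """Dot product of two equal-length dense vectors."""
    return sum(a * b for a, b in zip(u, v))


def retrieve_by_dot(
    question_vec: list[float],
    passage_vecs: dict[str, list[float]],
    k: int = 1,
) -> list[str]:
    """Return the ids of the k passages with the highest dot product."""
    ranked = sorted(
        passage_vecs,
        key=lambda pid: dot(question_vec, passage_vecs[pid]),
        reverse=True,
    )
    return ranked[:k]


# Placeholder vectors; a passage encoder would produce these offline.
passage_vecs = {
    "p1": [0.9, 0.1, 0.0],
    "p2": [0.1, 0.8, 0.3],
    "p3": [0.0, 0.1, 0.9],
}
# Placeholder for the question encoder's output at query time.
question_vec = [0.2, 0.9, 0.1]

best = retrieve_by_dot(question_vec, passage_vecs, k=2)
```

Passage vectors can be computed once and stored, so only the question is encoded at query time. Cosine similarity, which the hands-on lecture mentions, is the dot product of the same vectors after each is normalized to unit length.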

Other Techniques • 1:18
Explore advanced retrieval-augmented generation techniques beyond dense passage retrieval, including embedding adapters, deep chunking, and RAG fusion, and learn to refine your RAG workflows by researching current papers.
Get the Source Code for This Section • 0:04
Contextual Retrieval (Anthropic's Technique) • 9:56
Late Chunking for Better Context • 14:18
Agentic RAG with LangGraph • 24:54
GraphRAG for Complex Reasoning • 19:20
Multimodal RAG with ColPali • 10:29
Advanced RAG - Summary • 4:04
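Of the techniques listed in this section, RAG fusion is commonly described as merging the ranked result lists from several generated queries with reciprocal rank fusion. The listing does not say how the course implements it, so the sketch below is an assumption: it uses the usual formula, where each document earns 1 / (k + rank) per list, with k = 60 as a conventional default and made-up document ids.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse ranked lists: each doc earns 1 / (k + rank) for every list it is in."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] += 1.0 / (k + rank)
    # Highest fused score first.
    return sorted(scores, key=lambda doc: scores[doc], reverse=True)


# One ranked list per generated query (made-up document ids).
rankings = [
    ["a", "b", "c"],
    ["b", "a", "d"],
    ["b", "c", "a"],
]
fused = reciprocal_rank_fusion(rankings)
```

Document "b" comes out first because it ranks near the top of all three lists, while "d" appears in only one list and comes last. The fusion needs only rank positions, so it works even when the lists come from retrievers with incomparable scores.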

Requirements

Basic Understanding of AI and NLP Concepts: Familiarity with foundational AI and NLP principles.
Programming Experience: Proficiency in Python, as it will be the primary programming language used.

Description

Unlock the full potential of AI with our comprehensive course on Retrieval-Augmented Generation (RAG) Systems. Dive deep into the powerful RAG Triad and learn how to leverage advanced techniques in information retrieval, response generation, and agent-based architecture. Designed for AI enthusiasts, data scientists, and NLP professionals, this course provides everything you need to build state-of-the-art RAG systems that deliver accurate, contextually relevant, and coherent responses to complex queries.

What You'll Learn:

The RAG Triad: Understand the components of RAG systems, such as the retriever, generator, and Fusion Module, and how they work together to enhance information retrieval and response generation.
Advanced Retrieval Techniques: Explore sparse and dense retrieval methods, including Dense Passage Retrieval (DPR), and learn how to implement hybrid retrieval approaches for superior accuracy.
Coherent Response Generation: Master using advanced language models like GPT-3 to generate fluent and contextually appropriate responses based on retrieved documents.
Hands-On Projects: Engage in practical exercises and real-world projects to build a complete RAG system from scratch and apply your skills in various applications such as search engines, customer support, and research.

By the end of this course, you'll be equipped with the skills and knowledge to create robust RAG systems that can easily handle complex queries, making you a leader in AI and NLP.

Enroll now to transform your AI capabilities and stay ahead in the ever-evolving field of artificial intelligence.

Who this course is for:

AI Enthusiasts: Eager to explore advanced AI and NLP techniques.
Data Scientists: Seeking to enhance their information retrieval skills.
NLP Professionals: Aiming to master RAG systems for complex queries.
Tech Innovators: Looking to apply cutting-edge AI in real-world applications.

Course content

Introduction • 3 lectures • 4min

Download Code and Resources • 2 lectures • 2min

RAG (Retrieval-Augmented Generation) Deep Dive - Naive RAG vs Advanced RAG • 5 lectures • 28min

Advanced RAG Deep Dive - Advanced Techniques • 7 lectures • 29min

Hands-on: Advanced RAG Technique - Query Expansion with Multiple Queries • 5 lectures • 18min

Hands-on - Advanced RAG Technique: Re-Ranking with Cross-encoder • 4 lectures • 18min

Hands-on - Advanced RAG Technique: Dense Passage Retrieval DPR • 3 lectures • 8min

Other Advanced RAG Techniques - NEW Content • 8 lectures • 1hr 24min

Wrap up - What's Next • 2 lectures • 4min
