Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Agentic AI - Private Agentic RAG with LangGraph and Ollama

Name: Agentic AI - Private Agentic RAG with LangGraph and Ollama
Rating: 4.4 (285 reviews)

LangGraph v1, Ollama, Agentic RAG, Private RAG, Corrective RAG, CRAG, Reflexion, Self-RAG, Adaptive RAG, MySQL Agent

Role Play

Created byKGP Talkie | Laxmi Kant

Last updated 6/2026

English

German [Auto],English [Auto],

What you'll learn

Build private, production-ready Agentic RAG systems using LangGraph v1 and Ollama.
Create custom LLM workflows with LangGraph state machines, nodes, edges, and conditional routing.
Implement PageRAG, metadata extraction, PDF processing with Docling, and page-level ingestion.
Use ChromaDB, embeddings, metadata filtering, and MMR retrieval for high-accuracy search.
Apply BM25+ re-ranking and advanced retrieval pipelines for financial document analysis.
Build Agentic RAG: tool calling, reasoning loops, structured outputs, and multi-step workflows.
Implement Corrective RAG (CRAG) with document grading, query rewriting, and web search fallback.
Create custom Ollama models, Modelfiles, embeddings, and integrate with LangChain.
Build Reflexion, Self-RAG and Adaptive RAG along with MySQL Agent

Course content

14 sections • 158 lectures • 17h 37m total length

Introduction5:33
Course Introduction!
AI Agent Mastery Learning Path | Must Watch7:12
Follow the prescribed learning path from basic python to lang chain, lang graph, and private agentic rag to achieve higher success. Skipping prerequisites leads to confusion and incomplete learning.
Code Files and Install Requirements.txt4:12
Download the code file from the GitHub repository, unzip it, open it in VSCode, and install requirements.txt with pip for the Lang Smith setup and Landgraf references.

Installation and Introduction to Ollama5:40
Install ollama from olama.com by downloading the appropriate executable for Windows, Linux, or macOS, then explore the llm, lm serving, and api workflow with available models and the gui.
Explore Ollama UI Interface and Models6:12
Explore the Ulama UI to access cloud and local models. Learn how to download models, run them on GPU, and observe the thinking and response process.
Explore Context and Realtime Search Settings in Ollama8:34
Explore how to configure ollama settings, manage accounts, upgrade to cloud, set model location and context length, and enable internet access for real-time search.
Use Ollama for Quick Documents Analysis5:56
Learn to use the Lama UI as a rag system to attach images and PDFs, read with vision models like Gemma, and code workflows with Lang chain or Lang Groff.
Inspecting Qwen3 Model6:53
Inspect the Quinn three 8‑billion-parameter model in the ulama library, examining its mixture of experts, four‑bit quantization, 32 attention heads, 36 transformer layers.
Qwen3 Benchmarking Overview6:01
Evaluate benchmarks across tasks—from graduate level question answering to Olympiad mathematics and live code bench—comparing 8b, 30b, 235b, and 330b parameter models, tools, and tool calling.
Test Benchmark Questions Locally6:35
Benchmark and test a queen 330 billion parameter model locally with live coding, answering physics and algebra questions, compare to GPT-5, and evaluate thinking versus non-thinking models.
How to Select Model for a Specific Task or Project7:19
Select models by task: run large models in the cloud and use embedding models like gnomic embed text or Gemma; leverage vision models such as Gma3 and Llama 3.2 vision.
ollama pull and run commands5:56
Learn to use ollama pull and run commands from the terminal to download and serve models locally, switch between the UI and CLI, and manage model history.
ollama serve, ollama rm and ollama show commands7:04
Start ollama serve and inspect model files, then manage models with pull, list, and remove commands. Explore the UI to run Quinn and llama models with memory optimization.
ollama cp and ollama ps commands6:08
Master the Ollama cp and ps commands to copy models, name copies, and monitor running instances, including context windows and GPU usage.
Create and Run Ollama Model with Predefined Settings9:00
Learn to create and run an Ollama model with predefined settings by loading a model file, configuring temperature and context, adding a system prompt, and executing the run.
Exploring Ollama Message Commands5:14
Explore Ollama message commands in message mode, view help, inspect model info and files, and compare loading a llama 3.2 model or a saved seldon model while setting parameters.
Create Ollama Model with Message Commands6:49
Learn to create an Ollama model from llama 3.2 using message commands, set temperature and context, configure a system prompt, save the model, and adjust history and verbose settings.
Ollama Raw API Requests10:34
Explore llama raw APIs to generate completions and chat completions, test streaming and non-streaming modes, and format outputs as JSON using curl and git bash without frameworks.
Load Uncesored GGUF Models for Banned Content Generation [Only Educational Pu6:55
Learn to download a GGUF model from the hugging phase and load an uncensored wizard LM model into Ullama locally. Configure a system message for educational content exploration.

Run LLMs Locally in 2026: Qwen3.5, Nemotron & Mixtral Overview6:19
Learn to run local llms on Ullama Olama, focusing on Nemotron 3 nano (4b) and a 32b model, compare mamba and mix-of-experts architectures, and review context window and Quint 3.5.
VRAM Requirements: Qwen3.5, Nemotron & Mixtral on Ollama (2026)7:45
Calculate model memory by multiplying parameter counts by 0.5 bytes for 4-bit quantization, illustrating qwen3.5, nemotron, and mixtral on ollama with tokenizer, metadata, and vision memory.
Dense vs MoE vs Hybrid Mamba: LLM Architectures Explained4:59
Compare dense, sparse mixture of expert, and mamba architectures, detailing attention, kvcache, routing, and ssm state. Explain how active parameters differ and why context handling affects text generation speed.
Nemotron 3 Nano 4B Coding Test: LeetCode Hard Problems Locally5:57
Compare dense transformer, sparse MoE, and hybrid member transformers—Nemotron 3 Nano, Quin 3.59b, Quin 3.535b—locally with Ullama to tackle LeetCode hard problems and assess performance and acceptance.
Nemotron 3 Nano 4B vs Qwen3.5 4B: LeetCode Medium Benchmark6:18
Compare Nemotron3 nano 4B and Quen 3.5 4B on LeetCode hard and medium problems, showing 4B models struggle with hard tasks and perform better on medium tasks.
Ollama Speed Benchmark: Nemotron 3 Nano vs Qwen3.5 Tokens/sec6:21
Understand that agent states are stored as a dictionary of messages (human, AI, tool, system) and managed by memory with volatile RAM and short-term and long-term memory options.
Nemotron 3 Nano 30B Benchmark: LeetCode Hard + Speed Test4:48
Compare the generation speed of Nemotron 3 nano and Quint 3.54 billion parameter model using mamba versus dense architectures, and examine per-token throughput with no KV cache.
Qwen3.5 35B MoE Benchmark: LeetCode Hard + Speed Test4:21
Demonstrates solving LeetCode hard problems with Nemotron 30 billion parameter model, benchmarking generation speed and test case validation, and comparing with Quint 3.5 35 billion parameter model.

Introduction to Flow Engineering and Finite State Machine for LangGraph7:55
Explore flow engineering and finite state machines through the line graph, learning state, node, and edge concepts while managing messages and history.
How to Create Custom State in LangGraph6:02
Create a custom LangGraph state with a typed dict, defining input and output text fields, and build a state graph canvas to manage nodes and edges.
How to Create Custom Node and How States are Modified by LangGraph5:45
Create custom nodes in a line graph by writing a Python method and typing the input state, then transform input text to uppercase and return the updated state.
Execute Nested Nodes with LangGraph State7:50
Create and connect custom nodes like add prefix and add suffix, manage state for input and output text, and execute nested nodes in a line graph to produce the output.
Build and Visualize Your First LangGraph Graph7:06
Create and visualize your first LangGraph graph by building a state graph canvas, adding nodes like process_input, add_prefix, and add_suffix, and connecting edges to form a runnable line chain.
Invoke LangGraph and Explore How States are Changing6:26
Compile and invoke a LangGraph workflow with graph.invoke, passing input_text state and generating output_text automatically. See how keys flow across nodes and how simple versus nested states shape execution.

[Must] Learning Path for Advanced LangGraph Worflows1:27
Advance your LangGraph workflows by building a SQL agent with the line graph, exploring rag applications, and designing private agents like a MySQL agent, after reinforcing fundamentals.
Introduction MySQL Agent3:27
Introduce the MySQL agent with line graph fundamentals, outlining routing and workflows for get database schema, generate SQL query, validate SQL query, execute SQL query, and fix SQL query.
MySQL Agent Notebook Setup4:34
Set up a MySQL agent notebook by importing libraries, loading the employees db, creating the db, and connecting with a SQL database connector using long chain tools and reasoning.
MySQL Database Setup and DB SCHEMA Extraction5:53
establish a MySQL database connection, enumerate six tables, and extract the database schema to empower an agent with schema-driven data retrieval and few-shot prompts.
Create get_database_schema Langchain Tool for MySQL6:13
Implement the get_database_schema Langchain tool for MySQL to return full schema or a specific table's schema, validating table names and providing a helpful error with available tables.
Create generate_sql_query Langchain Tool for MySQL7:11
Design and implement a generate_sql_query Langchain tool for MySQL that uses a defined schema and prompts to generate only select queries for read operations.
Create validate_sql_query Langchain Tool for MySQL Part 15:17
Validate a MySQL sql query using a LangChain tool for safety and syntax before execution. Clean and normalize the query, removing sql code blocks with regex.
Create validate_sql_query Langchain Tool for MySQL Part 24:41
Design and test a validate_sql_query Langchain tool for MySQL that enforces only select statements and blocks dangerous keywords, ensuring safe, validated queries.
Create execute_sql_query Langchain Tool for MySQL6:28
learn to implement an execute_sql_query tool in LangChain for MySQL, validate queries before execution, handle errors, run queries, and interpret results with practical testing.
Create fix_sql_error Langchain Tool for MySQL4:01
Design and test a fix_sql_error tool for SQL using LangChain, passing original query, error message, and question, then return a corrected SQL query that follows SQL syntax.
Create MySQL AgentState and LLM with Tools3:43
Create a MySQL agent by defining agent state and annotating messages, wiring tools like get_database_schema and generate_sql_query within an LM with tools. Build the Elm with the tools.
Create Agent Node4:20
Create an agent node with a name and variable, and craft a system prompt detailing a SQL analyst workflow: get schema, generate SQL, validate SQL, execute SQL, and retry fixes.
Create Conditional Router to Control Agent Execution6:32
Create a conditional router that controls agent execution using should_continue, reads the latest agent state, and handles tool calls until a final answer is reached.
Create MySQL Agent with LangGraph5:03
Create a MySQL agent with LangGraph by building a graph of agent and tool nodes, wiring edges and conditional flows, so the agent orchestrates tool calls until final answer.
Qwen3 vs OpenAI GPT-OSS - Performance Evaluation of MySQL Agent7:19
Test and troubleshoot a MySQL agent by running sample queries, handling input parameters and agent state, and compare Qwen3 with GPT Oasis for SQL generation, noting model quality affects results.
Evaluating MySQL Agent with Complex Queries5:11
Evaluate a MySQL agent by testing complex queries, including group by and joins, to compute the average salary by department and identify top paid employees.

PageRAG Learning Sneak Peek3:17
Explore the PageRAG sneak peek: ingest json data into a vector db, extract metadata, apply reranking, and prepare documents for a private agent RAG workflow.
Introduction to RAG7:37
Master retrieval augmented generation (rag) concepts from data ingestion to retrieval and reranking, then design agentic rag using advanced techniques and vector databases with an embedding model.
PageRAG Architecture Design Part 15:24
Explore the page rag architecture that chunks financial documents page-wise, adds metadata, and ingests them into a vector DB for precise, reranked retrieval.
PageRAG Architecture Design Part 28:19
Design a page architecture where the agent automatically fetches relevant chunks using embeddings and metadata, filters by metadata, reranks by cosine similarity, and delivers a final answer.
PageRAG Notebook Setup7:14
Set up PageRAG notebook with metadata ingestion, filtering based on the metadata, cosine-based ranking and reranking of chunks, and vector DB integration using embeddings, doc link, and dedup hashing.
Introduction to Chroma DB and Its Setup7:54
Learn to set up chroma vector db, create a financial box collection, configure nomic embed text, specify base url, and persist data for pdf ingestion and retrieval.
Extract Documents Metadata8:05
Extracts metadata from file names by parsing company name, document type, fiscal quarter optional, and fiscal year into a dictionary. Handles pdf removal, 4-part vs 3-part formats for precise filtering.
Extract Markdown Text from PDF Documents using Docling9:29
extract markdown text from pdf pages with the doclink document converter, converting pdf pages to markdown data and preparing for page-wise access.
Compute SHA256 Hash of a File Content to Avoid Duplicated Ingestion5:52
Compute a sha256 hash of a PDF by reading 4096-byte chunks to prevent duplicate ingestion in the vector DB, storing the hash as metadata.
Track the Processed Files for Deduplication8:15
Track processed files to prevent duplicate ingestion and deduplication using a chroma vector store, fetch metadata and file hashes, and prepare for document ingestion.
Documents Ingestion in Chroma Vector DB Part 17:09
Ingest documents into a chroma vector DB by converting the data dir to a pathlib path and recursively listing PDF files, computing metadata, embeddings, and hashes.
Documents Ingestion in Chroma Vector DB Part 28:27
Ingest PDF pages by converting them to markdown, extract metadata, assemble page content with metadata, and ingest prepared documents into the vector DB.
Ingest Whole Documents Dir in Vector DB5:22
Ingest documents into a vector db using the doc link to auto-detect file types, select rapid OCR with cuda, convert to markdown.
Search Sample Chunks and Why We Need Metadata Filtering7:10
Learn how to ingest documents into a vector DB, apply metadata filtering and LLM-based extraction to retrieve and rerank relevant chunks using cosine similarity for accurate RAG results.

Learning Path for Data Retrieval and RAG3:56
Master data retrieval and reranking in rag systems by ingesting data correctly, extracting metadata from queries with structured llm outputs, and using mmr and bm25 ranking for vector db search.
Data Retrieval and Re-ranking Notebook Setup6:59
Execute data retrieval and reranking by configuring vector embeddings, metadata extraction, and LM-driven structured outputs, then apply BM25 reranking with filtered search to improve document relevance.
Define Fiscal Quarter and Documents Type Pydantic Model6:20
Define fiscal quarter and document type using a pydantic model and enums, creating a pedantic schema for structured lm outputs and guiding classification for 10-K, 10-Q, 8-K and Q1–Q4.
How to Extract Metadata from User Query6:28
Learn to extract chunk metadata from user queries by using a metadata schema and a language model to produce a structured dictionary with optional fields for vector DB filtering.
Create ChunkMetadata Pydantic Class for Metadata Extraction8:19
Define a base Pydantic model for chunk metadata extraction, with optional fields for company name, doc type, fiscal year, and fiscal quarter, enforcing predefined values via enum-based model config.
Create Pydantic Model for Ranking Keywords5:57
Define a ranking keywords pydantic model and a ranking keywords class that produces exactly five financial keywords from the user query. Integrate the model with the LLM to rank documents.
Extract Metadata Filters with Structured LLM Output7:09
Extract metadata filters and ranking keywords from user queries with a structured llm output, guided by a defined schema, detailed prompts, and few-shot mappings for company names and document types.
Generate Ranking Keywords5:54
Generate five exact financial keywords from 10-K and 10-Q filings, then apply a bm25 ranking on extracted chunks to rank related content.
Implement Basic Document Retrieval9:45
Implement the search docs method to retrieve the top five documents from the vector DB using MMR search with ranking keywords and metadata filters.
Implement Filter Clause in Chroma DB7:48
Implement metadata filtering in Chroma DB by composing a search keyword with filters and multiple conditions, using and/or logic to refine results. Set a default k and ranking keywords.
Implement Enhanced Document Retrieval9:55
Implement enhanced document retrieval by applying full text search with ranking keywords, using where document filters and contains/not contains, plus metadata and embedding model filtering.
Deep Dive Into Enhanced Retrieval Strategy8:18
Explore enhanced retrieval by combining metadata filters with full-text keyword ranking, increasing k, and reranking to push the most relevant chunks to the top.
Process Chunks for Re-Ranking Part 18:14
Extract headings, subheadings, and the following paragraph to support reranking, focusing on table headings and concise content for ranking keywords.
Process Chunks for Re-Ranking Part 26:26
Process chunks for re-ranking explains how to pair sections and headings with their content, validate next content availability, and build formatted heading-content chunks for later re-ranking.
Rank Documents using BM25 and Ranking Keywords8:43
Rank documents using BM25 plus on the heading and content chunks, then tokenize the query and corpus, compute scores, and return the top-k most relevant documents.
Rank Documents using BM25 and Ranking Keywords Part 26:52
Rank documents with BM25 plus by extracting, joining, and lowercasing document chunks, using ranking keywords, then tokenizing and scoring against query tokens to retrieve top-k results.
Rank Documents using BM25 and Ranking Keywords Part 36:50
Rank documents by keyword with a Python sort using a lambda key to sort in descending order of doc scores, and print top k indices to verify proper ranking.
Prod Level Advanced Data Retrieval and Re-Ranking in Action4:43
Explore production-level data retrieval and re-ranking with keyword-based ranking to surface contextually relevant documents, using cash flow examples like consolidated statements of cash flow and free cash flow.
Designing a GenAI Workflow Using LangGraph and Ollama Models

Introduction to Agentic RAG6:04
Explore the data ingestion and retrieval workflow of agentic rag, including vector stores, hash-based ingestion, cosine similarity with filtering and reranking, and the generation stage.
Code Like a Pro - Create Reusable Centralized Retrieval and Re-Ranker Part 17:04
Centralize data retrieval and reranking code into a reusable utils.py, enabling universal data retrieval across rag systems and applications, and export notebook methods to a clean Python module.
Code Like a Pro - Create Reusable Centralized Retrieval and Re-Ranker Part 26:43
Refactor notebook code into a centralized utils.py for retrieval and reranking, import and test utils.extract_filters with a sample query, and wire the utilities into an application.
Agentic RAG Workflow and Agent State Creation5:10
Design a rag agent workflow and create the agent state by building a retrieve dock node, applying filters, ranking keywords, searching docs, and preparing context.
Write Retrieve Docs Langchain Tool Part 17:36
Implement a Python retrieval tool for a LangChain workflow, using filters, ranking keywords, and document search and reranking, then expose it as a LangChain tool with logging and environment setup.
Write Retrieve Docs Langchain Tool Part 28:19
Learn to build a retrieve docs workflow with LangChain, format retrieved docs (metadata, content), handle empty results, and save the final context as a .md file.
Save Retrieved Context for Debugging and Understanding6:12
Store retrieved text in a local debug_logs directory as a utf-8 markdown file, then return the retrieved text to serve as the agent's context for debugging and understanding.
Create Agent Node with AgentState7:13
Design and implement an agent node by defining agent state, reading messages, attaching a tool, binding tools, and integrating a detailed system prompt to drive tool calls and document retrieval.
Create Agentic PageRAG6:44
Create an agent page with a graph workflow that routes between an agent node and a tool node, handling tool calls and delivering answer via a retrieval system with ranking.
Agentic PageRAG in Action7:03
Demonstrate testing an agentic rag workflow, where a query is broken into multiple retrievals, documents are retrieved and ranked, and a final answer with revenue data for 2023 is presented.

Quick Walkthrough of Corrective RAG Research Paper5:06
Explore the corrective RAG approach, where retrieved documents pass through an evaluator to discard irrelevant results and refetch from internal or external knowledge bases, ensuring robust production-ready RAG.
Corrective RAG (CRAG) System Design5:29
Design and implement a corrective RAG system that retrieves from internal vector, grades relevance, and routes to answer generation or web search via DuckDuckGo, with query rewriting when needed.
CRAG Notebook Setup3:10
Set up the CRAG notebook, configure state graphs and messages, import tools and embeddings, and build a retrieval tool with a structured output format for grading.
Create Centralized retrieve_docs Langchain Tool6:54
Implement two centralized tools—the retrieve docs tool and the web search tool—by importing utilities, loading environment variables, and modularizing code for production-ready Langchain workflows.
Wide Retrieval, Narrow Selection - BM25 Reranking7:27
Apply wide retrieval and narrow selection using BM25 reranking, guiding document retrieval with filters and ranking keywords in the search docs pipeline to retrieve and rank documents.
Create AgentState and Test Retrieve Docs and Web Search Tools8:12
Build and test an agent state for retrieved documents and rewritten queries, then implement and use my tools for doc retrieval and web search to power multi-node workflows.
Create Retrieve Docs Node8:08
Create a retrieve node that fetches user question from state messages, calls the document retrieval tool with default k, logs retrieved documents to debug logs, and returns results for grading.
Create Documents Grading Node8:49
Create a document grading node that uses a router-based decision to route to answer or rewrite query, via a structured Pydantic data model and a boolean relevance field.
Create Re-Write User Query Node5:04
Learn to create a rewrite query node that transforms the user question into a concise, retrieval-targeted prompt for document search, integrating it with a web search retriever.
Create External Knowledge Base Retriever (Web Search) Node7:36
Build a web search node using DuckDuckGo to retrieve external knowledge by rewriting the original query, following the research paper and contrasting with internal vector DB results.
Create Answer Generator Node6:48
Create an answer generator node that uses retrieved docs and wave search to generate the final answer, then route to the answer node or through rewrite and web search.
Create Graph Execution Router Logic for Answer and Rewrite Node3:34
Learn to build a graph execution router that routes to the answer or rewrite node based on relevancy, including debug messages and proper node labeling.
Create CRAG Agent in LangGraph8:31
Create a crag agent in LangGraph by wiring retriever, grade, rewrite, web search, and answer nodes, linking edges, compiling, and testing performance.
CRAG Agent Performance Evaluation Part 15:50
conduct a practical crag agent performance evaluation by invoking the agent with a user query, retrieving documents through the retriever, calling tools, and validating the final answer with the grader.
CRAG Agent Performance Evaluation Part 27:27
In this CRAG agent performance evaluation, the speaker demonstrates how a retriever and vector store retriever use rewritten queries and web search, noting single‑company success and multi‑company challenges.

Quick Walk-through to Reflexion RAG Agent5:18
Explore a reflection-based rag agent architecture: draft and revised nodes, self-reflection, evaluator, and retrieval loop, inspired by the reflection language agent with verbal reinforcement learning.
Reflexion Agentic RAG System Design and Notebook Setup8:47
Set up the reflection notebook for the agentic rag workflow, initialize agent state, tools, and structured output, and configure the retrieve-revise loop with a max iteration limit.
Create Draft Node for Initial Answer Trajectory Part 15:32
Create a draft node for an agentic reflection system using a structured llm and Pydantic schema to generate answers and surface missing information for search queries.
Create Draft Node for Initial Answer Trajectory Part 28:23
Create a draft node that formats text into a structured JSON response with answer and reflection, capturing missing information and search queries for the AI message guiding the next node.
Retrieve Documents for Reflection Node Part 16:19
Create a retrieval node that fetches documents from the vector store for each generated search query and assemble retrieved_text into retrieved_docs for the critic agent to use as context.
Retrieve Documents for Reflection Node Part 27:47
Explains how the retrieve documents for reflection node gathers queries, fetches up to three documents per query using mmr and vector db, and formats a combined, searchable result.
Create Revise (Critique) Node with Self Reflection Part 18:27
Create a revise node that critiques and self-reflects on its generated answers, produces search queries, and follows an answer schema with a detailed system prompt to refine results.
Create Revise (Critique) Node with Self Reflection Part 29:10
Explore creating a self-reflection based revise node that critiques its prior answer, outputs JSON data, and tests completion and search queries in a retrieval-based prompt workflow.
Create Router Logic for Revise (Critique) Node5:34
Implement router logic for the revise node by evaluating evaluator feedback, the complete flag, max iterations, and routing via search queries to the retriever, revised node, and reflection rig agent.
Create Reflexion Agentic RAG4:46
Build a reflection agent graph with a graph canvas and shared state. Add draft, retrieve, and revise nodes; connect edges and conditional routing; compile the graph.
Performance Testing of Reflexion Agentic RAG Part 15:00
Conduct performance testing of the reflection agentic RAG system, showcasing prompt engineering, state management, and iterative querying to retrieve documents and generate a final answer.
Performance Testing of Reflexion Agentic RAG Part 25:46
Conduct performance testing of the reflection agentic RAG, detailing draft, retrieve, and revise cycles, including Amazon and Apple 2024 Q1 comparisons and 2023 iPhone and MacBook segment earnings.

Requirements

Basic Python knowledge is helpful, but all steps are explained clearly for beginners.

Description

**This course is not for absolute beginners in AI - you should first learn LangChain fundamentals, then LangGraph, and only after that take this course for the best learning experience.**

Private Agentic RAG with LangGraph and Ollama is an advanced, project-based course that teaches you how to build private, production-ready Retrieval-Augmented Generation (RAG) systems using LangGraph, LangChain, Ollama, ChromaDB, Docling, and Python.

This course is designed for developers who want strong control over their data, full privacy, and complete end-to-end workflows using local LLMs.

You will learn how to build modern RAG systems, implement advanced retrieval pipelines, add agent workflows, use LangGraph state machines, integrate SQL agents, and run everything on your own machine using Ollama. All projects run 100 percent locally, with no external API cost and no data leaving your system.

The entire course is practical. Every concept is explained with step-by-step notebooks, complete Python code, and real examples using SEC financial filings from Amazon, Google, Apple, and Microsoft.

What You Will Learn

Ollama and Local LLM Setup

Install and configure Ollama for private LLM deployment
Use models like Qwen3, GPT-OSS, Llama 3.2, and nomic-embed
Create custom LLMs with Modelfiles
Use Ollama CLI and REST API for text, chat, and embeddings

LangGraph Fundamentals

Build state machines using TypedDict
Create nodes, reducers, and conditional edges
Build multi-step workflows with START/END logic
Visualize execution with diagrams
Understand message accumulation and state merging

Complete RAG Systems (from scratch)

Ingest PDFs using Docling with OCR and table extraction
Build page-level chunks for accurate retrieval
Extract metadata from filenames and LLMs
Remove duplicates using SHA-256 hashing
Store documents in ChromaDB with metadata filters

Two-Stage Retrieval Pipeline

Build metadata filters from natural language
Generate financial keywords using structured LLM outputs
Use ChromaDB with MMR search
Implement BM25Plus re-ranking for better accuracy
Extract headings and sections for improved ranking

Agentic RAG using LangGraph

Build tool-calling agents using the ReAct pattern
Implement document retrieval tools using LangChain
Build agents that call tools multiple times
Add table-based answers with citations
Support multi-turn conversations with memory

Corrective RAG (CRAG)

Grade retrieved documents using a Pydantic schema
Detect irrelevant results and rewrite queries
Add web search fallback using DuckDuckGo
Prevent infinite loops with controlled retries
Generate final answers with correct citations

MySQL SQL Agent

Build a natural-language SQL agent with LangGraph
Retrieve schema, generate SQL, validate, run, and fix errors
Handle multi-table joins and complex metrics
Automatically correct broken SQL queries
Support explanations and safe database access

Financial Document Analysis Project

Work with real SEC filings: 10-K, 10-Q, 8-K
Build a complete RAG system that answers questions like:
- “What was Amazon’s revenue in 2023?”
- “Compare Google and Apple’s cash flow for 2024”
- “Show segment revenue with citations and tables”
Use ChromaDB + BM25 for accurate retrieval
Produce clean, formatted answers with tables and reasoning

Who This Course Is For

Developers and engineers who want to build advanced RAG systems
ML practitioners who want full privacy using local LLMs
AI engineers working on LangGraph, LangChain, or agent systems
Backend developers who want to build real GenAI applications
Anyone interested in private, production-grade LLM workflows

This is an advanced-level course. Good LangGraph or Langchain knowledge is required.

Why This Course Is Different

The entire course runs locally using Ollama
Zero API cost and complete data privacy
Covers modern RAG techniques: PageRAG, CRAG, Reflexion ideas
Real datasets from top tech companies
Covers LangGraph deeply with real production workflows
Includes SQL agents, financial RAG systems, and multi-step agents
Step-by-step, practical, and code-heavy

By the End of This Course You Will Be Able To

Build private, production-ready RAG systems
Deploy and fine-tune local LLMs with Ollama
Build graph-based agents using LangGraph v1
Create advanced retrieval pipelines using MMR and BM25Plus
Analyze financial documents with precise citations
Build SQL agents for natural language database queries
Handle query rewriting, grading, and web fallback
Build complete agentic RAG applications end-to-end

Who this course is for:

For developers and AI learners who want to build private Agentic RAG systems with LangGraph v1 and Ollama.
For anyone who wants practical skills in LangGraph v1, Ollama, and building real AI agents.
For beginners and professionals who want to create private, secure, and advanced RAG workflows.
For developers looking to master Agentic RAG, LangGraph v1 workflows, and local LLMs.

Agentic AI - Private Agentic RAG with LangGraph and Ollama

What you'll learn

Explore related topics

Course content

Introduction3 lectures • 17min

Ollama Setup16 lectures • 1hr 51min

2026 Local LLM Benchmarking: Speed, Coding & Real-World Performance8 lectures • 47min

LangGraph Getting Started6 lectures • 41min

MySQL Agent16 lectures • 1hr 21min

PageRAG - Data Ingestion14 lectures • 1hr 40min

PageRAG - Data Retrieval and Re-Ranking19 lectures • 2hr 9min

PageRAG - Agentic RAG10 lectures • 1hr 8min

Corrective RAG (CRAG)15 lectures • 1hr 38min

Reflexion RAG- Learning through Self-Reflection12 lectures • 1hr 21min

Requirements

Description

Who this course is for: