Complete Generative AI Course: RAG, AI Agents & Deployment

Name: Complete Generative AI Course: RAG, AI Agents & Deployment
Rating: 4.4 (1977 reviews)

Learn Generative AI from scratch – Build RAG, AI Agents & Chatbots, master MCP, and deploy real-world projects

Created bySiddhardhan S

Last updated 5/2026

English

German [Auto],English [Auto],

What you'll learn

Master the foundations of Generative AI, Large Language Models, and Transformer architecture.
Build real-world AI applications including chatbots, RAG systems, MCP servers, and multi-agent systems.
Deploy LLM-powered solutions on the cloud using Docker, Streamlit, Ollama, vLLM, and AWS EC2.
Gain the knowledge and hands-on skills required to step into a Generative AI Engineer role.

Course content

12 sections • 70 lectures • 23h 9m total length

Introduction1:32
Join this hands-on generative ai course for beginners to build chatbots and retrieval-powered apps, master MCP, context protocol, and deployment with docker, aws, ec2, and fast api, becoming industry-ready.
What you will learn2:42
Discover generative AI foundations, including LLMs, transformers, and prompt engineering, then build chatbots, retrieval-augmented generation pipelines, AI agents with MCP, and deploy capstone projects.
Environment Setup: Python, IDEs & Dev Tools1:20
Begin with Python essentials and notebooks, using Jupyter Notebook, Jupyter Lab, or Google Colab, then switch to PyCharm or VS Code, and later incorporate Docker and AWS EC2 for deployments.

AI vs ML vs DL vs GenAI4:21
Clarify the hierarchy from artificial intelligence to machine learning, deep learning, and generative AI, and show how generative AI creates new content with examples like Siri, ChatGPT, and DALL-E.
Large Language Models12:36
Explore large language models and transformer architecture, their applications from chatbots to document question answering with RAG, and the tradeoffs of proprietary versus open source models.
Transformer - architecture11:25
Demystifies transformer architecture, showing how self-attention powers modern LLMs and how encoders and decoders convert text to embeddings and generate output.
Section 2- Quiz (GenAI Foundation)

OpenAI LLMs (Proprietary)20:51
Learn to call OpenAI LLMs in Python, access proprietary and open-source models, and integrate with frameworks like long chain and llama index, whether in the cloud or locally.
Gemini LLMs (Proprietary)10:37
Connect Python with Google's Gemini LLMs to generate content, reason, and code, using the Gemini API, install the library, set the API key, and build multi-turn chat apps.
Groq LLMs (Open-Source)12:07
Explore grok, an open source llm platform, and learn to set up its Python interface, obtain an API key, and run your first request with Llama or Deep Seek.
Ollama (Open-Source & Local)15:08
learn to run open-source language models locally with Ollama, installing, loading models like gamma two billion, and interacting via Python and Jupyter Lab for a developer-friendly local LLM workflow.
Accessing LLMs via LangChain16:45
Learn to access multiple llms from OpenAI, Gemini, and Grok through a single LangChain interface. Build prompts, memory, data connections, and pipelines while easily switching providers for real-world AI apps.
Accessing LLMs via LlamaIndex8:46
Section 3 - Quiz (Accessing LLMs)
Practice Exercise - Section 30:26

Understanding RAG15:51
Learn how retrieval augmented generation overcomes llm limitations by linking to external knowledge sources, using document ingestion, embeddings, and a vector database to provide reliable, up-to-date, grounded answers.
Important Update: LangChain Changes for RetrievalQA1:06
Please read this before watching the next video on LangChain RAG video
Building a RAG system in Python with LangChain50:22
Building a RAG system in Python with Llamaindex37:35
Build a retrieval augmented generation workflow using Llama index to ingest PDFs, index with embeddings in Chroma DB, and retrieve context for language model based answers.
Build a PDF question-answering RAG app with Streamlit30:21
Section 6 - Quiz (RAG)
Practice Exercise - Section 61:02

Understanding AI Agents9:44
Explore AI agents as autonomous systems that perceive, reason, plan, and act to achieve user-defined goals using tools, memory, and APIs, with real-world applications and popular frameworks.
Build AI Agent with PydanticAI35:34
Build an AI agent with pedantic AI by defining inputs and outputs, registering custom tools, and executing a weather forecast tool to fetch current conditions.
Build AI Agent with Microsoft's AutoGen13:40
Build a weather agent with Microsoft's Autogen to show how multi-agent collaboration, conversation driven workflows, and easy tool integration power real-time weather queries via external APIs.
Multi-Agent system with CrewAI39:40
Master multi-agent systems with crew AI, coordinating a stock research agent and a trader agent to use live market data and decide buy, hold, or sell.
Section 7 - Quiz (AI Agents)
Practice Exercise - Section 71:03

Running LLMs Locally with Ollama & Docker25:51
Deploy open source LLMs locally and in production using Docker and Ollama. Move to cloud with EC2, RunPod, and VLM, and expose models via a FastAPI REST API.
Launching an AWS EC2 Instance21:39
Launch an AWS EC2 GPU instance (G5 xlarge) to run larger LLMs with Nvidia GPU and PyTorch. Manage costs with quotas, security groups, and SSH access, then terminate when finished.
Deploying Ollama LLMs on EC2 with Docker21:10
Deploy Ollama llms on AWS EC2 with Docker to run llama models in a GPU-enabled cloud. Expose port 11434 and connect from your local machine to access the model.
vLLM - High-Performance Serving on EC220:47
Serve Local LLMs (Ollama) via FastAPI26:46
Wrap locally running Ollama models with a FastAPI server to expose chat endpoints for front-end apps, using env configuration, pydantic schemas, and dockerizing with Docker Compose.
Deploying LLMs on RunPod (Cost-effective GPU)23:48
Deploy a FastAPI LM on RunPod to run in the cloud with on-demand GPU at a fraction of the cost, enabling access from anywhere.

Understanding MCP8:30
Discover the Model Context Protocol (MCP), a universal plug-and-play standard unifying AI models, tools, and data sources. Learn how MCP's host, client, and server enable modular, secure AI development.
Build an MCP Server32:51
Build an MCP server in Python to wrap an existing weather tool, learn MCP host and client roles, and connect it to Cloud Desktop using stdio transport.
Pydantic AI Agent with MCP tool19:00
Connect a Pydantic AI agent with an MCP tool to control an MVP weather server, enabling the agent to call the weather tool, process responses, and deliver conversational outputs.
CrewAI Agent with MCP tool20:16
Learn to plug an MCP tool into crew AI to query a weather MCP server with a single weather tool, returning a natural language forecast for a city.
Section 9 - Quiz (MCP)
Practice Exercise - Section 91:00

Section 10 - Capstone Projects - Real-World GenAI Applications1:58
Project 1 - ConvoPro – Private ChatGPT Clone2:49
Build a private, customizable ChatGPT clone with a streamlit UI, local llm hosting, and mongodb-backed chat history, deployable via docker or aws ec2 for secure, private use.
Project 1 - ConvoPro - DB & Environment Setup30:14
Set up the ConvoPro project by creating a GitHub repository and configuring a git workflow. Link PyCharm and install a virtual environment, requirements, env templates, MongoDB, llama models.
Project 1 - ConvoPro - Implementation44:28
Learn to implement a self-contained generative AI chat app with ConvoPro by building modular components—config, MongoDB-backed DB, LM factory, and Streamlit UI—loading llama models locally and generating titles for conversations.
Project 1 - ConvoPro - Deploy on EC250:56
Deploy ConvoPro on an EC2 instance to share a public chat interface. Configure GPU-accelerated models, MongoDB, and a Streamlit app via Docker and GitHub deployment.
Project 2 - StudyPal – RAG-Powered AI Study Assistant3:48
Develop a rag-powered ai study assistant with subject and chapter selection, simple-language explanations, video references, chat history, deployable on Streamlit with Grok LLM, Chroma DB, LangChain, and AWS EC2.
Project 2 - StudyPal - Environment Setup14:02
Set up study pal environment by creating a GitHub repo, cloning it, configuring a Python virtual environment, and installing libraries from requirements.txt for ingestion and chat with LangChain and Chroma.
Project 2 - StudyPal - Document Ingestion12:16
Project 2 - StudyPal - RAG Pipeline Implementation23:11
Build a rag pipeline with two vector databases: full book and per chapter, and connect a streamlit frontend with a conversational retrieval chain and a YouTube video search.
Project 2 - StudyPal - EC2 Deployment21:12
Project 3 - AstraRAG - Agentic RAG Chatbot - Production-Grade3:10
Create a production-grade agentic RAC chatbot as an end-to-end cloud app, using FastAPI, Streamlit, Grok, Chroma DB, Llama index, Docker, and AWS EC2, with explainable, document-grounded answers.
Project 3 - AstraRAG - Environment Setup14:10
Project 3 - AstraRAG - Document Ingestion Pipeline14:09
Project 3 - AstraRAG - Build RAG Agent23:29
Project 3 - AstraRAG - Build Backend & Frontend26:37
Build a AstraRAG grounded chatbot with backend and frontend, using fast API endpoints and Streamlit UI, enabling document ingestion, knowledge grounding, and explainability with sources, tools used, and rationale.
Project 3 - AstraRAG - Deploy locally with Docker35:21
Deploy the AstraRAG chatbot with Docker by building a Docker image from a Dockerfile, running containers for the API and Streamlit frontend, and deploying to EC2 via GitHub steps.
Project 3 - AstraRAG - EC2 Deployment with Docker16:50
Conclusion1:24
Conclude your journey in generative AI by mastering chatbots, AI agents, deployment, and MCP, linked to capstone projects, with ongoing appendix modules and future tool updates to stay prepared.

Requirements

This course requires only a basic understanding of Python and Machine Learning. No prior knowledge of Generative AI is needed — we start from the fundamentals and progress to advanced concepts. All you need is the curiosity to learn by building real-world projects.

Description

This complete Generative AI course takes you from beginner to advanced with hands-on projects, real-world applications, and career-ready skills. You’ll learn the foundations of Generative AI, explore Large Language Models (LLMs), master frameworks like LangChain, LlamaIndex, CrewAI, and PydanticAI, and deploy your own AI solutions on the cloud. The course is tailored to equip you with both the knowledge and practical experience required to step into a Generative AI Engineer role.

Each section includes quizzes & coding exercises to help you test your knowledge and reinforce your skills.

What you’ll learn in each section

1. Introduction – Get started with the course, understand what you will learn & set up Python environments (Colab, Jupyter, PyCharm).
2. Generative AI – Foundation – Understand AI vs ML vs DL vs GenAI, dive into Large Language Models, and learn the Transformer architecture.
3. Accessing LLMs in Python – Use OpenAI, Gemini, Groq, and Ollama LLMs, and connect them through LangChain and LlamaIndex.
4. Prompt Engineering – Explore prompt templates, zero-shot, and few-shot prompting to effectively interact with LLMs.
5. Building GenAI Chatbots – Build and deploy chatbots step by step using LangChain, LlamaIndex, Streamlit UI, and Streamlit Cloud.
6. Retrieval-Augmented Generation (RAG) – Understand RAG, build RAG pipelines with LangChain and LlamaIndex, and create a PDF Q&A bot.
7. AI Agents – Learn what AI agents are and build agents with PydanticAI, AutoGen, and CrewAI for multi-agent workflows.
8. LLM Deployment – Deploy open-source LLMs with Ollama, Docker, and vLLM, and set them up on AWS EC2 for real-world usage.
9. Model Context Protocol (MCP) – Understand MCP, build an MCP server, and integrate MCP tools with PydanticAI and CrewAI agents.
10. Capstone Projects – Apply everything learned to build real-world AI projects: Enterprise Chatbots, RAG Assistants, and Intelligent AI Agents with Full Cloud Deployment.

Who this course is for:

This course is for students, developers, and professionals with basic Python/ML knowledge who want to become Generative AI Engineers through hands-on projects.

Complete Generative AI Course: RAG, AI Agents & Deployment

What you'll learn

Explore related topics

Course content

Introduction3 lectures • 6min

Generative AI – Foundation3 lectures • 28min

Accessing LLMs in Python7 lectures • 1hr 25min

Prompt Engineering4 lectures • 37min

Building Generative AI Chatbots5 lectures • 1hr 39min

RAG - Retrieval-Augmented Generation6 lectures • 2hr 16min

AI Agents5 lectures • 1hr 40min

LLM Deployment6 lectures • 2hr 20min

MCP – Model Context Protocol5 lectures • 1hr 22min

Capstone Projects – Build and Deploy Real-World AI Solutions18 lectures • 5hr 40min

Requirements

Description

Who this course is for: