Mastering Generative AI: Foundations to Advanced Application

Name: Mastering Generative AI: Foundations to Advanced Application
Rating: 4.1 (8 reviews)

Unlock the Power of Generative AI: Build, Deploy, and Innovate with Cutting-Edge AI Technologies.

Created byAlpharithm Technologies Private Limited

Last updated 4/2025

English

What you'll learn

Master Generative AI Basics: Build proficiency with frameworks like LangChain and Hugging Face, developing a strong foundation in core AI principles and tools.
Create Multimodal Apps: Learn to build AI that processes text, audio, and images, including chatbots and document-based interactions using PDFs, Excel, and SQL.
Apply Advanced Image/Text Manipulation: Use AI for image upscaling, recoloring, generative filling, and RAG to enhance and transform multimedia content.
Develop Multi-Agent Systems: Design AI systems with multi-agent collaboration, setting LLM guardrails, API management, and optimized agent configurations.

Course content

14 sections • 40 lectures • 18h 2m total length

Introduction to Generative AI.46:33
This session introduces generative AI, exploring vector embedding, LLMs like GPT-4, and adaptable foundation models. It covers the RAG framework to minimize AI "hallucinations" and emphasizes prompt engineering for effective AI use. A hands-on Grok LLM demo lets participants create an interactive AI application.
How to build AI application with frameworks. Introduction to Llama Index.33:25
Participants deepen their understanding of Generative AI frameworks, focusing on the Llama Index in Google Colab. Through hands-on setup, they explore data embedding, LLM integration, and model selection, gaining practical skills for deploying generative AI applications.
How to use Generative AI to chat with PDF documents.31:47
Learn to build generative AI applications for "chatting with documents," enabling interactive and efficient data retrieval. Through a hands-on setup in Google Colab, they explore document processing with tools like Llama Index and PDF Plumber for tailored, data-specific AI responses.
Which of the following is a limitation of Generative AI?

How to evaluate Generative AI output using confusion matrix.29:16
The Generative AI program, participants learn to evaluate LLM accuracy using metrics like precision, recall, and F1 scores. A hands-on segment guides them through setting up their environment, using a PDF to create question-answer pairs, and assessing model responses to enhance accuracy.
Introduction to Tokenization in Generative AI.30:27
This session covers tokenization in NLP and generative AI, exploring word, sub-word, and character tokenization to enhance contextual understanding. Participants practice token-level evaluation, using metrics like precision, recall, F1, and ROUGE scores for performance insights in NLP tasks.
How to use Generative AI to summarize documents.32:19
The Generative AI program, participants learn AI text summarization techniques—extractive, abstractive, and hybrid—highlighting time-saving and communication benefits. They explore data logging in structured formats like JSON to track AI interactions, enhancing observability and improvement. A hands-on exercise involves generating and analyzing AI summaries for product reviews, providing practical skills in summarization and data analysis.
Precision, Recall, True Positives, and F1 Score in AI Evaluation.

Introduction to Multimodal AI. How to build Audio based GenerativeAI application30:51
The Generative AI program, participants dive into "Multimodal AI," integrating text, audio, video, and images to enhance customer interactions. Through demos of speech-to-text and text-to-speech, they explore challenges like accent variability and noise. A hands-on segment with the Groq API enables participants to create a voice-based AI assistant, applying multimodal AI in real-world scenarios like customer care and accessibility.
How to use Function Calling in Generative AI applications.33:57
"Function Calling in Generative AI" delves into enabling AI to execute tasks like booking tickets or managing finances, moving beyond text generation. Participants explore integration techniques, real-world applications, and the benefits of enhanced interactivity. A hands-on demo with the Grok library demonstrates creating and connecting functions for financial tasks, equipping participants to build dynamic, task-driven AI applications."
How to use Streamlit for building Generative AI applications.36:03
This session introduces Streamlit, a Python library for rapid web app development, transitioning from theory to practical AI application-building. Participants recap key Generative AI topics, then explore Streamlit’s features like interactive widgets and real-time updates, ideal for AI projects. The session includes building a simple app, setting up a GitHub repository, and deploying it on Streamlit Cloud, providing hands-on experience and tools for future AI projects.
Key Features of Streamlit and Speech-to-Text Tools.

How to set Guardrails for LLMs while building Generative AI applications.19:15
The Generative AI Program, participants learn to transition from Google Colab to Streamlit, focusing on code structure, token limits, and system message guardrails. They enhance the UI with widgets and CSS, create a GitHub repository for a Sachin Tendulkar chatbot, and deploy the app to Streamlit. The session concludes with testing the app's functionality and preparing for the next steps in deployment and chatbot limitations.
How to build conversative Generative AI applications.34:33
In this session, participants enhance the summarizer app by adding a PDF uploader, integrating multiple LLM models, and improving the UI with a dropdown for summary types. The session covers backend engineering for smooth model handling and concludes with deploying the app on Streamlit via GitHub.
How to create multiple LLM functionalities inside same Generative AI application28:02
The session focused on building a text summarizer app that condenses large texts into concise summaries. Topics included coding in Google Colab, Streamlit app building, securely storing API keys, configuring LLM outputs, dynamic prompt assignment, AI roles, guardrails, session state, and CI/CD. The app was tested in GitHub Codespace, deployed on Streamlit Cloud, and demonstrated secure API key storage and dynamic functionality.
Guardrails and Multi-Functionality in Conversational AI Applications.

How to configure multiple LLM models inside same Generative AI application.23:09
In this session, participants enhance the summarizer app by adding a PDF uploader, integrating multiple LLM models, and improving the UI with a dropdown for summary types. The session covers backend engineering for smooth model handling and concludes with deploying the app on Streamlit via GitHub.
How to use Huggingface Platform in building Generative AI application.29:15
In this session, participants explore Hugging Face’s open-source tools, including the Model Hub, Datasets, and Spaces, to drive AI innovation. They learn about the Inference API for applications like chatbots and image generation, focusing on scalability and rate limiting. The session concludes with a hands-on demo, where participants set up accounts and deploy a Streamlit app for text-to-image generation.
How to use Image Classification techniques in Generative AI application.26:59
In this session, participants explore image classification in generative AI, focusing on CNNs, data preprocessing, and model optimization techniques. They set up a GitHub project and develop a Streamlit app for gender classification and AI-driven image detection. The session concludes with deploying the app on Streamlit and discussing detection accuracy and threshold settings.
Integrating LLMs, Hugging Face, and Image Classification in Generative AI.

How to use FastHtml for building Generative AI applications.29:39
This session introduces "Fast HTML," a Python web framework known for speed, simplicity, and scalability. Participants explore its advantages over Streamlit, review SQLite integration, and learn deployment options. A hands-on demo covers creating applications with features like image generation and parallel prompting, concluding with running a Fast HTML app.
How to manage api calls in Generative AI applications using limitation, deletion25:27
In this session, participants enhance their Fast HTML app with features like image deletion for storage management and an image counter to track usage. They implement a free image generation limit to encourage paid upgrades and prevent overuse. Hands-on demos cover feature implementation, setting limits, and managing upgrade prompts.
How to use Retrieval Augumented Generation for grounding LLMs.31:16
This session introduces Retrieval Augmented Generation (RAG) to enhance language model accuracy by grounding responses with retrieved knowledge. Participants learn about vector stores, embeddings, and their role in storing and retrieving unstructured data. The session covers tools like ChromaDB and Pinecone, highlighting their benefits and use cases.
FastAPI, API Management, and RAG for Generative AI Applications.

How to use Vector Embeddings to build effective Generative AI application.36:35
This session dives into Retrieval Augmented Generation (RAG), focusing on reducing hallucination and grounding LLM responses. Participants learn to preprocess data, create embeddings, and store them in vector databases like Astra DB. A hands-on demo covers the complete RAG workflow, from document chunking to generating accurate, query-based responses.
How to improve RAG applications using techniques like Reranking, Query Expansion35:06
This session delves into Retrieval Augmented Generation (RAG), covering data retrieval accuracy, multi-chunk fusion, and evaluation metrics like BLEU and ROUGE. Participants explore re-ranking, query expansion, and human feedback to refine RAG systems with hands-on demos and best practices.
How to use vector databases in building Generative AI applications.32:15
In this session, participants build a local vector database from a PDF using Faiss, focusing on embedding storage and efficient similarity searches. They explore tools like Pickle files for secure data handling and caching for optimization, with hands-on demos showcasing dimensional embeddings and query retrieval accuracy.
Optimizing Generative AI with Vector Embeddings, RAG, and Databases.

How to use Langchain framework to build effective Generative AI application.23:37
Intro to Computer Vision & Image-Based Generative AI.22:13
Participants explore object detection in computer vision using YOLO, focusing on bounding boxes, confidence scores, and real-world applications. Hands-on exercises cover YOLO detection and LLM integration for summarizing detected objects and their details.
How to use Generative AI to upscale image quality of low resolution images.16:58
Participants explore image upscaling, enhancing resolution using AI techniques like GANs and VAEs compared to traditional methods. Hands-on exercises with Cloudinary demonstrate applications in photography, gaming, and medical imaging for sharper, realistic visuals.
Building Image-Based Generative AI with LangChain and Vision Techniques.

How to use Generative AI to extend size of images to bigger sizes.23:28
In this session, participants explore generative AI fill to expand, repair, or modify images by generating seamless additional content. They learn techniques like texture synthesis, image outpainting, and background completion, with hands-on practice using Cloudinary to extend images for real-world applications in design and media.
How to use Generative AI to replace an object from a image.18:54
In this session, participants explore Generative Replace, an AI technique for seamlessly substituting objects in images. They learn to use tools like Cloudinary to automate replacements, leveraging GANs and object detection for realistic results, with applications in e-commerce, marketing, and content creation.
How to use Image-to-Text in building effective Generative AI Applications.23:18
In this session, participants explore Image-to-Text AI, which interprets images to generate descriptive text, enhancing accessibility and automating tasks like cataloging and storytelling. They learn about models like LAVA and LAMA 3.1, combining vision and language for detailed descriptions and narratives.
Generative AI: Image Expansion, Object Replacement, and Image-to-Text.

How to use Generative AI to remove unwanted portions of images.24:26
In this session, participants explore AI-driven image editing, enabling users to edit images through text commands like 'remove the chair,' without traditional skills. Using tools like Cloudinary’s Generative Remove, they learn object removal, non-destructive editing, and accessibility-focused techniques powered by advanced AI object detection.
How to use Generative AI to change the color of object inside a image.20:07
In this session, participants explore AI-driven image recoloring, enabling quick, realistic color changes to specific items in images using text commands. Leveraging tools like Cloudinary’s API, they learn to efficiently showcase product variations with advanced object detection and natural language processing.
How to use Generative AI to repair portions of image that has been torn.21:50
In this session, participants explore AI-powered image restoration, a fast and efficient technique to repair old or damaged photos using tools like Cloudinary. They learn about key technologies such as GANs, deep learning models, and image in-painting for seamless restoration, including a hands-on demonstration.
Generative AI: Remove, Recolor, and Repair Images with Natural Language.

Requirements

Basic Python Knowledge: Understanding Python fundamentals is essential for working with AI scripts.
Familiarity with AI/ML Concepts: A basic understanding of large language models, data preprocessing will help in comprehending Generative AI techniques.

Description

The Generative AI Mastery: This comprehensive course is crafted for AI enthusiasts, data scientists, and professionals looking to deepen their expertise in Generative AI. Covering a wide range of AI capabilities, it guides learners through building, refining, and evaluating AI systems capable of generating, analyzing, and modifying text, images, and audio. Using cutting-edge frameworks such as LangChain, Llama Index, and Hugging Face, students will gain hands-on experience with core Generative AI techniques, including Retrieval-Augmented Generation (RAG), image classification, vector embeddings, and model fine-tuning.

Throughout the course, you’ll explore how to set practical guardrails, ensure model alignment, and manage multiple large language models (LLMs) within a single application. Each module combines theory with hands-on projects, helping students put their skills to work on real-world tasks. Projects include document analysis, interacting with and analyzing SQL databases via natural language, and voice cloning. These projects, along with advanced multimodal exercises, will solidify your understanding of AI's practical applications. By course completion, you’ll be equipped with the skills to design, deploy, and innovate in the field of Generative AI, allowing you to harness its full potential across diverse industries and applications, and empowering you to develop impactful AI-driven solutions in today's fast-evolving tech landscape.

Who this course is for:

Aspiring AI Developers
Data Scientists and Machine Learning Engineers
AI Enthusiasts and Hobbyists
Product Managers or Entrepreneurs
AI Researchers or Academics
Professionals Seeking Career Transition into AI

Mastering Generative AI: Foundations to Advanced Application

What you'll learn

Explore related topics

Course content

Introduction to Generative AI: Building Applications with Llama Index3 lectures • 1hr 52min

Evaluating Gen AI: Tokenization, Confusion Matrix & Document Summarization3 lectures • 1hr 32min

Multimodal AI: Building Audio Apps, Function Calling & Streamlit Applications3 lectures • 1hr 41min

Guardrails and Multi-Functionality in Conversational AI Applications3 lectures • 1hr 22min

Integrating LLMs, Hugging Face, and Image Classification in Generative AI3 lectures • 1hr 19min

FastAPI, API Management, and RAG for Generative AI Applications3 lectures • 1hr 26min

Optimizing Generative AI with Vector Embeddings, RAG, and Databases3 lectures • 1hr 44min

Building Image-Based Generative AI with LangChain and Vision Techniques3 lectures • 1hr 3min

Generative AI: Image Expansion, Object Replacement, and Image-to-Text3 lectures • 1hr 6min

Generative AI: Remove, Recolor, and Repair Images with Natural Language3 lectures • 1hr 6min

Requirements

Description

Who this course is for: