What you'll learn

Technology behind LLMs
How to build an LLM programmatically
How to fine tune an LLM
How to build web based applications using LLM
What is LangChain
How to use LangChain in applications
How to evaluate a large language model
How to assess bias in an LLM
How to fine tune GPT 3.5 easily
What is RAG
How does RAG differ from fine tuning
What are the recent advances in RAG
How to build a RAG application

Course content

8 sections • 29 lectures • 2h 50m total length

Introduction1:14
Explore the evolution of NLP to LM, examine core deep learning models behind LMS, and build privacy-aware web apps with Streamlit, including fine-tuning and launching.
Understanding the evolution4:36
Trace the evolution of natural language processing from rule-based systems to large language models, covering tokenization, lemmatization, and key milestones like BERT and GPT.
Core NLP Concepts behind LLMs12:22
Explore core natural language processing concepts, including sentence segmentation and tokenization, regular expressions, stemming with Porter, lemmatization with Spacy, stopwords, pos tagging, and dependency parsing.
Embedding | BPE | Sentencepiece8:56
Encode text into numerical representations for machine learning by applying tokenization and subword methods like BPE and sentencepiece, and contrast sparse bag-of-words with dense embeddings.

RNN10:12
Explore how RNNs address memory and persistence with loops and a hidden state that updates from the current input and previous state to enable time series forecasting and next-word prediction.
LSTM6:05
Learn how LSTMs use memory cells and four interacting layers to capture long term dependencies, address vanishing gradients, and improve next word predictions with input, forget, and output gates.
Build a language model using LSTM11:37
Build a simple lstm language model with keras and tensorflow 2.x, using tokenization, padding, and embeddings to predict next words, then discuss limitations of a tiny dataset and simple architecture.
Transformers8:34
Transformers enable parallel processing and excel at long-range dependencies via multi-head self-attention with query, key, and value. The lecture shows building a PDF question answering tool with Hugging Face models.

LLM Chatbot9:09
Build a chatgpt-like chatbot in a Streamlit web app using an OpenAI API key, focusing on data privacy while generating topics, sections, and content.
PDF Chatbot5:42
Upload a PDF and chat with contents using Streamlit, PDF plumber, and OpenAI API key; extract text, prompt the model, and display answers, with warnings for missing PDFs or questions.
Webpage Summarizer3:55
Build a web app that fetches content from a user-provided page, extracts paragraph text with beautifulsoup, and uses a Hugging Face transformers summarizer via streamlit to display bullet points.

Tuning an LLM (GPT2) model7:57
Tune a GPT-2 model on your dataset to improve privacy and knowledge repository. Use Colab, PyTorch, and Transformers to train a custom dataset with padding for roleplay prompts.
Fine Tuning GPT 3.5 Model | Simpler & Easier5:12
Fine-tune the GPT-3.5 model with a five-step workflow, prepare json-lines data with roles like system, user, and assistant, and deploy a ready-to-use fine-tuned model.

Understanding Langchain4:24
Explore LangChain's modular components, chains, and agents that streamline language model applications and enable interactions with OpenAI or Hugging Face, with practical setup and testing.
Prompt Engineering with LangChain3:28
Explore prompt engineering with long chain and Lang, using prompt templates to reduce bias and craft structured or unstructured responses, demonstrated by extracting NPS content from a PDF.
Chat with an Excel document5:49
Employ a csv agent powered by OpenAI and pandas to answer natural language questions about an insurance dataset, revealing record counts and BMI-based obesity insights.

Different Measures to evaluate an LLM6:24
Explore evaluation metrics for llms: perplexity, bleu, and rouge. Learn about n-grams, brevity penalties, and bias detection with the weak method and cosine similarity in embeddings.
Perplexity4:18
Predict the next word with a GPT-2 model and compute perplexity to evaluate language models. Use transformers and torch, with tokenizers, and run on GPU or CPU depending on availability.
ROUGE3:44
Evaluate summarization quality with Rouge metrics by comparing generated and reference summaries, using Rouge one, Rouge two, and Rouge L, with stemming and reporting precision, recall, and F1.
Assessing Bias in A Language Model6:51
Assess bias in Bert using a word embedding association test with male/female names and career/family words, computing embeddings, cosine similarity, and a wheat score; p value 0.0885 signals borderline significance.

What is RAG | RAG Vs Fine Tuning6:09
Compare retrieval augmented generation (rag) with fine tuning, showing how rag retrieves external knowledge to reduce hallucinations and improve accuracy in dynamic knowledge intensive tasks.
Recent Advancements in RAG6:11
Explore recent advancements in RAG, from naive to modular frameworks, with pre and post retrieval, iterative and recursive retrieval, and modular racks that integrate search, memory, and adapters.
RAG Application : PDF Chatbot3:43
Build a retrieval augmented generation rag chat app for pdfs by extracting text from a document, retrieving external information from Wikipedia, and generating responses with OpenAI.
RAG Application - FAQ3:56
Implement a retrieval augmented generation system using a sentence transformer to fetch the relevant FAQ answer from a similarity index and generate a detailed response with GPT two language model.

RAG Vs Agents3:55
Compare RAG systems, which merge retrieval with generative AI to deliver accurate outputs from external sources, with AI agents that autonomously plan actions and adapt to dynamic goals.
Financial Analyst Agent4:04
Build a financial analyst agent using the five framework with OpenAI's GPT-4 and Yahoo Finance tools. Access real-time stock prices, company information, analyst recommendations, and news within multimodal workflows.
Management Consultant Agent3:51
Develop an ai powered management consultant agent to curate engaging, high quality Forbes article content using GPT-4 and XR tools, emphasizing research, citation, and structured writing.
Data Analysis Agent3:59
Create an interactive ai powered data analysis assistant that loads local datasets, validates file paths, and answers questions with natural language using gpt-4 and duckdb.
Multi Agent Team4:01
Develop a multifunctional AI assistant with a web agent and a finance agent, powered by the five framework and GPT-4, for web searches and financial data insights.

Requirements

If you dont know Python programming or Machine learning, do not take this course.
Python programming
Basic understanding of machine learning
Basic understanding of NLP, although we are providing a refresher
An inquisitiveness to explore
An openness to experiment since evolution is happening at a fast pace

Description

Welcome to a succinct and dynamic course tailored for individuals keen on diving into the world of Large Language Models (LLMs) and building custom applications. Whether you're a corporate IT professional concerned with data privacy or an AI enthusiast eager to leverage the power of LLMs, this course is designed for you!

Here’s a sneak peek of what we’ll explore:

Journey from NLP to LLMs:
- Unveil the evolution from Natural Language Processing (NLP) to the advent of LLMs and understand the significance of this progression.
Technology Underpinning LLMs:
- Delve into the core technologies driving LLMs including Recurrent Neural Networks (RNNs), Long Short-Term Memory networks (LSTMs) and Transformers.
Fine Tuning
Building LLM-based Web Applications:
- Get hands-on with Streamlit and the OpenAI API to construct interactive web applications powered by LLMs.
- Step-by-step guidance on crafting web applications for chatbots, engaging with PDF files, and summarizing web pages.
Building RAG applications
Introduction to LangChain:
- Explore LangChain as a framework to further enhance your LLM application development experience.
Resources
- All the codes and datasets used in the program are provided as downloadable resources.
Refresher on Core NLP:
- A module for those wanting to brush up on the fundamentals of NLP to better grasp the advanced concepts presented.

This course is a living entity! As the field of LLMs evolves, so will the course material to ensure you stay updated with the latest advancements.

This course is perfectly suited for those aspiring to craft custom applications to harness the boundless potential of AI while being mindful of data privacy. Seize this opportunity to step into a world where language and technology intertwine seamlessly and embark on a learning journey that’s as engaging as it is enlightening!

Join us, as we unravel the mysteries of Large Language Models!

Who this course is for:

ML engineers and data scientists
Students
Tech enthusiasts

What you'll learn

Explore related topics

Course content

Introduction4 lectures • 27min

Deep Learning models4 lectures • 36min

Deploying LLM based web application using streamlit3 lectures • 19min

Tuning the LLMs with your dataset2 lectures • 13min

LangChain3 lectures • 14min

Evaluating a Large Language Model4 lectures • 21min

Retrieval Augmented Generation (RAG)4 lectures • 20min

AI Agents5 lectures • 20min

Requirements

Description

Who this course is for: