
Explore the evolution of NLP to LM, examine core deep learning models behind LMS, and build privacy-aware web apps with Streamlit, including fine-tuning and launching.
Trace the evolution of natural language processing from rule-based systems to large language models, covering tokenization, lemmatization, and key milestones like BERT and GPT.
Explore core natural language processing concepts, including sentence segmentation and tokenization, regular expressions, stemming with Porter, lemmatization with Spacy, stopwords, pos tagging, and dependency parsing.
Encode text into numerical representations for machine learning by applying tokenization and subword methods like BPE and sentencepiece, and contrast sparse bag-of-words with dense embeddings.
Explore how RNNs address memory and persistence with loops and a hidden state that updates from the current input and previous state to enable time series forecasting and next-word prediction.
Learn how LSTMs use memory cells and four interacting layers to capture long term dependencies, address vanishing gradients, and improve next word predictions with input, forget, and output gates.
Build a simple lstm language model with keras and tensorflow 2.x, using tokenization, padding, and embeddings to predict next words, then discuss limitations of a tiny dataset and simple architecture.
Transformers enable parallel processing and excel at long-range dependencies via multi-head self-attention with query, key, and value. The lecture shows building a PDF question answering tool with Hugging Face models.
Build a chatgpt-like chatbot in a Streamlit web app using an OpenAI API key, focusing on data privacy while generating topics, sections, and content.
Upload a PDF and chat with contents using Streamlit, PDF plumber, and OpenAI API key; extract text, prompt the model, and display answers, with warnings for missing PDFs or questions.
Build a web app that fetches content from a user-provided page, extracts paragraph text with beautifulsoup, and uses a Hugging Face transformers summarizer via streamlit to display bullet points.
Tune a GPT-2 model on your dataset to improve privacy and knowledge repository. Use Colab, PyTorch, and Transformers to train a custom dataset with padding for roleplay prompts.
Fine-tune the GPT-3.5 model with a five-step workflow, prepare json-lines data with roles like system, user, and assistant, and deploy a ready-to-use fine-tuned model.
Explore LangChain's modular components, chains, and agents that streamline language model applications and enable interactions with OpenAI or Hugging Face, with practical setup and testing.
Explore prompt engineering with long chain and Lang, using prompt templates to reduce bias and craft structured or unstructured responses, demonstrated by extracting NPS content from a PDF.
Employ a csv agent powered by OpenAI and pandas to answer natural language questions about an insurance dataset, revealing record counts and BMI-based obesity insights.
Explore evaluation metrics for llms: perplexity, bleu, and rouge. Learn about n-grams, brevity penalties, and bias detection with the weak method and cosine similarity in embeddings.
Predict the next word with a GPT-2 model and compute perplexity to evaluate language models. Use transformers and torch, with tokenizers, and run on GPU or CPU depending on availability.
Evaluate summarization quality with Rouge metrics by comparing generated and reference summaries, using Rouge one, Rouge two, and Rouge L, with stemming and reporting precision, recall, and F1.
Assess bias in Bert using a word embedding association test with male/female names and career/family words, computing embeddings, cosine similarity, and a wheat score; p value 0.0885 signals borderline significance.
Compare retrieval augmented generation (rag) with fine tuning, showing how rag retrieves external knowledge to reduce hallucinations and improve accuracy in dynamic knowledge intensive tasks.
Explore recent advancements in RAG, from naive to modular frameworks, with pre and post retrieval, iterative and recursive retrieval, and modular racks that integrate search, memory, and adapters.
Build a retrieval augmented generation rag chat app for pdfs by extracting text from a document, retrieving external information from Wikipedia, and generating responses with OpenAI.
Implement a retrieval augmented generation system using a sentence transformer to fetch the relevant FAQ answer from a similarity index and generate a detailed response with GPT two language model.
Compare RAG systems, which merge retrieval with generative AI to deliver accurate outputs from external sources, with AI agents that autonomously plan actions and adapt to dynamic goals.
Build a financial analyst agent using the five framework with OpenAI's GPT-4 and Yahoo Finance tools. Access real-time stock prices, company information, analyst recommendations, and news within multimodal workflows.
Develop an ai powered management consultant agent to curate engaging, high quality Forbes article content using GPT-4 and XR tools, emphasizing research, citation, and structured writing.
Create an interactive ai powered data analysis assistant that loads local datasets, validates file paths, and answers questions with natural language using gpt-4 and duckdb.
Develop a multifunctional AI assistant with a web agent and a finance agent, powered by the five framework and GPT-4, for web searches and financial data insights.
Welcome to a succinct and dynamic course tailored for individuals keen on diving into the world of Large Language Models (LLMs) and building custom applications. Whether you're a corporate IT professional concerned with data privacy or an AI enthusiast eager to leverage the power of LLMs, this course is designed for you!
Here’s a sneak peek of what we’ll explore:
Journey from NLP to LLMs:
Unveil the evolution from Natural Language Processing (NLP) to the advent of LLMs and understand the significance of this progression.
Technology Underpinning LLMs:
Delve into the core technologies driving LLMs including Recurrent Neural Networks (RNNs), Long Short-Term Memory networks (LSTMs) and Transformers.
Fine Tuning
Building LLM-based Web Applications:
Get hands-on with Streamlit and the OpenAI API to construct interactive web applications powered by LLMs.
Step-by-step guidance on crafting web applications for chatbots, engaging with PDF files, and summarizing web pages.
Building RAG applications
Introduction to LangChain:
Explore LangChain as a framework to further enhance your LLM application development experience.
Resources
All the codes and datasets used in the program are provided as downloadable resources.
Refresher on Core NLP:
A module for those wanting to brush up on the fundamentals of NLP to better grasp the advanced concepts presented.
This course is a living entity! As the field of LLMs evolves, so will the course material to ensure you stay updated with the latest advancements.
This course is perfectly suited for those aspiring to craft custom applications to harness the boundless potential of AI while being mindful of data privacy. Seize this opportunity to step into a world where language and technology intertwine seamlessly and embark on a learning journey that’s as engaging as it is enlightening!
Join us, as we unravel the mysteries of Large Language Models!