Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

LLM Fine-Tuning with Hugging Face: LoRA, QLoRA, PEFT

Name: LLM Fine-Tuning with Hugging Face: LoRA, QLoRA, PEFT
Rating: 4.3 (774 reviews)

Fine-tune BERT, T5, ViT, LLaMA-style models and Qwen3-TTS using Hugging Face Transformers, custom datasets, LoRA, QLoRA

Bestseller

Created byKGP Talkie | Laxmi Kant

Last updated 6/2026

English

What you'll learn

Understand Hugging Face Transformers and how Transformer models power modern NLP and Generative AI applications.
Use Hugging Face pipelines, checkpoints, datasets, tokenizers, Auto Classes, and Spaces for practical AI projects.
Learn Transformer architecture including attention, QKV vectors, encoder-decoder blocks, and positional encoding.
Fine-tune transformers for text classification, question answering, natural language inference, text summarization, and machine translation.
Understand BERT architecture, masked language modeling, next sentence prediction, and BERT fine-tuning.
Fine-tune BERT for multi-class sentiment classification and build a Streamlit app for real-time prediction.
Fine-tune DistilBERT, MobileBERT, and TinyBERT for fake news detection and performance benchmarking.
Fine-tune Transformer models for NER, text summarization, image classification, and custom NLP tasks.
Learn PEFT, LoRA, QLoRA, 4-bit quantization, and fine-tune LLMs on custom datasets.
Fine-tune LLaMA-style chat models and Qwen3-TTS audio models for voice cloning and custom speech generation.

Course content

15 sections • 183 lectures • 20h 25m total length

Course Introduction2:28
Course Introduction!!!
Course Roadmap3:44
About Me2:08
Keys to Success2:11
Course Resources and Udemy Player Settings2:39
Udemy Rating and Course Certificate1:43
Download Course Code File0:01
Get code files here!!!
Setup Free GPU - Install Requirements.txt6:51
Setup Project for Fine Tuning on Local GPU6:08

Sneak Peek3:03
Introduction to Hugging Face4:08
Hugging Face Models Explained6:28
Hugging Face Models Deep Dive4:38
Hugging Face Model Card Explained6:57
Hugging Face Datasets Explained6:40
Hugging Face Spaces (Free Demo) Explained6:18
Hugging Face Buckets2:45
Hugging Face Transformers Pipeline6:16
Hugging Face AutoClasses2:51
Introduction to Google Colab - Free GPU Setup9:50
Introduction to Major Transformers and LLM Releases5:02
How Text Classification Hugging Face Pipeline Works4:55
Hugging Face Text Classification Pipeline Part 17:24
Hugging Face Text Classification Pipeline Part 24:53
Hugging Face NER Tagging (Token Classification) Part 18:44
Hugging Face NER Tagging (Token Classification) Part 28:01
Hugging Face Question Answering Pipeline7:27
Hugging Face Text Summarization9:56
Hugging Face Translation Pipeline8:27
Hugging Face Text Generation Pipeline11:00
Hugging Face Image Classification Pipeline10:01
Hugging Face Image Segmentation Pipeline8:10
Hugging Face Text to Speech Pipeline5:44
Hugging Face Text to Music Generation5:45

Introduction to Transformer Architecture and LLMs1:46
Build the architectural foundation needed to understand attention, encoders, decoders, and modern LLM fine-tuning.
Seq2Seq Models Explained: Part 16:58
Seq2Seq Models Explained: Part 28:58
Seq2Seq Limitations and the Need for Attention3:50
Applying Attention in Seq2Seq Networks8:43
Query, Key, and Value Vectors Explained12:27
Learn how Q, K, and V vectors power attention mechanisms inside Transformer and LLM architectures.
Scaled Dot-Product Attention Explained9:30
Transformer Encoder and Decoder Stacks11:58
How the Transformer Encoder Works6:19
Positional Encoding in Transformers9:09
Self-Attention, Masked Attention, and Cross-Attention7:47
Understand the attention variants used in encoder, decoder, and encoder-decoder Transformer models.
Multi-Head Attention Explained9:35
How the Transformer Decoder Works6:47
Real-World Transformer Applications4:34
Connect Transformer architecture to practical NLP, Generative AI, and LLM fine-tuning use cases.

Introduction to BERT Architecture1:40
Learn why BERT is one of the most important Transformer models for NLP fine-tuning.
BERT Explained: Bidirectional Transformer Encoder7:57
Why Context Matters in BERT10:04
BERT Paper Terminology: Part 18:42
BERT Paper Terminology: Part 27:43
BERT Architecture Deep Dive8:46
How Input Text is Processed in BERT8:26
MLM and NSP in BERT Pretraining13:24
Understand masked language modeling and next sentence prediction, the core ideas behind BERT pretraining.
BERT Fine-Tuning and Evaluation Workflow12:04
Learn how pretrained BERT is adapted for downstream NLP tasks through supervised fine-tuning and evaluation.

Project Intro: Fine-Tune BERT for Sentiment Classification2:09
Start the hands-on project where BERT is fine-tuned for multi-class Twitter sentiment classification.
BERT Classifier Architecture for Text Classification5:14
Learn how a classification head is added to BERT for supervised text classification.
Load the Twitter Sentiment Dataset4:51
Load the Twitter multi-class sentiment dataset and prepare it for analysis and training.
Analyze the Twitter Sentiment Dataset4:38
Tokenization Workflow for BERT Fine-Tuning5:57
Understand how raw tweet text is converted into token IDs that BERT can process.
Train-Test Split and Dataset Preparation6:17
Tokenize the Sentiment Dataset7:38
BERT Model Configuration Deep Dive6:59
Load BERT with a Classification Head7:15
Load a pretrained BERT model and attach a classification head for custom sentiment labels.
Configure Training Arguments5:35
Build Evaluation Metrics6:16
Train BERT with Hugging Face Trainer4:45
Fine-tune BERT using Hugging Face Trainer and the prepared sentiment dataset.
Evaluate the Fine-Tuned BERT Model6:45
Evaluate the trained model on test data and check classification performance.
Plot the Confusion Matrix5:04
Save BERT and Predict on Custom Text8:34
Save the fine-tuned BERT model and run predictions on new custom text examples.
Build a Streamlit Sentiment Prediction App8:36
Build a simple Streamlit app that uses the fine-tuned BERT model for real-time sentiment prediction.

Introduction to Knowledge Distillation for BERT1:16
Learn why knowledge distillation is used to create smaller, faster Transformer models for production use.
Knowledge Distillation Explained5:32
DistilBERT Loss Functions6:16
DistilBERT Paper Walkthrough: Part 16:33
DistilBERT Paper Walkthrough: Part 25:59
MobileBERT Introduction7:12
MobileBERT Parameter Settings6:31
MobileBERT Knowledge Distillation6:50
MobileBERT Paper Walkthrough: Part 16:54
MobileBERT Paper Walkthrough: Part 29:29
MobileBERT Paper Walkthrough: Part 38:39
TinyBERT Introduction5:05
TinyBERT Paper Walkthrough7:31

Project Intro: Fake News Detection with Distilled BERT Models1:49
Start a classification project using DistilBERT, MobileBERT, and TinyBERT for fake news detection.
Load the Fake News Dataset5:27
Load and clean the fake news dataset used for binary text classification.
Analyze the Fake News Dataset7:38
Prepare Train, Test, and Validation Splits5:50
Tokenize Fake News Text Data10:50
Tokenize article titles or text so distilled BERT models can process the dataset.
Build DistilBERT, MobileBERT, and TinyBERT Models8:42
Configure lightweight Transformer models for fake news classification.
Train Models for Fake News Detection6:13
Fine-tune distilled BERT models using Hugging Face training workflows.
Evaluate Fake News Detection Models5:32
Evaluate model accuracy and compare performance on validation or test data.
Benchmark Distilled Models Against BERT: Part 110:43
Compare DistilBERT, MobileBERT, TinyBERT, and BERT-Base for performance and efficiency.
Benchmark Distilled Models Against BERT: Part 212:51
Continue the benchmark comparison and interpret tradeoffs between speed, size, and accuracy.

Project Intro: Restaurant Search NER with DistilBERT1:52
Start a named entity recognition project using restaurant search data and DistilBERT.
Named Entity Recognition Explained5:07
Learn what NER is and how it identifies entities such as locations, cuisines, and restaurant-related terms.
BIO and IOB Tagging for NER6:31
Understand the tagging format used to label tokens for named entity recognition.
Load the Restaurant NER Dataset: Part 16:30
Load the Restaurant NER Dataset: Part 25:30
Prepare a Hugging Face NER Dataset: Part 16:07
Prepare a Hugging Face NER Dataset: Part 29:14
Build the NER Tokenization Pipeline5:26
Build the DistilBERT tokenizer workflow for token classification.
Align NER Labels with Tokenized Inputs13:06
Learn how to align word-level NER labels with subword tokens, a key step in token classification.
Create Sequence Evaluation Metrics10:23
Fine-Tune DistilBERT for NER5:45
Fine-tune DistilBERT for restaurant search named entity recognition.
Save the NER Model and Run Predictions4:09
Save the fine-tuned NER model and test it on custom restaurant search text.

Project Intro: Fine-Tune T5 for Summarization1:33
Start a sequence-to-sequence project where T5 is fine-tuned for custom dialogue summarization.
Text Summarization in NLP Explained4:10
Learn what summarization is and how it is used in NLP and Generative AI workflows.
Benchmark T5 and BART for Summarization11:18
T5 Transformer and SAMSum Dataset Overview6:44
Load the SAMSum dataset and understand why it is useful for dialogue summarization.
Analyze the SAMSum Dataset6:29
Tokenization for Text Generation9:25
Prepare dialogue and summary pairs for T5 using sequence-to-sequence tokenization.
Fine-Tune T5 for Custom Summarization6:05
Train T5 on the SAMSum dataset using Hugging Face Trainer and seq2seq data collation.
Generate Custom Summaries with Fine-Tuned T55:35
Run inference with the fine-tuned T5 model to generate summaries for custom dialogue text.

Project Intro: Fine-Tune ViT for Food Image Classification1:52
Start a computer vision project using Vision Transformer for Indian food image classification.
Vision Transformer Paper Walkthrough: Part 17:04
Vision Transformer Paper Walkthrough: Part 212:20
Load the Indian Food Image Dataset6:44
Load the Hugging Face image dataset and inspect labels for food classification.
Image Preprocessing and Transforms for ViT12:34
Prepare images using resizing, normalization, tensors, and image processor settings for ViT.
Build Image Classification Metrics2:26
What Is Inside a ViT Model?5:47
Fine-Tune Vision Transformer for Classification8:46
Fine-tune a pretrained Vision Transformer model for Indian food image classification.
Save, Load, and Test the Fine-Tuned ViT Model5:46
Save the trained ViT model and run image classification inference on custom images.

Requirements

Basic Python programming knowledge is required to follow the coding projects and fine-tuning notebooks.
Basic understanding of machine learning or deep learning will be helpful but not strictly required.
Basic NLP knowledge is useful, but important concepts are explained step by step in the course.
A computer with internet access is required. Google Colab or a GPU machine is recommended for training.
No prior Hugging Face experience is needed. You will learn Transformers and fine-tuning from the basics.

Description

Welcome to Fine Tuning LLM with Hugging Face Transformers for NLP, a practical and project-based course designed to help you understand and fine-tune modern Transformer models for real-world AI applications.

This course starts from the basics of Hugging Face Transformers and gradually takes you into advanced fine-tuning workflows. You will learn how pipelines work, how checkpoints and models are used, how Hugging Face datasets are loaded, and how Auto Classes simplify model loading, tokenization, training, and inference.

After building a strong foundation, you will go deeper into Transformer architecture. You will understand Seq2Seq models, attention mechanism, Q, K, V vectors, scaled dot-product attention, encoder-decoder stacks, positional encoding, self-attention, masked self-attention, cross-attention, and multi-head attention.

The course also covers BERT architecture in detail. You will learn how BERT processes input, how masked language modeling and next sentence prediction work, and how BERT is fine-tuned for downstream NLP tasks.

Then you will move into hands-on projects where you will fine-tune Transformer models for practical use cases such as sentiment classification, fake news detection, named entity recognition, text summarization, and image classification using Vision Transformers.

You will also learn knowledge distillation concepts using DistilBERT, MobileBERT, and TinyBERT. This will help you understand how smaller and faster Transformer models are created for real-world production use cases.

In the advanced sections, you will learn how to fine-tune LLMs on custom datasets using PEFT, LoRA, QLoRA, and 4-bit quantization. You will fine-tune models like Phi and LLaMA-style models for custom text generation and instruction/chat-based tasks.

The course also includes modern Audio LLM content using Qwen3-TTS. You will learn Qwen3-TTS architecture, voice cloning, emotion control, audio data preparation, Whisper-based transcription, supervised fine-tuning, and uploading your fine-tuned audio model to Hugging Face.

By the end of this course, you will have a strong practical understanding of Hugging Face Transformers and LLM fine-tuning across NLP, vision, and audio use cases.

What You Will Learn

Understand Hugging Face Transformers from basic to advanced level
Use Hugging Face pipelines for NLP, vision, and audio tasks
Understand Transformer architecture, attention, encoder, decoder, and positional encoding
Learn BERT architecture, MLM, NSP, and BERT fine-tuning workflow
Fine-tune BERT for multi-class sentiment classification
Build and deploy a Streamlit app using a fine-tuned model
Understand knowledge distillation with DistilBERT, MobileBERT, and TinyBERT
Fine-tune lightweight Transformer models for fake news detection
Fine-tune DistilBERT for Named Entity Recognition
Fine-tune T5 for custom text summarization
Fine-tune Vision Transformer for Indian food image classification
Understand PEFT, LoRA, QLoRA, and 4-bit quantization
Fine-tune LLMs on custom datasets
Fine-tune a LLaMA base model into a chat/instruction model
Understand Qwen3-TTS architecture and voice cloning
Fine-tune Qwen3-TTS on custom audio data
Upload fine-tuned models to Hugging Face

Who this course is for:

Python developers who want to learn Hugging Face Transformers, NLP, and LLM fine-tuning through hands-on projects.
Data scientists and machine learning engineers who want to fine-tune BERT, T5, ViT, LLaMA, and other models.
NLP engineers who want to build real-world Transformer projects for classification, NER, summarization, and generation.
AI engineers who want to learn PEFT, LoRA, QLoRA, custom LLM fine-tuning, and instruction tuning workflows.
Students and researchers who want to understand Transformer architecture, BERT, knowledge distillation, and LLM training.
Generative AI learners who want to explore text, vision, and audio model fine-tuning using Hugging Face.

LLM Fine-Tuning with Hugging Face: LoRA, QLoRA, PEFT

What you'll learn

Explore related topics

Course content

Introduction9 lectures • 28min

Hello Transformers25 lectures • 2hr 45min

Transformers Architectures and Basic LLM Concepts14 lectures • 1hr 48min

BERT Architecture Theory9 lectures • 1hr 19min

Fine-Tuning BERT for Multi-Class Sentiment Classification for Twitter Tweets16 lectures • 1hr 37min

Knowledge Distillation for BERT - DistilBERT, MobileBERT and TinyBERT [Theory]13 lectures • 1hr 24min

Fine-Tune Distilled BERT Models for Fake News Detection10 lectures • 1hr 16min

Fine-Tune DistilBERT for Named Entity Recognition12 lectures • 1hr 20min

Fine-Tune T5 for Custom Text Summarization8 lectures • 51min

Fine-Tune Vision Transformer for Image Classification9 lectures • 1hr 3min

Requirements

Description

Who this course is for: