
Course Introduction!!!
Get code files here!!!
Build the architectural foundation needed to understand attention, encoders, decoders, and modern LLM fine-tuning.
Learn how Q, K, and V vectors power attention mechanisms inside Transformer and LLM architectures.
Understand the attention variants used in encoder, decoder, and encoder-decoder Transformer models.
Connect Transformer architecture to practical NLP, Generative AI, and LLM fine-tuning use cases.
Learn why BERT is one of the most important Transformer models for NLP fine-tuning.
Understand masked language modeling and next sentence prediction, the core ideas behind BERT pretraining.
Learn how pretrained BERT is adapted for downstream NLP tasks through supervised fine-tuning and evaluation.
Start the hands-on project where BERT is fine-tuned for multi-class Twitter sentiment classification.
Learn how a classification head is added to BERT for supervised text classification.
Load the Twitter multi-class sentiment dataset and prepare it for analysis and training.
Understand how raw tweet text is converted into token IDs that BERT can process.
Load a pretrained BERT model and attach a classification head for custom sentiment labels.
Fine-tune BERT using Hugging Face Trainer and the prepared sentiment dataset.
Evaluate the trained model on test data and check classification performance.
Save the fine-tuned BERT model and run predictions on new custom text examples.
Build a simple Streamlit app that uses the fine-tuned BERT model for real-time sentiment prediction.
Learn why knowledge distillation is used to create smaller, faster Transformer models for production use.
Start a classification project using DistilBERT, MobileBERT, and TinyBERT for fake news detection.
Load and clean the fake news dataset used for binary text classification.
Tokenize article titles or text so distilled BERT models can process the dataset.
Configure lightweight Transformer models for fake news classification.
Fine-tune distilled BERT models using Hugging Face training workflows.
Evaluate model accuracy and compare performance on validation or test data.
Compare DistilBERT, MobileBERT, TinyBERT, and BERT-Base for performance and efficiency.
Continue the benchmark comparison and interpret tradeoffs between speed, size, and accuracy.
Start a named entity recognition project using restaurant search data and DistilBERT.
Learn what NER is and how it identifies entities such as locations, cuisines, and restaurant-related terms.
Understand the tagging format used to label tokens for named entity recognition.
Build the DistilBERT tokenizer workflow for token classification.
Learn how to align word-level NER labels with subword tokens, a key step in token classification.
Fine-tune DistilBERT for restaurant search named entity recognition.
Save the fine-tuned NER model and test it on custom restaurant search text.
Start a sequence-to-sequence project where T5 is fine-tuned for custom dialogue summarization.
Learn what summarization is and how it is used in NLP and Generative AI workflows.
Load the SAMSum dataset and understand why it is useful for dialogue summarization.
Prepare dialogue and summary pairs for T5 using sequence-to-sequence tokenization.
Train T5 on the SAMSum dataset using Hugging Face Trainer and seq2seq data collation.
Run inference with the fine-tuned T5 model to generate summaries for custom dialogue text.
Start a computer vision project using Vision Transformer for Indian food image classification.
Load the Hugging Face image dataset and inspect labels for food classification.
Prepare images using resizing, normalization, tensors, and image processor settings for ViT.
Fine-tune a pretrained Vision Transformer model for Indian food image classification.
Save the trained ViT model and run image classification inference on custom images.
Welcome to Fine Tuning LLM with Hugging Face Transformers for NLP, a practical and project-based course designed to help you understand and fine-tune modern Transformer models for real-world AI applications.
This course starts from the basics of Hugging Face Transformers and gradually takes you into advanced fine-tuning workflows. You will learn how pipelines work, how checkpoints and models are used, how Hugging Face datasets are loaded, and how Auto Classes simplify model loading, tokenization, training, and inference.
After building a strong foundation, you will go deeper into Transformer architecture. You will understand Seq2Seq models, attention mechanism, Q, K, V vectors, scaled dot-product attention, encoder-decoder stacks, positional encoding, self-attention, masked self-attention, cross-attention, and multi-head attention.
The course also covers BERT architecture in detail. You will learn how BERT processes input, how masked language modeling and next sentence prediction work, and how BERT is fine-tuned for downstream NLP tasks.
Then you will move into hands-on projects where you will fine-tune Transformer models for practical use cases such as sentiment classification, fake news detection, named entity recognition, text summarization, and image classification using Vision Transformers.
You will also learn knowledge distillation concepts using DistilBERT, MobileBERT, and TinyBERT. This will help you understand how smaller and faster Transformer models are created for real-world production use cases.
In the advanced sections, you will learn how to fine-tune LLMs on custom datasets using PEFT, LoRA, QLoRA, and 4-bit quantization. You will fine-tune models like Phi and LLaMA-style models for custom text generation and instruction/chat-based tasks.
The course also includes modern Audio LLM content using Qwen3-TTS. You will learn Qwen3-TTS architecture, voice cloning, emotion control, audio data preparation, Whisper-based transcription, supervised fine-tuning, and uploading your fine-tuned audio model to Hugging Face.
By the end of this course, you will have a strong practical understanding of Hugging Face Transformers and LLM fine-tuning across NLP, vision, and audio use cases.
What You Will Learn
Understand Hugging Face Transformers from basic to advanced level
Use Hugging Face pipelines for NLP, vision, and audio tasks
Understand Transformer architecture, attention, encoder, decoder, and positional encoding
Learn BERT architecture, MLM, NSP, and BERT fine-tuning workflow
Fine-tune BERT for multi-class sentiment classification
Build and deploy a Streamlit app using a fine-tuned model
Understand knowledge distillation with DistilBERT, MobileBERT, and TinyBERT
Fine-tune lightweight Transformer models for fake news detection
Fine-tune DistilBERT for Named Entity Recognition
Fine-tune T5 for custom text summarization
Fine-tune Vision Transformer for Indian food image classification
Understand PEFT, LoRA, QLoRA, and 4-bit quantization
Fine-tune LLMs on custom datasets
Fine-tune a LLaMA base model into a chat/instruction model
Understand Qwen3-TTS architecture and voice cloning
Fine-tune Qwen3-TTS on custom audio data
Upload fine-tuned models to Hugging Face