Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Python: Write Your Own Deep Learning Framework From Scratch

Name: Python: Write Your Own Deep Learning Framework From Scratch
Rating: 5.0 (1 reviews)

Master Deep Learning by building a PyTorch-like framework with NumPy: Autograd Engine, MLP, CNN & RNN.

Created byx-BIT Development

Last updated 5/2026

English

What you'll learn

How to write a deep learning framework using pure Python and NumPy code.
How to build a functional Autograd Engine from scratch.
Be able to implement core classes like Variable, Function, and Module.
Be able to build a tensor engine that supports broadcasting and matrix operations.
How to implement activation functions like ReLU, Sigmoid, and Softmax.
How to build a Data Pipeline including Dataset and DataLoader for mini-batch training.
Be able to implement Optimizers like Stochastic Gradient Descent (SGD).
How to train and evaluate models on the MNIST dataset.
How to implement Convolutional Neural Networks (CNN) from the ground up.
Be able to understand the im2col algorithm for convolutions.
How to implement Recurrent Neural Networks (RNN) from the ground up.
How to develop Sequential model support for Recurrent Neural Networks (RNN).

Course content

12 sections • 90 lectures • 7h 29m total length

Introduction4:43
Build a PyTorch-like deep learning framework from scratch with NumPy, implement automatic differentiation and neural network abstractions, and train on MNIST with CNNs and RNNs.
How to Use the Resources0:19
Discover how to access course resources, download the source code zip, and use the provided materials to follow along with each lecture.

Setting Up Working Environment on Windows 10 & 116:10
Setting Up Working Environment on Linux (Ubuntu)6:04
Set up a Ubuntu-based development environment for building a deep learning framework by installing Python, creating a virtual environment, and installing NumPy; configure PyCharm with the virtual interpreter.
Setting Up Working Environment on MacOS5:14
Set up a macOS development environment for building a deep learning framework by installing Python, creating a virtual environment, and installing NumPy.

How Neural Networks Work4:10
Implementing Our First Class: Function4:38
Explore building a simple deep learning framework from scratch by implementing a base function class, a sine subclass, and using NumPy to perform forward computations.
Building the Variable Class2:47
Implement the variable class to manage values in the deep learning framework, initialize the value as an ndarray, and enforce type checks with explicit error handling.
Integrating Function and Variable Classes2:25
Helper Function: Implementing to_array2:45
Implement a to_array helper to ensure forward outputs are ndarrays. Use numpy is_scalar to convert scalars to ndarrays, keeping all results consistent for deep learning computations.
Introduction to Computation Graphs3:06
Explore computation graphs, forward propagation, and backward propagation to understand how gradients guide deep learning model training.
Backpropagation Logic in Computation Graphs4:45
Adding Backpropagation Support to Our Framework6:27
Automating Backpropagation With Our Framework5:49
Implementing the Gradient Check Function4:23
Implement a gradient check function that uses central difference numerical differentiation to compare backpropagated gradients with numerical gradients, validating with allclose on a sine of sine example.

Optimizing Backpropagation Logic in Our Framework1:19
Adding Variable Arguments Support for Forward8:23
Implement variable arguments support for forward to handle multiple inputs by collecting xvalues and unpacking with *, and test with a two-input add function.
Adding Variable Arguments Support for Backward5:37
Refining the Gradient Check Function4:36
Dealing With the Same Inputs2:27
Demonstrates backpropagation when the same input is used twice in y = x0 + x0, showing gradient accumulation to avoid overwriting, and verifies the correct result with a test.
Clearing Gradients: Implementing the clear_grad Method2:10
Revisiting Backpropagation In Our Framework10:37
Clearing Gradients for Intermediate Nodes1:18
Implementing the Addition Method Part 14:07
Implement the addition operation by overloading the add method in the var class, converting inputs to ndarray objects and var objects, then test with sine of x plus x.
Implementing the Addition Method Part 23:25
Implementing the Subtraction Method4:13
Implement the subtraction operator in a deep learning framework, mirroring addition, and update the forward and backward passes to yield gradients 1 and -1 while testing the change.
Implementing the Multiplication Method4:28
Implement the multiplication operation in the Python deep learning framework, performing forward computation and back propagation to compute x and y gradients from upstream derivatives.
Implementing the Division Method3:28
Develop division in the backward pass by deriving x and y derivatives, update the forward division, and add true div and rtrue div in the var class, then test.
Implementing the Negation Method3:03
Apply the negate operation in backpropagation to derive dz/dx as minus 1, and implement y = -x with backward = -gy.
Implementing the Power Method3:36

Introduction to Tensors4:17
Implementing the Reshape Method6:25
Implement the reshape method to alter tensor shapes without changing values using NumPy reshape in the forward pass, and reshape gradients to the input shape in backpropagation.
Adding Tensor Support for Gradient Check5:54
Rewrite the gradient check to support tensor inputs by computing per-element derivatives with an elementwise loop, using nditer and in-place updates, summing y0 and y1, and storing results in grads.
Adding Broadcast Support in Our Framework9:11
Helper Function: Implementing sum_to4:19
We summarize the sum_to helper function that compresses input x to a target out shape using sum with keepdims, handling delta dim and delta axis, and validating final shape.
Implementing the Sum Method for Tensors8:08
Helper Function: sum_backward_shape4:04
Matrix Multiplication: Implementing the MatMul Method6:59
Implement a matmul class to perform matrix multiplication using NumPy's dot function, compute X and W gradients via backpropagation with transposed weights, and verify with a gradient check.
Implementing the Transpose Method10:14
Implementing the Exp Method for Tensors3:27
Implement the exp method for tensors and perform backpropagation on the exp node, using dz/dx = z for gradient computation. Use numpy exp to compute y = e^x.

Introduction to Neural Networks5:54
Explore neural networks by building the basic structure with input, hidden, and output layers. Discover how weights and bias drive z computations through matrix multiplication and a sigmoid activation.
Building Our First Simple Layer6:06
Adding the Loss Function: Mean Squared Error3:57
Explore mean squared error as the loss function for training a neural network, and apply gradient descent to update weights and biases to minimize loss.
Implementing Our First Neural Network7:07
Introduction to the Module Class2:02
Implement the module class as the base for layers, enabling forward flow from input x to output z, and manage its own parameters rather than manual w1, b1, w2, b2.
Implementing the Module Class3:21
Implementing the Parameter Class6:22
Implement a parameter class and integrate it into a module class, using a set to track parameters, __setattr__ overrides, and recursive, yield-based parameter traversal with clear grad support.
Implementing the Linear Layer Class5:52
Define a linear layer class that inherits from the module class, initializes weights and bias, handles input size, and uses forward to perform matrix multiplication within a deep learning framework.
Custom Models: Implementing MyNet Class8:22
Implementing the Optimizer Class: SGD4:11

Restructuring Our Code into a Modular Framework1:49
Reorganize the framework by placing core components—var class, function class, and forward/backward calculations—into core, move helpers into helper, place module class and linear class into layers, and SGD into optimizer.
NanoTorch: Establishing the Framework Structure10:42
Reorganize the nanotorch project by moving code into dedicated helper, core, layers, functions, and optimizers modules, updating imports while preserving the network's training behavior.

Introduction to Dataset and DataLoader1:01
Add a dataset and a dataloader to the NanoTorch framework, and test them with the MNIST handwriting dataset of 28 by 28 grayscale images (60,000 training, 10,000 test).
Using Minibatch To Train Our Network7:35
Implementing the Dataset Base Class8:04
Implementing the DataLoader Class11:52
Implement a dataloader class for the NanoTorch framework to automate batch retrieval, shuffling, and iteration, making training cleaner with batch x and batch labels.
Preparing the MNIST Database0:51
Download and unzip the MNIST dataset from the resources, then use the train images and train labels to train your network and begin implementing the MNIST dataset class.
Implementing the MNIST Dataset Class8:53
Implement the MNIST dataset class to load training and test data from gzip files and map data and label paths, then reshape 28x28 grayscale images for use in the framework.
Introduction to Softmax and Cross-Entropy3:53
Learn how softmax converts neural outputs into category probabilities and how cross-entropy computes loss for multi-category problems, with examples from MNIST and output slicing.
Implementing the Log Method2:47
Implement the log method in a custom deep learning framework, with forward y equals log(x) and backward gradient gy by computing dz/dx = 1/x.
Adding Support for Slicing Operation5:54
Implement slicing in your deep learning framework, covering forward slicing, backpropagation into sliced positions, and a getitem method for the var object using numpy zeros.
Implementing the Clip Class6:58
Implement the clip class to truncate values to min and max limits, apply forward clipping with numpy, and perform backpropagation using a mask to zero gradients for clipped elements.
Implementing Softmax CrossEntropy Function6:49
Implement the softmax and cross entropy loss in nanoTorch by defining the softmax function, clipping probabilities, computing the log, selecting true labels, and averaging the scalar loss for multiclass problems.
Adding Data Preprocessing Classes7:14
Introduce data preprocessing classes for MNIST: convert raw unsigned int images to float, flatten images, and apply min-max scaling. Integrate transforms into the dataset to prepare data for training MyNet.
Final Framework Integration1:30
Performance Metrics: Adding the accuracy Function4:54
Implement an accuracy function for the MNIST classifier by extracting final predictions from y_predict and comparing them to true labels. Compute the average with numpy mean to measure training accuracy.
Implementing the ReLU Class4:16
Final Milestone: Evaluating Performance on MNIST Database3:45
Evaluate your neural network on the mnist test set using a test dataloader, compute and print the accuracy, and compare training versus test performance in your from-scratch deep learning framework.

Introduction to Convolutional Neural Networks (CNN)0:27
CNN Fundamentals: Understanding Convolutional Operations9:07
Explore the convolutional neural network from input to output, covering conv operations, padding, stride, and multi-channel feature maps to understand output shapes.
Implementing CNN Class in NanoTorch Part 1: The CNN Class7:47
Implementing CNN Class in NanoTorch Part 2: Helper Functions2:33
Implement guideconfOutsize and Pair helper functions in the helper file to compute the CNN output size from input, kernel, stride, and padding using integer division for NanoTorch.
Implementing CNN Class in NanoTorch Part 3: The Conv2d Layer9:43
Implement the conf2d layer in the cnn class, initializing parameters, weights, and bias. Forward passes use stride and padding, with the conf2d function handling core convolution and backpropagation.
Implementing CNN Class in NanoTorch Part 4: The Theory Behind Conv2d6:19
Implementing CNN Class in NanoTorch Part 5: The Conv2d Function8:18
Implement the conv2d function in nanoTorch, preparing inputs and weights, applying stride and padding, reshaping kernels, and performing matrix multiplication to produce output maps with shape n, oc, oh, ow.
Implementing CNN Class in NanoTorch Part 6: The Im2Mat Class3:38
Implementing CNN Class in NanoTorch Part 7: Image to Matrix (Im2Col)16:06
Master image to matrix and matrix to image in a CNN class with im2col, including batch size, padding, and stride, to enable efficient convolution via matrix multiplication.
Final Milestone: Evaluating Performance on MNIST Database3:57
Import CNN components such as Conf2D, ImageToMatrix, and MatrixToImage, apply Flatten preprocessing, and evaluate on the Amnesty data site using the CNN instead of the MLP, achieving rising accuracy.

Requirements

Basic Python programming skills (familiarity with classes, functions, and NumPy basics).
Basic Calculus and Linear Algebra, specifically derivatives and the Chain Rule, matrix multiplication. If you're not a fan of math, you can simply follow the code to see how it works in action
Basic Deep Learning concepts: Knowing the basics of how models train and common architectures like CNNs and RNNs. We’ll cover the basics, and more importantly, we’ll take it a step further through learning by doing.
A curiosity to see how a deep learning framework is built and a willingness to follow along with the code.
No prior experience in deep learning framework development is required—we will build everything step by step.

Description

Welcome to Python: Write Your Own Deep Learning Framework From Scratch.

This course teaches you how to build a simple, PyTorch-like deep learning framework from scratch. It covers the core mechanics of automatic differentiation and neural network abstractions. In this course, I will take you through the process of building a modular working system step by step, using only Python and NumPy.

The first part of the course teaches all you need to know (computation graphs, backpropagation logic, gradient checking, etc.) before you can build a functional autograd engine. In this part, we start with scalar-valued variables and move on to handling complex logic, such as dealing with the same inputs and advanced operators. You will learn how to automate the chain rule and verify your engine’s accuracy.

The second part of the course teaches you how to transition from scalars to tensors. You will learn how to implement broadcasting, matrix multiplication, and shape manipulation. We will then restructure our code into a modular framework called NanoTorch. By the end of this part, you will implement essential framework components like Datasets, DataLoaders, and Optimizers to train models on the real-world MNIST dataset.

The final part of the course focuses on implementing core neural network architectures. We will deep-dive into Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). You will see how to implement the im2col algorithm for efficient convolution and handle sequential data for time-series tasks. Ultimately, we will write fully functional CNN and RNN architectures from the ground up, ensuring an in-depth understanding of these powerful models.

In this course you will learn:

How to write a deep learning framework using pure Python and NumPy code.
How to build a functional Autograd Engine from scratch.
Be able to implement core classes like Variable, Function, and Module.
Be able to build a tensor engine that supports broadcasting and matrix operations.
How to implement activation functions like ReLU, Sigmoid, and Softmax.
How to build a Data Pipeline including Dataset and DataLoader for mini-batch training.
Be able to implement Optimizers like Stochastic Gradient Descent (SGD).
How to train and evaluate models on the MNIST dataset.
Be able to understand the im2col algorithm for convolutions.
How to implement Convolutional Neural Networks (CNN) from the ground up.
How to implement Recurrent Neural Networks (RNN) from the ground up.
How to develop Sequential model support for Recurrent Neural Networks (RNN).

At the end of the course, you should be able to develop your own deep learning framework and understand the low-level mechanics of deep learning structures.

Who this course is for:

Students who learned deep learning concepts and want to put them into practice by building their own engine.
People curious about the fundamental mechanisms of automatic differentiation and how autograd engines work under the hood.
Students who want to build a hobby deep learning framework but don't know how and where to start.
Anyone who wants to fully understand how deep learning works by building every component from the ground up.
Developers who want to skip the math entirely and focus only on the code implementation to make it work.
People who are curious about how Deep Learning frameworks like PyTorch really work.

Python: Write Your Own Deep Learning Framework From Scratch

What you'll learn

Explore related topics

Course content

Introduction2 lectures • 5min

Setup and Installation3 lectures • 17min

Building a Scalar-Valued Autograd Engine From Scratch: The Core Architecture10 lectures • 41min

Building a Scalar-Valued Autograd Engine From Scratch: Advanced Logic & Operator15 lectures • 1hr 3min

Building a Full-Featured Autograd Engine From Scratch: From Scalar to Tensors10 lectures • 1hr 3min

Neural Network Implementation: Building Modules and Optimizers10 lectures • 53min

Building Our Own Framework: NanoTorch2 lectures • 13min

NanoTorch in Action: Data Pipelines and MNIST Training16 lectures • 1hr 26min

NanoTorch in Action: Building Multi-Layer Perceptrons (MLP)2 lectures • 4min

NanoTorch in Action: Building Convolutional Neural Networks (CNN)10 lectures • 1hr 8min

Requirements

Description

Who this course is for: