
Introduction and course agenda
You'll learn how to compute convolutions over volumes and how to measure the computational cost of standard convolutions.
Learn about depthwise convolutions, how they differ from standard convolutions, and how to compute their computational costs.
Learn about pointwise (aka 1x1) convolutions, how they are used to model cross-channel interactions, and how to compute their computational cost
Learn about the alpha and rho parameters and how they help balance the tradeoff between computational cost and accuracy.
An introduction to the MobileNetV2, manifolds of interest, and linear bottlenecks
Learn about the novelty introduced in this paper: The inverted residual block.
An overview of the MobileNetV2 architecture
Learn about the hard swish activation and the squeeze and excitation blocks
Learn about the inner workings of a MobileNetV3 block.
In this example, I walk through the code example.
Note: In the near future this video will be updated. Keep an eye out for that.
This course provides a comprehensive understanding of MobileNet, a state-of-the-art deep learning architecture for resource-constrained devices such as smartphones and IoT devices. MobileNet is optimized for real-time image and video classification, making it an ideal choice for cutting-edge computer vision applications.
One of the key innovations in MobileNet is the use of depthwise separable convolutions, which allow for efficient computation and reduced memory footprint compared to traditional convolutional neural networks (CNNs). In this course, you'll learn about the computational costs of standard convolutions and how depthwise separable convolutions reduce computational overhead.
You'll also delve into the architecture of MobileNet, including the use of linear bottlenecks and inverted residuals to optimize performance. In addition, you'll explore squeeze and excitation layers, which add a self-attention mechanism to the network, allowing it to focus on the most important features in an input image.
The course includes hands-on demonstrations and practical exercises that allow you to experience the power of MobileNet in action. You'll perform image classification on the Describable Textures Dataset using the SuperGradients training library and see how MobileNet can solve real-world problems in computer vision.
In conclusion, this course is designed for anyone interested in deep learning, computer vision, or edge computing. Whether you're a computer science student, a machine learning engineer, or a researcher, you'll leave this course with a comprehensive understanding of MobileNet, its architecture, and its applications. So, don't miss out on this opportunity to advance your skills in deep learning on edge!