Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Master Computer Vision & Deep Learning: OpenCV, YOLO, ResNet

Name: Master Computer Vision & Deep Learning: OpenCV, YOLO, ResNet
Rating: 4.0 (40 reviews)

Unlock the Power of Object Detection with Deep Learning: YOLO, SSD, SVM, ResNet50, Inceptionv3 and CNNs

Created byVineeta Vashistha

Last updated 1/2025

English

What you'll learn

Master the fundamentals of deep learning, including neurons, neural networks, and activation functions
Discover the architecture and design of state-of-the-art object detection models, such as Faster R-CNN, RetinaNet, SDD, and YOLO
Build a real-world object detection application to automatically detect license plate numbers using Faster R-CNN
Learn about the architecture and design of image classification models, such as SVM, VGG-16, ResNet50, and InceptionV3
Develop an image classification application to detect and train traffic sign boards using SVM
Train an image classification model using ResNet to classify 20 different sets of multiple images
Understand the design of object tracking frameworks, such as Meanshift, SORT, and DeepSORT
Build a solution to track football players using object tracking

Course content

17 sections • 79 lectures • 3h 50m total length

Learning Path1:38
Begin with Python and OpenCV to build a solid image processing base for OCR projects, then master practical Python basics, Numpy, Pandas, and OpenCV concepts like thresholding, dilation, and erosion.
Course Starter - How to approach the course6:18
Discover how to maximize learning with captions for clarity and how to download resources. Engage via Q&A and familiarize yourself with tools setup and download code lectures.
Udemy Review1:51
Understand the Udemy review system and rate after evaluating all sections, projects, and downloadable resources. The course provides 24-hour in-course support to address concerns and enhance your learning journey.

Objectives0:44
Gain an overview of artificial intelligence and see how computer vision, machine learning, and deep learning fit within it, plus image basics like pixels, channels, and color models.
Artificial Intelligence Overview2:33
Discover how artificial intelligence enables machines to mimic human intelligence through computer vision, machine learning, and deep learning, with neural networks extracting high-level features from data for real-world applications.
What is Computer Vision ?2:08
Computer vision enables machines to see, identify, and process images like humans by automatically extracting information and labeling what is present through classification, localization, object detection, and segmentation.
Image Basics5:29
Explore image fundamentals by understanding pixels, channels, and color models, including RGB, LAB, and HSV, with grayscale and binary imaging insights and 0-255 value ranges and color perception concepts.

Objectives0:39
Guide learners through a step-by-step tool setup for Ubuntu and Windows environments, then explore PyCharm, Jupyter Notebook, and Google Colab for training models.
Tools Setup - Ubuntu1:04
guide students through setting up Ubuntu tools for computer vision development by installing python 3.6, optionally other stable versions, and PyCharm, following on-screen commands and the provided setup manual.
Tools Setup - Windows0:45
Install Python version 3.6 in the C folder and set up PyCharm on Windows using the provided links. Use the setup manual in resources to complete the Windows tool installation.
Using Pycharm for Coding6:26
Learn how to install and launch PyCharm, create or open projects, configure Python interpreters and virtual environments, install packages, run and debug Python code efficiently.
Using Jupyter Notebook and Shortcuts1:26
Using Google Colab11:17
Create, rename, upload, and open notebooks in Google Colab, then run code in cells with shared variables. Select cpu, gpu, or tpu runtimes and note 12 hours of execution.

Objectives0:31
Explore the basic building block of deep learning—the neuron—and its architecture, then study artificial neural networks, convolutional neural networks, and the role of activation functions.
What is a Neuron?2:02
Understand how a neuron functions as the computational unit in neural networks, receiving inputs through dendrites and sending outputs via axons, powering deep learning.
Neuron Architecture1:25
Neurons model brain using inputs, weights, and a bias to produce 0 or 1. They form an artificial neural network that classifies inputs via a weighted sum and activation function.
Artificial Neural Network3:04
Convolutional Neural Network6:12
Explore how a convolutional neural network analyzes images with convolutional layers, feature maps, pooling, and fully connected layers for classification, highlighting fewer parameters and fast training.
Activation Function2:44
Activation functions determine a neural network’s output, accuracy, and training efficiency, acting as gates that activate neurons and define binary step, linear, and non-linear activations for complex data.

Objectives0:58
Object Detection Overview2:26
Explore object detection in computer vision, comparing early models like Haar Cascade and Hog with advanced models such as R-CNN and YOLO, highlighting speed, dataset size, and accuracy.
Object Detection Architecture1:37
Compare one-stage and two-stage object detectors by describing single-network detection versus a two-step region proposal process. Retinanet and Yolo illustrate one-stage models; R-CNN, Faster R-CNN and FPN illustrate two-stage detectors.
Object Detection vs Object Tracking2:21
Differentiate object detection and object tracking by drawing boundary boxes and classifying objects in each frame, then track them across video with unique IDs.
R-CNN MODEL2:16
R-CNN uses selective search to generate about 2000 region proposals, extracts features with a convolutional network, classifies with an SVM, and refines bounding boxes with regression, though training is time-consuming.
FAST R-CNN MODEL1:52
Learn how Fast R-CNN reduces computation by sharing features across regions of interest, using RoI pooling, softmax classification, and bounding box regression, making it faster and more accurate than R-CNN.
Region Proposal Network (RPN)3:26
Explore how region proposal networks predict objectness and bounding boxes across a backbone feature map, using anchors and 3x3 and 1x1 convolutions to generate object proposals.
FASTER R-CNN MODEL2:36
R-FCN MODEL2:02
Explore R-FCN, a region-based fully convolutional detector that shares computation across the image, using ResNet-101 feature maps, RPN proposals, and position-sensitive RoI pooling to classify RoIs.

Objectives0:34
Explore the project object detection with Faster R-CNN by examining the high-level design, performing a code walkthrough, and following download and execution instructions to run the project.
Project Overview1:15
Perform object detection with faster r-cnn to identify 88 object types in a marketplace video, with a PyCharm walkthrough and downloadable code.
Code Walkthrough9:36
Code Download Instructions1:07
Download and unzip Faster-RCNN.zip, open the project in PyCharm, and run faster_rcnn_Object Detection.py with the pre-trained frozen_inference_graph.pb and coco.names for 88 object classes on TownCenterXVID.avi input after installing requirements.txt.

Objectives0:40
Explore the object detection model and its architecture, starting with the net model from Facebook, then cover the euro be three model, the tiny model, and the iPhone model.
RetinaNet1:24
RetinaNet, introduced by Facebook AI Research, targets dense and small object detection with a ResNet-based backbone, a feature pyramid, and two subnetworks for classification and regression using focal loss.
SSD MODEL2:46
Discover how the SSD model performs real-time object detection by predicting bounding boxes and class scores in a single end-to-end CNN, using multi-scale feature layers and non-maximum suppression.
YOLO V3 Model5:29
YOLO V3 TINY MODEL1:53
YOLOV4 Model4:08
Explore YOLOv4, an efficient object detector for production and parallel computation, featuring CSPDarknet53 backbone with SPPnet and PANet, three detection heads, and bag of freebies and specials that boost accuracy.
Quiz on Object Detection Concepts

Objectives0:35
Learn how to perform license plate recognition using yolov3, explore the project design, walk through the source code, and follow execution and download instructions.
Project Overview1:25
Demonstrate license plate recognition from webcam video using YOLO and a convolutional neural network to generate bounding boxes. Walk through PyCharm code and provide download instructions for execution.
Code Walkthrough10:46
Follow a code walkthrough for license plate detection with YOLOv3, OpenCV, and pytesseract, covering project setup in PyCharm, virtual environments, and running the model on videos, webcam, and images.
Code Download Instructions0:48
Download and unzip the License_Number_Plate_Detection_YOLOv3.zip from resources. Open PyCharm to access name.py and model.py, review test_dataset and yolo_utils with configuration, weights, and class name files, then run main.py.

Objectives0:41
Explore how image classification works in industry by examining the classification pipeline and comparing key models: SVM, decision tree, and K-NN.
Image Classification Overview2:05
Master image classification in computer vision with supervised learning, labeling images by content, and tackle challenges like scale and occlusion while reviewing models such as SVM, VGG16, and ResNet50.
Image Classification Pipeline2:20
Explore the four-stage image classification pipeline, from pre-processing with grayscale conversion, standardizing image sizes, and data augmentation, to object segmentation, feature extraction and training, and final object classification.
Support Vector Machine(SVM)2:44
Explore SVM for classification and regression in high-dimensional spaces, using kernel-based projection to maximize the margin and reduce overfitting in machine vision tasks.
Decision Tree4:00
Explore the decision tree, a supervised classifier for remote sensing images. Learn how root, interior, and leaf nodes split data using gain ratio and Gini index, and avoid overfitting.
K Nearest Neighbor(KNN)2:45
Apply the knn algorithm, a simple, nonparametric, lazy supervised learner for image classification and regression, using euclidean distance and majority voting to label new points.

Objectives0:37
Explore the YOLOv3 license plate project, review its design and architecture, walk through the code download instructions, and learn configuration, setup, and Google Colab training.
Project Overview1:10
Code Walkthrough10:00
Train a YOLOv3 model for license plate detection using transfer learning with a pretrained convnet as fixed feature extractor, then train a linear classifier.
Code Download Instructions0:49

Requirements

Basic knowledge of programming in Python
Familiarity with machine learning concepts

Description

Master Deep Learning and Computer Vision: From Foundations to Cutting-Edge Techniques

Elevate your career with a comprehensive deep dive into the world of machine learning, with a focus on object detection, image classification, and object tracking.

This course is designed to equip you with the practical skills and theoretical knowledge needed to excel in the field of computer vision and deep learning. You'll learn to leverage state-of-the-art techniques, including convolutional neural networks (CNNs), recurrent neural networks (RNNs), and advanced object detection models like YOLOv8.

Key Learning Outcomes:

Fundamental Concepts:
- Grasp the core concepts of machine learning and deep learning, including supervised and unsupervised learning.
- Understand the mathematical foundations of neural networks, such as linear algebra, calculus, and probability theory.
Computer Vision Techniques:
- Master image processing techniques, including filtering, noise reduction, and feature extraction.
- Learn to implement various object detection models, such as YOLOv8, Faster R-CNN, and SSD.
- Explore image classification techniques, including CNN architectures like ResNet, Inception, and EfficientNet.
- Dive into object tracking algorithms, such as SORT, DeepSORT, and Kalman filtering.
Practical Projects:
- Build real-world applications, such as license plate recognition, traffic sign detection, and sports analytics.
- Gain hands-on experience with popular deep learning frameworks like TensorFlow and PyTorch.
- Learn to fine-tune pre-trained models and train custom models for specific tasks.

Why Choose This Course?

Expert Instruction: Learn from experienced instructors with a deep understanding of deep learning and computer vision.
Hands-On Projects: Gain practical experience through a variety of real-world projects.
Comprehensive Curriculum: Cover a wide range of topics, from foundational concepts to advanced techniques.
Flexible Learning: Access course materials and assignments at your own pace.
24/7 Support: Get timely assistance from our dedicated support team.

Join us and unlock the power of deep learning to shape the future of technology.

Who this course is for:

Software engineers who want to learn deep learning and computer vision to develop cutting-edge machine learning solutions.
Machine learning enthusiasts who want to develop a portfolio of industry-relevant projects
Data scientists who want to expand their skills and knowledge in deep learning and computer vision
Students who want to gain hands-on experience with deep learning and computer vision
Professionals who want to transition into a career in machine learning

Master Computer Vision & Deep Learning: OpenCV, YOLO, ResNet

What you'll learn

Explore related topics

Course content

Course Starter3 lectures • 10min

Understanding Computer Vision and AI4 lectures • 11min

Tools Setup6 lectures • 22min

Neuron, Neural Network and Activation Function6 lectures • 16min

Object Detection - R-CNN, FAST R-CNN, RPN, FASTER R-CNN and R-FCN9 lectures • 20min

Project 1 - Object Detection using Faster R-CNN4 lectures • 13min

Object Detection - RetinaNet, SSD, YOLO, YOLOV3, YOLOV3 Tiny and YOLOV46 lectures • 16min

Project 2 - License Number Plate Recognition using YOLOV34 lectures • 14min

Image Classification Models - SVM, Decision Tree, KNN6 lectures • 15min

Project 3 - YOLOV3 Training for License Number Plate4 lectures • 13min

Requirements

Description

Who this course is for: