
Open and view images using the pillow library in python, loading test image.jpg from the same folder with image.open, and inspect the image object's mode rgb and size.
Learn to crop images in Python with Pillow using the image.crop method. Open an image, inspect its size, set left, top, right, and bottom coordinates, and produce a cropped result.
Learn to save images in python's image library using the save method after rotating or processing, saving intermediate and final results as jpg files in the same directory.
Open an image with OpenCV in Python using cv2.imread, inspect the pixel array as a numpy array, and display the image with matplotlib for OCR preprocessing.
Learn to binarize a color image to black and white in OpenCV by converting to grayscale and applying thresholding, then save and display the binary image.
Learn text template matching in an OCR workflow using Python, building a date regex to locate and draw bounding boxes around dates in images.
Welcome to Course "Optical Character Recognition (OCR) MasterClass in Python"
Optical character recognition (OCR) technology is a business solution for automating data extraction from printed or written text from a scanned document or image file and then converting the text into a machine-readable form to be used for data processing like editing or searching.
BENEFITS OF OCR:
Reduce costs
Accelerate workflows
Automate document routing and content processing
Centralize and secure data (no fires, break-ins or documents lost in the back vaults)
Improve service by ensuring employees have the most up-to-date and accurate information
Some Key Learning Outcomes of this course are:
Recognition of text from images using OpenCV and Pytesseract.
Learn to work with Image data and manipulate it using Pillow Library in Python.
Build Projects like License Plate Detection, Extracting Dates and other important information from images using the concepts discussed in this course.
Learn how Machine Learning can be useful in certain OCR problems.
This course covers basic fundamentals of Machine Learning required for getting accurate OCR results.
Build Machine Learning models with text recognition accuracy of above 90%.
You will learn about different image preprocessing techniques such as grayscaling, binarization, erosion, dilation etc... which will help to improve the image quality for better OCR results.