Ho to Build AI & Machine Learning Recognition Application
What you'll learn
- In the end of my course you will learn how to use machine learning in text dection with go. You used golang application in which you organized the application logic into multiple packages to easily maintain the application.
- You will use azure cognetive services for Computer Vision
- You will use azure cognetive services for Face
- You will use google Detect Text (OCR)
Course content
- Preview03:13
- Preview03:07
- Preview03:45
- 01:40How analyze foto
- 01:57Foto vision
- 01:55Doc vision
- 01:15Face detection
- 02:54Google OCR
- 02:00Google cloud
- 04:38Google OCR bot
- 06:51Google text detection
Requirements
- You should have basic level of knowledge in AI, machine learning, OCR and go language
Description
The Vision API can detect and extract text from images. There are two annotation features that support optical character recognition:
TEXT_DETECTION detects and extracts text from any image. For example, a photograph might contain a street sign or traffic sign. The JSON includes the entire extracted string, as well as individual words, and their bounding boxes.
DOCUMENT_TEXT_DETECTION also extracts text from an image, but the response is optimized for dense text and documents. The JSON includes page, block, paragraph, word, and break information.
Computer Vision
Extract rich information from images to categorize and process visual data—and perform machine-assisted moderation of images to help curate your services.
Analyze an image
This feature returns information about visual content found in an image. Use tagging, domain-specific models, and descriptions in four languages to identify content and label it with confidence. Use Object Detection to get location of thousands of objects within an image. Apply the adult/racy settings to help you detect potential adult content. Identify image types and color schemes in pictures.
Recognize celebrities and landmarks
Recognize more than 200,000 celebrities from business, politics, sports and entertainment, as well as 9,000 natural and manmade landmarks from around the world.
Face
Detect and compare human faces
Organize images into groups based on similarities
Identify previously tagged people in images
Run locally on-premises or in the cloud
Face verification
Check the likelihood that two faces belong to the same person. The API will return a confidence score about how likely it is that the two faces belong to one person.
Face detection
Detect one or more human faces in an image and get back face rectangles for where in the image the faces are, along with face attributes which contain machine learning-based predictions of facial features. The face attribute features available are: Age, Emotion, Gender, Pose, Smile, and Facial Hair along with 27 landmarks for each face in the image.
Emotion recognition
The Face API now integrates emotion recognition, returning the confidence across a set of emotions for each face in the image such as anger, contempt, disgust, fear, happiness, neutral, sadness, and surprise. These emotions are understood to be cross-culturally and universally communicated with particular facial expressions.
Who this course is for:
- It can be helpful from high school level
Instructor
I’m a solutions architect focused on building highly scalable native applications with a special in APIs, Microservices, containerized applications, and distributed systems. Currently specializes in NodeJS, JS, PHP & Golang. I’m passionate about building scalable back-end systems and Microservices in Go And NodeJS. I’m pragmatic minimalist who focuses on real-world practices for architecting solutions.
Worked with Erlang, Prolog ,Java, NodeJS, PHP, JS, Golang.
Web Servers: Apache HTTP Server, NGINX
Databases: PostgreSQL, Redis, MySQL, MongoDB
Back-End Programming Languages: PHP, Golang, Erlang, NodeJS
Back-End Development Skills: Google Web Toolkit (GWT), Docker, Git, RESTful, Kubectl, Kubernetes
Back-End Development Deliverables: Web Services, API, Database Architecture: RESTful API, PHP Script, Database
Other skills: Backend Rest APIArchitecture Design