Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Transcription 101: Google Speech to Text API with Python
Rating: 3.4 out of 5(32 ratings)
144 students

Transcription 101: Google Speech to Text API with Python

Talking to your technology? Now your technology understands you
Created byBruce Bookman
Last updated 6/2020
English

What you'll learn

  • Write Python programs that transcribe human speech into text using Google Cloud Speech to Text API
  • Learn how to use Machine Learning and Artificial Intelligence to transcribe audio with Google's toolset
  • Understand how to apply Speech-to-Text for specific usecases
  • Learn when accuracy is important, and when it is not
  • Explore how to tune the API configuration for specific outcomes
  • Benefit from an experienced instructor with real world application

Coding Exercises

This course includes our updated coding exercises so you can practice your skills as you learn.

See a demo
Image of coding exercise example

Course content

9 sections22 lectures1h 4m total length
  • Why Speech to Text? What is it good for?5:01

    Reviews the applications for Speech-to-Text

  • Prerequisites0:27
  • Try it now!0:31
  • API overview3:15
  • Section quiz

Requirements

  • Intermediate Python
  • Beginning Linux
  • Familiarity with terminal editors such as vi or nano

Description

Powerful AI

Google Speech-to-Text enables developers to convert audio to text by applying powerful neural network models in an easy-to-use API. The API recognizes more than 120 languages and variants to support your global user base. You can enable voice command-and-control, transcribe audio from call centers, and more. It can process real-time streaming or prerecorded audio, using Google’s machine learning technology.

Best results

In this course, you will learn not only how to write Python programs to process audio and produce transcripts, but you will also learn how to optimize the API configuration for the highest accuracy. 

Experienced Instructor

The course instructor has worked with Fortune 500 companies to guide them toward high accuracy and fit for purpose transcription.

Who this course is for:

  • Intermediate Python developers who wish to learn how to transcribe human speech
  • Beginner Python developers who wish to learn real world uses for Artificial Intelligence / Machine Learning
  • Anyone with a business case where extracting valuable data from audio content is helpful