Java TTS and STT - text to speech and speech to text course
3.7 (3 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
8 students enrolled

Working on projects regarding TTS and STT and also programming your multi-threaded mp3 player
Created by ITSuff &Tricks
Last updated 12/2016
English
Price: $30
30-Day Money-Back Guarantee
Includes:
  • 3 hours on-demand video
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Write your own programs for TTS and STT conversion. Besides knowing how to use FreeTTS and Sphinx, you'll be able to use the Google Translate service for TTS and write your own sound (music) player. Know the differences between various speech to text and text to speech libraries.
Requirements
  • Students will need to know beginner-level Java programming (if statements, while and for loops, and the concepts of functions, methods and classes). They will also need to know how to install the JDK and Maven on their machine, and have basic knowledge of the Eclipse IDE.
Description

What are text to speech (TTS) and speech to text (STT)?

Text to speech, abbreviated as TTS, is a form of speech synthesis that converts text into spoken voice output. Text to speech systems were first developed to aid the visually impaired by offering a computer-generated spoken voice that would "read" text to the user.

Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech computer or speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech.

Speech to text - Speech recognition (SR) is the inter-disciplinary sub-field of computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers. It is also known as "automatic speech recognition" (ASR), "computer speech recognition", or just "speech to text" (STT). It incorporates knowledge and research from the linguistics, computer science, and electrical engineering fields.

Speech-to-text software is a type of software that effectively takes audio content and transcribes it into written words in a word processor or other display destination. This type of speech recognition software is extremely valuable to anyone who needs to generate a lot of written content without a lot of manual typing. It is also useful for people with disabilities that make it difficult for them to use a keyboard.
Speech-to-text software may also be known as voice recognition software.

Learn text to speech and speech to text recognition in Java

This course is for you. Whether you are a newcomer or an enthusiastic programmer/developer interested in text to speech and speech to text recognition, you can benefit from it. If you already know Java and want to explore it further, particularly in the field of text to speech and speech to text recognition, this course guides you through four projects step by step, with clear examples and code written from scratch to the end. Java keeps pace with the technology world in many directions, and TTS and STT are just one of them. The course gathers the material for this widely used field in one place.

Learn Java and Java APIs

Even if you are new to programming, you will be able to understand the concepts. You will be shown how to write every single method, line by line, so that you can finish your projects successfully. The course includes four projects written from scratch to completion. A link to the source code is included!

Contents and Overview 

3 hours of Java programming with clear examples, explanations and solutions, including these topics:

  • Text to speech with FreeTTS
  • Java TTS with Google Translate service
  • Programming sound player
  • Sphinx voice recognition (STT)
Who is the target audience?
  • Target students are intermediate and advanced programmers/developers who want to learn more about TTS and STT. The course also uses clear and simple explanations, so even beginners should be able to complete the projects if they follow the instructions.
Curriculum For This Course
21 Lectures
02:56:03
Introduction
2 Lectures 04:07

Introduction video and notes about what will be covered in course

Preview 01:38

Video about properly setting up our environment (the projects were done in the Eclipse IDE on Microsoft Windows)

Setting up environment
02:29
Text to speech with FreeTTS
5 Lectures 45:18

In this lesson we will set up our first project

Preview 05:13

In this lesson we will write our class and a method for getting all the voices

Writing our first program; getting all voices and printing them
05:36

In this lesson we will go further with our project and add some more code to our class - we will reproduce some sound from our input.

Reproducing sound from input
06:37

In this lesson we will set the pitch, range and pitch shift of the voice, so that our input is spoken according to those settings

Setting pitch, range and pitch shift
06:49

In this lesson we will load text-to-speech with System properties, and use necessary resources after loading it.

Loading TTS with System properties programmatically
21:03
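As a rough illustration of loading TTS through System properties, the sketch below sets the property before any voice is requested. It assumes the commonly documented FreeTTS property name freetts.voices and the Kevin voice directory class; the course page itself does not name either, so treat both as assumptions. The class name FreeTtsVoiceProperty is hypothetical, and the FreeTTS call is left as a comment so the snippet runs without the library.

```java
class FreeTtsVoiceProperty {
    // Assumption: property name and voice directory class as commonly used with FreeTTS.
    static final String VOICES_PROPERTY = "freetts.voices";
    static final String KEVIN_DIRECTORY =
            "com.sun.speech.freetts.en.us.cmu_us_kal.KevinVoiceDirectory";

    // Sets the property and returns the value now visible to the JVM.
    static String register() {
        System.setProperty(VOICES_PROPERTY, KEVIN_DIRECTORY);
        return System.getProperty(VOICES_PROPERTY);
    }

    public static void main(String[] args) {
        System.out.println(register());
        // With FreeTTS on the classpath, a voice could then be requested, for example:
        // Voice voice = VoiceManager.getInstance().getVoice("kevin16");
    }
}
```

The property has to be set before the voice manager is first asked for voices, which is why it is done at the start of the program.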
Java TTS with Google Translate service
3 Lectures 30:59

In this lesson we will create our project and write a method for getting some data from the Google Translate service.

Creating our project and writing method for getting the audio from Google
11:31

In this lesson we will show how to programmatically get multilanguage speech from Google Translate, and we will execute our method.

Using method for getting the multilanguage speech output with Google service
07:48

In this lesson we will load the text we want spoken from a text file on disk, then send that data as a string to our Google service method so that we receive the output sound.

Getting the multilanguage speech output through textual input file
11:40
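The file-to-request step described in this section might look roughly like the sketch below: read the text file, URL-encode its contents, and build the request address. The endpoint, the parameter names q and tl, and the class TtsRequestBuilder are hypothetical placeholders, since the course page does not state the actual service URL or parameters.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.io.UnsupportedEncodingException;
import java.net.URI;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

class TtsRequestBuilder {
    // Hypothetical placeholder endpoint, not the real service address.
    static final String BASE_URL = "https://tts.example.invalid/speak";

    // URL-encodes one query value as UTF-8 (spaces become '+').
    private static String encode(String value) {
        try {
            return URLEncoder.encode(value, "UTF-8");
        } catch (UnsupportedEncodingException e) {
            throw new IllegalStateException("UTF-8 is always supported", e);
        }
    }

    // Builds the request URI for the given text and language code.
    static URI buildUri(String text, String languageCode) {
        return URI.create(BASE_URL + "?q=" + encode(text) + "&tl=" + encode(languageCode));
    }

    // Reads the whole file as UTF-8, trims surrounding whitespace, and builds the URI.
    static URI buildUriFromFile(Path file, String languageCode) {
        try {
            String text = new String(Files.readAllBytes(file), StandardCharsets.UTF_8).trim();
            return buildUri(text, languageCode);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Encoding the text matters because spaces and non-ASCII characters are not valid in a raw query string; the returned URI can then be opened as a stream and the audio bytes saved or played.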
Programming sound player
5 Lectures 51:52

In this lesson we will create our project and play some music from our disk.

Preview 08:10

In this lesson we will start to write a sound player class that uses threads.

Starting to write sound player program with threads
07:28

In this lesson we will write the implementation of two threads - one for the actual sound reproduction and one for information about the sound duration.

Writing two threads: thread for playing sound and thread for sound duration info
10:42
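The two-thread layout from this lesson can be sketched with the JDK's javax.sound.sampled classes alone. This is a minimal sketch, not the course's code: the class DurationDemo and its methods are hypothetical, the "player" thread only consumes the audio stream (a real player would write each buffer to a SourceDataLine), and the audio is one second of in-memory silence so that no file or sound card is needed.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.concurrent.atomic.AtomicLong;
import javax.sound.sampled.AudioFormat;
import javax.sound.sampled.AudioInputStream;

class DurationDemo {
    // Length of an uncompressed stream in milliseconds: frames / frames per second.
    static long durationMillis(AudioInputStream in) {
        return Math.round(1000.0 * in.getFrameLength() / in.getFormat().getFrameRate());
    }

    // One second of silence: 8 kHz, 16-bit, mono, signed, little-endian.
    static AudioInputStream oneSecondOfSilence() {
        AudioFormat format = new AudioFormat(8000f, 16, 1, true, false);
        byte[] pcm = new byte[8000 * 2];
        return new AudioInputStream(new ByteArrayInputStream(pcm), format, 8000);
    }

    // Runs a player thread and a duration-info thread; returns the frames consumed.
    static long playAndReport(AudioInputStream in) {
        AtomicLong framesRead = new AtomicLong();
        int frameSize = in.getFormat().getFrameSize();
        long totalMillis = durationMillis(in);

        Thread player = new Thread(() -> {
            byte[] buffer = new byte[1024];
            try {
                int n;
                while ((n = in.read(buffer)) > 0) {
                    // A real player would call SourceDataLine.write(buffer, 0, n) here.
                    framesRead.addAndGet(n / frameSize);
                }
            } catch (IOException e) {
                throw new UncheckedIOException(e);
            }
        }, "player");

        Thread info = new Thread(() -> {
            while (player.isAlive()) {
                System.out.println(framesRead.get() + " frames read, total " + totalMillis + " ms");
                try {
                    Thread.sleep(100);
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return;
                }
            }
        }, "duration-info");

        player.start();
        info.start();
        try {
            player.join();
            info.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return framesRead.get();
    }

    public static void main(String[] args) {
        System.out.println(playAndReport(oneSecondOfSilence()) + " frames consumed");
    }
}
```

Because nothing is written to a sound device here, the player thread finishes almost immediately; with a real output line, the write call blocks and the info thread has time to report progress. The shared counter is an AtomicLong so both threads see consistent values without explicit locking.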

In this lesson we will write the implementation of the third thread, which reports the file size and the spent computer processor storage. Because we are using threads, that data will be measured every second.

Writing third thread: for file size info and spent computer processor storage
12:26

In this lesson we will write the implementation of the method that will be used by the third thread.

Writing method for file size info and spent computer processor storage
13:06
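A reporting method and the thread that calls it once per period could be sketched as follows. This is a sketch under a stated assumption: "spent computer processor storage" is interpreted here as the JVM heap currently in use, which the course page does not confirm. The class ResourceInfo and its method names are hypothetical.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;

class ResourceInfo {
    // Heap currently in use by this JVM, in bytes (assumed meaning of "spent storage").
    static long usedHeapBytes() {
        Runtime runtime = Runtime.getRuntime();
        return runtime.totalMemory() - runtime.freeMemory();
    }

    // One report line: size of the file plus current heap usage.
    static String describe(Path file) {
        try {
            return "file size: " + Files.size(file) + " bytes, used heap: "
                    + usedHeapBytes() + " bytes";
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Starts a daemon thread that prints a report every periodMillis until interrupted.
    static Thread startReporter(Path file, long periodMillis) {
        Thread reporter = new Thread(() -> {
            while (!Thread.currentThread().isInterrupted()) {
                System.out.println(describe(file));
                try {
                    Thread.sleep(periodMillis);
                } catch (InterruptedException e) {
                    return; // stop reporting when interrupted
                }
            }
        }, "resource-info");
        reporter.setDaemon(true);
        reporter.start();
        return reporter;
    }
}
```

Calling startReporter(path, 1000) gives the once-per-second measurement; interrupting the returned thread stops it, for example when playback ends.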
Sphinx voice recognition
5 Lectures 43:13

In this lesson we will create our project and write a microphone recognizer class

Creating our project and writing microphone recognizer
09:19

In this lesson we will start to write the recognition logic, define the grammar and the XML config file, and test voice recognition.

Implementing recognition logic, defining grammar file and testing recognition
10:13

In this lesson we will write a new voice recognition example that will be used in later lessons for opening and closing some programs.

Writing new example for voice recognition, writing microphone recognizer
06:36

In this lesson we will write the logic for opening and closing some programs with voice commands.

Writing program that will open and close various programs on voice recognition
07:12

In this lesson we will write the implementation of opening and closing programs through voice recognition with the Sphinx library.

Implementation of method for opening and closing programs with voice recognition
09:53
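Once the recognizer has produced a phrase, turning it into an action can be as simple as a lookup table. The sketch below assumes the recognized text arrives as a plain string and leaves out Sphinx itself; the class VoiceCommands, the phrases, and the choice of Notepad are hypothetical illustrations for a Windows machine, not the course's actual commands.

```java
import java.io.IOException;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

class VoiceCommands {
    // Hypothetical phrase-to-command table (Windows examples).
    private static final Map<String, List<String>> COMMANDS = new LinkedHashMap<>();
    static {
        COMMANDS.put("open notepad", Arrays.asList("notepad.exe"));
        COMMANDS.put("close notepad", Arrays.asList("taskkill", "/IM", "notepad.exe"));
    }

    // Normalizes case and whitespace, then looks up the command line for the phrase.
    static Optional<List<String>> commandFor(String recognizedText) {
        if (recognizedText == null) {
            return Optional.empty();
        }
        String key = recognizedText.trim().toLowerCase(Locale.ROOT).replaceAll("\\s+", " ");
        return Optional.ofNullable(COMMANDS.get(key));
    }

    // Starts the matching process; returns false when the phrase is unknown.
    static boolean execute(String recognizedText) throws IOException {
        Optional<List<String>> command = commandFor(recognizedText);
        if (!command.isPresent()) {
            return false;
        }
        new ProcessBuilder(command.get()).start();
        return true;
    }
}
```

Keeping the lookup separate from process launching means the phrase handling can be checked without starting any program, and new commands only require another table entry that matches a phrase in the grammar.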
Source code
1 Lecture 00:34

In this video students will find the link to the source code, which is available for download.

Source code
00:34
About the Instructor
ITSuff &Tricks
3.5 Average rating
21 Reviews
2,031 Students
2 Courses
Software development

Hi everybody. I have worked for 9 years as a PHP backend/web developer and 3 years as a Java developer. I am passionate about Java: I love the Spring framework, complex integrations, and web services. I love to research and play with various technologies like Docker and microservices. I also know Python (love-crush for the Flask framework), JavaScript, AngularJS, jQuery, SQL, XML, HTML, CSS, Adobe Flex, and so on. I love to investigate various APIs and I love to work on Linux and Windows.