What is text to speech (TTS) and speech to text (STT)
Text to speech, abbreviated as TTS, is a form of speech synthesis that converts text into spoken voice output. Text to speech systems were first developed to aid the visually impaired by offering a computer-generated spoken voice that would "read" text to the user.
Speech synthesis is the artificial production of human speech. A computer system used for this purpose is called a speech computer or speech synthesizer, and can be implemented in software or hardware products. A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech.
Speech to text - Speech recognition (SR) is the inter-disciplinary sub-field of computational linguistics that develops methodologies and technologies that enables the recognition and translation of spoken language into text by computers. It is also known as "automatic speech recognition" (ASR), "computer speech recognition", or just "speech to text" (STT). It incorporates knowledge and research in the linguistics, computer science, and electrical engineering fields.
Speech-to-text software is a type of software that effectively takes audio content and transcribes it into written words in a word processor or other display destination. This type of speech recognition software is extremely valuable to anyone who needs to generate a lot of written content without a lot of manual typing. It is also useful for people with disabilities that make it difficult for them to use a keyboard.
Speech-to-text software may also be known as voice recognition software.
Learn text to speech and speech to text recognition in Java
This course is for You - even if you are newbie, or enthusiastic programmer/developer who is interested in text to speech and speech to text recognition, you can benefit from this course. If you already know Java, and want to look deeper in Java in many directions, and if you also want to get knowledge about Java programming in field of text to speech and speech to text recognition, this course can help you and inform you about many things in that field, guiding you through four projects, step by step, with clear examples and writing code from scratch to the end. Java follows technology world in many directions, and TTS and STT is just one of them. In this course you will be able to find all in one place in this field which is used so much today.
Learn Java and Java API's
Even if you are new in programming, you will be able to understand the concepts and you will be shown how to write every single method, writing line by line - to finish successfully your projects. Course includes writing of four projects from scratch until finish. Link to source code included!
Contents and Overview
3 hours of Java programming with clear examples, explanations and solutions, including these topics:
In this lession we will write our class and method for getting all the voices
In this lesson we will go further with our project and add some more code to our class - we will reproduce some sound from our input.
In this lesson we will set some pitch, range and pitch shift in our input that will be reproduced according to those settings
In this lesson we will load text-to-speech with System properties, and use necessary resources after loading it.
In this lesson we will create our project and write method for getting some data from Google Translate service.
In this lesson we will show how to programatically get multilanguage speech from Google translate, and we will execute our method.
In this lesson we will load our text we want to be spoken from our textual file from our disc and then we will send that data as string to Google service method, so we could recieve output sound.
In this lesson we will create our project and play some music from our disc.
In this lesson we will start to write sound player class which will have threads.
In this lesson we will write implementation of threads - thread for actual sound reproduction and thread for information about sound duration.
In this lesson we will write implementation of third thread - for information about file size and about spent computer processor storage. Every second that data will be measured as we are using threads.
In this lesson we will write implementation of method what will be used by the third thread.
In this lesson we will create our project and write microphone recognizer class
In this lesson we will start to write recognition logic, define grammar and xml config file, and we will also test voice recognition.
In this lesson we will write new example for voice recognition, that will be used in later lessons for opening and closing of some programs.
In this lesson we will write logic for opening and close some programs with voce commands.
In this lesson we will write implementation of opening and closing programs through voice recognition with Sphinx library.
In this video students will be able to find link for source code available for download.