Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Mastering AI Voices Technologies
Rating: 2.8 out of 5(3 ratings)
467 students

Mastering AI Voices Technologies

Text To Speech, Voice Cloning, VITS, Piper, Tortoise TTS, Coqui AI, OpenVoices
Created byThanh Nguyen
Last updated 2/2025
English

What you'll learn

  • Use Text to speech systems
  • Can do voice cloning
  • Build AI Voice system
  • Finetune and improve text to speech system

Course content

7 sections27 lectures3h 11m total length
  • What is the course about?2:27

    Explore ai voice technologies, including text-to-speech and voice cloning, and learn to clone your own voice and build your own audio services for audiobooks, podcasts, or videos.

  • How to take the most from the course?3:32
  • How to use Google Colab?3:17
  • How to use Visual Studio Code?2:54

    Discover Visual Studio Code, a free open-source IDE with thousands of plugins and GitHub Copilot integration, ideal for backend or frontend development, with a simple install and familiar layout.

Requirements

  • Python basic

Description

Course: Mastering AI Voices Technologies


  • Part 1: Introduction to the Course: AI Voices Technology
    An overview of the course, how to approach learning, and an introduction to tools like Google Colab and Visual Studio Code for TTS development.

  • Part 2: Old Fashioned Text To Speech
    Exploring traditional TTS methods using libraries like pyttsx3 and gTTS, both offline and through Google Colab.

  • Part 3: Big Text To Speech Providers
    Introduction to major TTS providers such as 11lab, Speechify, and OpenAI, highlighting their services and capabilities.

  • Part 4: Open Source TTS: VITS
    A deep dive into VITS (Variational Inference Text-to-Speech) technology, including how to run it and why it's considered a powerful method for high-quality speech synthesis.

  • Part 5: Voice Cloning with Piper TTS
    An exploration of voice cloning techniques using Piper TTS, including setup, training, and voice cloning in multiple languages (e.g., English and Vietnamese).

  • Part 6: Voice Cloning with Open Voices
    Introduction to OpenVoices, focusing on voice cloning for English and multi-language support for cross-lingual voice synthesis.

  • Part 7: Voice Cloning with Tortoise TTS (XTTS)
    Learn about Tortoise TTS, an advanced system for voice cloning, and how to fine-tune it for different languages, including English and Vietnamese and more languages

Who this course is for:

  • Students in IT, AI, software engineering
  • Developers who want to build their own AI voice system
  • Anyone wants to learn about AI voice technologies