Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Mastering AI Models with Hands-On Google Colab Projects
Rating: 4.8 out of 5(2 ratings)
34 students

Mastering AI Models with Hands-On Google Colab Projects

Idea Implementation with HF, ReActAgent, DeepSeek, GPT-4o, GPT2, Llama3, Mistral-7B, NLLB, diffusers, HuBERT and Bark
Created byYu Li
Last updated 3/2025
English

What you'll learn

  • Create Question and Answer Chatbot Using Colab, Llama 3, Mistral-7B and GPT2 Models
  • Generate Images from Text Using Colab and Stable Diffusion Model
  • Image Recognition Using Colab and GPT-4o API
  • Generate Voice Using Colab and Bark Model
  • Generate Video Using Colab and text-to-video-ms-1.7b Model
  • Generate description of Image Using Colab and Deepseek Janus 1.3B Model
  • Using AI Agent ReActAgent to answer questions based on different PDF Files

Course content

12 sections37 lectures1h 19m total length
  • Use Cases in this course5:01

Requirements

  • No programming experience needed
  • Interesting of learning

Description

Table of Contents

  • How to use Google Colab

  • Translation text use NLLB Model

  • QnA with Llama 3  Model and Mistral-7B  Model based on FaQ

  • QnA with GPT-2  Model based on a JSON File

  • Image Generation with Stable Diffusion Model

  • Image Recognition with GPT-4o Model

  • Voice Generation with Bark Model

  • Text to Video with text-to-video-ms-1.7b Model

  • Bonus

    • Deepseek: Describe Image with Text Using the Janus-1.3B Model

    • AI Multi-Agent:  Building an AI Multi-Agent for Q&A on Two PDFs using ReAct, LlammaIndex, and OpenAI


Description


  • How to use Google Colab

    • How to create a Google Colab project

    • How to run a python project on Colab

    • How to choose GPU

    • How to use external File on Colab

    • How to use file from Google Driver on Colab

    • How to use Hugging Face token and ChatGPT token from Colab


  • Translation text use NLLB Model

    Based on a text in german, using NLLB Model we can translate it to english. NLLB Model supports for more than 200 languages as input and output.

  • QnA with Llama 3  Model and Mistral-7B  Model based on FaQ

    An FaQ text file was prepared, by using Llama 3 Model or Mistral-7B Model, questions about this FaQ file can be raised and the Chatbot will give answer based on the FaQ file.

  • QnA with GPT-2  Model based on a JSON File
    Information about a dummy company profile was prepared in a json file, by using GPT-2 Model, the Chatbot will give answer based on the company profile, if related question is raised.

  • Image Generation with Stable Diffusion Model

    Based on text, an image will be generated by using stable diffusion model.


  • Image Recognition with GPT-4o Model

    Given a image of a Sculpture, using GPT-4o model, it will tell you what is inside the image and give you information about the sculpture.


  • Voice Generation with Bark Model
    Based on the voice you provide, the Bark model tries to generate a voice file for text snippets.


  • Text to Video with text-to-video-ms-1.7b Model
    Given a text description about a scenario, the text-to-video-ms-1.7b will generate a video for you.

  • Bonus

    • Deepseek: Describe Image with Text Using the Janus-1.3B Model
      Given an image, the Janus-1.3B Model will generate a description of this image for you.

    • AI Multi-Agent:  Building an AI Multi-Agent for Q&A on Two PDFs using ReAct, LlammaIndex, and OpenAI
      Given two PDFs, ReActAgent AI Agent will use the correct one depending on the prompt to answer questions

Who this course is for:

  • People who are curious about AI
  • People who wanna interact with AI Models
  • People who are interested in showcases of using AI models
  • People who wanna run AI Models on Google Colab