Mastering AI Models with Hands-On Google Colab Projects
What you'll learn
- Create Question and Answer Chatbot Using Colab, Llama 3, Mistral-7B and GPT2 Models
- Generate Images from Text Using Colab and Stable Diffusion Model
- Image Recognition Using Colab and GPT-4o API
- Generate Voice Using Colab and Bark Model
- Generate Video Using Colab and text-to-video-ms-1.7b Model
- Generate description of Image Using Colab and Deepseek Janus 1.3B Model
- Using AI Agent ReActAgent to answer questions based on different PDF Files
Requirements
- No programming experience needed
- Interesting of learning
Description
Table of Contents
How to use Google Colab
Translation text use NLLB Model
QnA with Llama 3 Model and Mistral-7B Model based on FaQ
QnA with GPT-2 Model based on a JSON File
Image Generation with Stable Diffusion Model
Image Recognition with GPT-4o Model
Voice Generation with Bark Model
Text to Video with text-to-video-ms-1.7b Model
Bonus
Deepseek: Describe Image with Text Using the Janus-1.3B Model
AI Multi-Agent: Building an AI Multi-Agent for Q&A on Two PDFs using ReAct, LlammaIndex, and OpenAI
Description
How to use Google Colab
How to create a Google Colab project
How to run a python project on Colab
How to choose GPU
How to use external File on Colab
How to use file from Google Driver on Colab
How to use Hugging Face token and ChatGPT token from Colab
Translation text use NLLB Model
Based on a text in german, using NLLB Model we can translate it to english. NLLB Model supports for more than 200 languages as input and output.
QnA with Llama 3 Model and Mistral-7B Model based on FaQ
An FaQ text file was prepared, by using Llama 3 Model or Mistral-7B Model, questions about this FaQ file can be raised and the Chatbot will give answer based on the FaQ file.
QnA with GPT-2 Model based on a JSON File
Information about a dummy company profile was prepared in a json file, by using GPT-2 Model, the Chatbot will give answer based on the company profile, if related question is raised.Image Generation with Stable Diffusion Model
Based on text, an image will be generated by using stable diffusion model.
Image Recognition with GPT-4o Model
Given a image of a Sculpture, using GPT-4o model, it will tell you what is inside the image and give you information about the sculpture.
Voice Generation with Bark Model
Based on the voice you provide, the Bark model tries to generate a voice file for text snippets.Text to Video with text-to-video-ms-1.7b Model
Given a text description about a scenario, the text-to-video-ms-1.7b will generate a video for you.
Bonus
Deepseek: Describe Image with Text Using the Janus-1.3B Model
Given an image, the Janus-1.3B Model will generate a description of this image for you.AI Multi-Agent: Building an AI Multi-Agent for Q&A on Two PDFs using ReAct, LlammaIndex, and OpenAI
Given two PDFs, ReActAgent AI Agent will use the correct one depending on the prompt to answer questions
Who this course is for:
- People who are curious about AI
- People who wanna interact with AI Models
- People who are interested in showcases of using AI models
- People who wanna run AI Models on Google Colab
Instructor
I have been a software developer since 2009. I enjoy creating software that solves problems and find teaching software development both exciting and challenging. I am particularly interested in development projects involving AI, IoT, robotics, and E-Learning.I look forward to connecting and networking with you as we explore the fascinating world of software development together.