Generative AI (English Version): Unleashing Next-Gen AI

Rating: 4.1 (96 reviews)

Generative AI - English version

Created by Coursat.ai Dr. Ahmad ElSallab

Last updated 4/2023

English

What you'll learn

Generative AI definition, areas of application, and mappings like txt2txt, img2txt, txt2img and txt2voice
How ChatGPT works, and the underlying tech behind it, such as GPT, Large-Scale Language Models (LLM) and Transformers
How Latent Diffusion, StableDiffusion and DALL-E systems work
Generative Adversarial Networks (GANs) and Variational Auto Encoders (VAE)
The good, bad and ugly faces of GenAI, and how to adapt to the new tech
Build a ChatGPT clone using OpenAI API and Streamlit
Build NLP applications using OpenAI API, like Summarization, Text Classification and fine-tuning GPT models
Build NLP applications using the Huggingface transformers library, like Language Models, Summarization, Translation, QA systems and others
Build a Midjourney clone application using OpenAI DALL-E and StableDiffusion on Huggingface

Course content

8 sections • 59 lectures • 7h 20m total length

Introduction 2:31
Course overview 24:11

Unimodal mappings: Txt2txt and Language models 5:03
Statistical Language Models (SLM) 12:14
Neural Language Models (NLM) - Char level 8:55
Neural Language Models (NLM) - Word level 3:41
SLM and NLM in Python and Keras 14:27
Seq2seq models 7:20
Seq2seq + Attention models 6:10
Explore seq2seq with attention, showing how encoder hidden states and attention scores guide decoding, relieving the fixed-length context bottleneck and motivating the shift to transformer-style ideas.
Transformers 12:29
Huggingface Transformer Pipeline 10:32
Learn to use the Huggingface transformer pipeline to connect pre-trained models to inputs, handle preprocessing and post-processing, and perform tasks like sentiment analysis, zero-shot classification, NER, QA, and translation.
Large-Scale Language Models (LLM) - Transfer Learning in NLP 13:09
Pre-trained Transformers 5:52
BERT 4:49
Explore how transfer learning from a pre-trained transformer encoder enables fine-tuning for text classification, using BERT’s CLS token and self-attention, with pre-training based on masked-language modeling and next-sentence prediction.
GPT 10:06
ChatGPT 11:58
OpenAI API 9:37
GPT-3 Finetuning 6:12
GPT-3 Chatbot 9:11
ChatGPT Clone in Google Colab 8:41
ChatGPT Clone in Streamlit 6:47
ChatGPT Clone Exercise 1:52
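
The Seq2seq + Attention lecture in this section describes attention scores over encoder hidden states guiding each decoding step. A minimal sketch of that idea in plain Python follows. It assumes dot-product scoring, which may differ from the scoring function used in the course, and the function names and toy vectors are illustrative only.

```python
import math

def softmax(scores):
    """Turn raw scores into weights that sum to 1 (shifted for numerical stability)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention_context(decoder_state, encoder_states):
    """Score each encoder hidden state against the decoder state (dot product,
    an assumption), then return the attention weights and the weighted sum
    of encoder states (the context vector)."""
    scores = [sum(d * h for d, h in zip(decoder_state, hs)) for hs in encoder_states]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    context = [sum(w * hs[i] for w, hs in zip(weights, encoder_states)) for i in range(dim)]
    return weights, context

# Toy example: three encoder hidden states of size 2 and one decoder state.
encoder_states = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
decoder_state = [2.0, 0.0]
weights, context = attention_context(decoder_state, encoder_states)
```

Because the decoder recomputes the weights at every step, it can look at any encoder state directly instead of relying on a single fixed-length summary vector.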

Img2Img Encoder-Decoder 4:13
Auto Encoder (AE) 22:34
AE Visualization 11:32
Variational Auto Encoder (VAE) 10:30
Conditional VAE 6:36
Coding AE in Keras 15:15
Generative Adversarial Nets (GANs) 2:17
Generating images from GANs 0:37
Training GANs 10:42
Train GANs by balancing the discriminator and generator in a minimax game, alternating discriminator warm-up and generator steps so the generator learns to turn z noise into realistic images that trick the discriminator.
Coding GAN training in Keras 6:16
Warm up the discriminator in Keras with real data (label one) and fake data (label zero); then train the GAN with a frozen discriminator to update the generator via label flipping.
DCGAN 3:42
Conditional GANs 9:53
Condition image generation by feeding class labels to both the generator and the discriminator using one-hot encoding. Use embeddings for categorical inputs and follow the warm-up and fine-tuning training loop for conditional GANs.
AttributeGAN 6:05
AttributeGAN extends conditional GANs with multi-label attributes, enabling controlled image editing via a generator encoder-decoder and an attribute classifier, trained with reconstruction and cross-entropy losses.
How Good are GANs today? 1:16
Domain adaptation with pix2pix and CycleGAN 11:17
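
The GAN lectures in this section describe a discriminator warm-up on real data labeled one and fake data labeled zero, a generator step with flipped labels, and class conditioning with one-hot encoding. The sketch below shows only how those batches and labels could be assembled, in plain Python without Keras. All function names and toy values are hypothetical, and the actual model updates are left out.

```python
import random

def sample_noise(batch_size, z_dim, rng):
    """Draw a batch of z noise vectors for the generator."""
    return [[rng.gauss(0.0, 1.0) for _ in range(z_dim)] for _ in range(batch_size)]

def discriminator_batch(real_samples, fake_samples):
    """Discriminator step: real samples get label 1, fake samples get label 0."""
    inputs = list(real_samples) + list(fake_samples)
    labels = [1] * len(real_samples) + [0] * len(fake_samples)
    return inputs, labels

def generator_labels(batch_size):
    """Generator step (discriminator frozen): fakes are labeled 1, i.e. flipped,
    so the loss rewards the generator for fooling the discriminator."""
    return [1] * batch_size

def one_hot(class_id, num_classes):
    """One-hot encode a class label."""
    return [1.0 if i == class_id else 0.0 for i in range(num_classes)]

def condition_on_class(noise, class_ids, num_classes):
    """Conditional GAN input: append the one-hot class label to each z vector."""
    return [z + one_hot(c, num_classes) for z, c in zip(noise, class_ids)]

rng = random.Random(0)
noise = sample_noise(batch_size=4, z_dim=3, rng=rng)
real = [[0.5], [0.7]]   # stand-ins for real training samples
fake = [[0.1], [0.2]]   # stand-ins for generator outputs
d_inputs, d_labels = discriminator_batch(real, fake)
g_labels = generator_labels(len(noise))
conditioned = condition_on_class(noise, [0, 2, 1, 2], num_classes=3)
```

In a real training loop these two label schemes would alternate: one discriminator update on `d_inputs` and `d_labels`, then one generator update on fresh noise with `g_labels`.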

Multimodal Txt2Img generation 3:26
Explore multimodal mappings in generative AI through text-to-image generation, encoder-decoder architectures, and the diffusion models behind Midjourney and DreamStudio. Discover how text prompts guide image creation from scratch, beyond supervised learning.
Diffusion models 14:27
Latent Diffusion Models (LDM) 2:24
CLIP 6:28
StableDiffusion 3:27
Online tools for txt2img: DreamStudio and Midjourney 1:51
OpenAI API - DALL-E 3:49
Huggingface - StableDiffusion 2:00
Exercise - Midjourney clone 1:05
Img2Txt generation - Image Captioning 3:44
Txt2Voice generation - VALL-E 1:14

Requirements

AI, ML and Deep Learning foundations
NLP: RNN, LSTM, Transformers basics
CV: ConvNets

Description

Hello and welcome to a new journey in the vast area of Generative AI.

Generative AI is changing how we interact with machines, mobiles and computers. It is changing our day-to-day life, where AI is now an essential component.

This new way of interaction has many faces: the good, the bad and the ugly.

In this course we will sail the vast sea of Generative AI. We will cover the theoretical foundations of generative models across different modality mappings: Txt2Txt, Img2Img, Txt2Img, Img2Txt, Txt2Voice and Voice2Text. We will discuss the SoTA models in each area at the time of this course. This includes Transformers, Language Models and Large LMs (LLMs) like Generative Pre-trained Transformers (GPT), paving the way to ChatGPT for text generation; GANs, VAEs and Diffusion models like DALL-E and StableDiffusion for image generation; and VALL-E for voice generation.

In addition, we will cover the practical aspects: we will build simple Language Models and a ChatGPT clone using OpenAI APIs, taking a tour of OpenAI use cases with GPT-3.5, ChatGPT and DALL-E. We will also cover Huggingface transformers and StableDiffusion.

Hope you enjoy our journey!

Who this course is for:

AI/ML Practitioners, Developers, Engineers and Researchers
NLP Engineers or Researchers
CV Engineers or Researchers
Data Scientists

Course content

Introduction: 2 lectures • 27min

What is Generative AI?: 5 lectures • 40min

Txt2Txt GenAI: 20 lectures • 2hr 49min

Img2Img GenAI: 15 lectures • 2hr 3min

Multi-modal GenAI: 11 lectures • 44min

The good, the bad and the ugly: 4 lectures • 34min

Conclusion: 1 lecture • 4min

Material: 1 lecture • 1min
