
Explore seq2seq with attention, showing how encoder hidden states and attention scores guide decoding, reducing bottlenecks and enabling parallelism over the shift to transformer-style ideas.
Learn to use the Huggingface transformer pipeline to connect pre-trained models to inputs, handle preprocessing and post-processing, and perform tasks like sentiment analysis, zero-shot classification, NER, QA, and translation.
Explore how transfer learning from a pre-trained transformer encoder enables fine-tuning for text classification, using BERT’s CLS token and self-attention, built on next-sentence prediction and masked-language modeling.
Train gan networks by balancing the discriminator and generator in a minimax game, alternating warm-up and generator steps to produce realistic images from z noise and trick the discriminator.
Warm up the discriminator in Keras with real data (label one) and fake data (label zero); then train a GAN with a frozen discriminator to update generator via label flipping.
Condition image generation by feeding class labels to generator and discriminator with one hot encoding. Use embeddings for categorical input and follow warm-up and fine-tuning training loop for conditional GANs.
AttributeGAN extends conditional GANs with multi-label attributes, enabling controlled image editing via a generator encoder-decoder and an attribute classifier, trained with reconstruction and cross-entropy losses.
Explore multimodal mappings in generative AI through text-to-image generation, encoder-decoder architectures, and diffusion models behind Midjourney and Dreamstudio. Discover how text prompts guide image creation from scratch beyond supervised learning.
AI automation shifts tasks from people to tools, changing interfaces and roles without eliminating designers or writers; adopt and regulate usage while embracing open-source democratization and cautious experimentation.
Hello and Welcome to a new Journey in the vast area of Generative AI
Generative AI is changing our definition of the way of interacting with machines, mobiles and computers. It is changing our day-to-day life, where AI is an essential component.
This new way of interaction has many faces: the good, the bad and the ugly.
In this course we will sail in the vast sea of Generative AI, where we will cover both the theoretical foundations of Generative models, in different modalities mappins: Txt2Txt, Img2Txt, Txt2Img, Img2Txt and Txt2Voice and Voice2Text. We will discuss the SoTA models in each area at the time of this course. This includes the SoTA technology of Transformers, Language models, Large LM or LLM like Generative Pre-trained Transformers (GPT), paving the way to ChatGPT for Text Generation, and GANs, VAE, Diffusion models like DALL-E and StabeDiffusion for Image Generation, and VALL-E foe Voice Generation.
In addition, we will cover the practical aspects, where we will build simple Language Models, Build a ChatGPT clone using OpenAI APIs where we will take a tour in OpenAI use cases with GPT3.5 and ChatGPT and DALL-E. In addition we will cover Huggingface transformers and StableDiffusion.
Hope you enjoy our journey!