
Introduction to the Course
Deep Learning in the context of AI and Machine Learning
Deep Learning: Getting Rules from Data + Answers
The human brain: an inspiration for many of today's AI "godfathers"
Biological Neuron Action Potential (Signal Propagation in "Real" Neurons)
Activation Function of an Artificial Neuron
Comparison of a Biological Neural Network and a Simple Artificial Neural Network
We compare the human brain's 16 billion neurons with about 7000 connections each, totaling 112 trillion connections, to simple artificial neural networks, illustrating the vast complexity gap.
Analyze how GPT three with 175 billion parameters and trillion-parameter models like Pangas illustrate exponential growth in AI complexity, suggesting future AI could rival human cognitive capacity.
Follow back propagation in a neural network, from input through a hidden layer to output, evaluate prediction error, and apply gradients to adjust weights.
Calculate accuracy as (true positives + true negatives) / total, demonstrated with true positives 161 and true negatives 129 out of 320, illustrating overall model performance.
Explore how recall, or sensitivity, gauges a model's ability to identify positive cases. Pair it with specificity to reduce false positives in fraud detection and disease diagnostics.
Explore the course code repository on GitHub, open examples in Google Colab, switch runtime to GPU, and run code cells or all cells to align with the shared environment.
See how a convolutional neural network is structured and implemented. Gain insight into CNN concepts through a first general view and a typical code implementation.
Explore how neural networks process images by using ReLU in hidden layers and softmax in the output layer for multi-class classification, with 28x28 grayscale inputs and RGB color encoding.
Explore how a convolutional neural network processes input data, images as pixel grids, grayscale values 0-255, and color channels red, green, blue, for filtering and image recognition.
Slide a three by three filter over the input image, perform element-wise multiplication with each patch, and sum results to form feature map, with filter values learned during CNN training.
CNNs, with their architecture, apply to many tasks by combining convolution, ReLU, and pooling layers to learn complex features for classification, object detection, segmentation, and probabilistic control.
Engage with interactive playgrounds to test and visualize CNN predictions, deepening your understanding of CNN-based deep learning concepts through hands-on code demos and GitHub-backed experimentation.
Shows a convolutional neural network for mnist digit classification, highlighting training dynamics and hyperparameters: learning rate, batch size, momentum, nesterov momentum; architecture with conv, relu, pooling, softmax, and random crops.
Explore recurrent neural networks and how they recognize patterns in sequences, enabling time series forecasting through variable-length data and sharing parameters across the sequence.
Identify the vanishing gradients problem, where gradients shrink through repeated multiplication of weights less than one, causing updates to vanish and the training process to stall.
RNNs capture differences in sequence order to understand context in time series. They distinguish subtle meaning changes with the same words, a key strength for modeling sequential data.
Explore how LSTM networks overcome standard RNN limits with forget, store, update, and output gates that control the cell state and sustain gradient flow for long-term dependencies.
Utilize an LSTM-based time-series model to predict the S&P 500 index price using historical data since 1990, including volatility, rates, unemployment, sentiment, and new indicators beyond the original paper.
Explore how indicators behave over time in time series data by visualizing historical trends and their relation to closing prices, building an intuitive understanding before modeling.
Explore transformers as the leading model for language processing since 2017, powering ChatGPT, and extending to image and time sequence processing, while noting emerging successor architectures.
Understand how the transformer model uses an encoder-decoder architecture, with multi-head attention, feed-forward layers, and positional encodings, to compute output probabilities via softmax for efficient language processing.
Explore how word embeddings map semantic relationships in a vector space, showing how similar contexts link words like man and woman, guiding translation and text generation.
Explain how positional encoding in the transformer adds position information to input embeddings, enabling parallel sequence processing and unique sine and cosine encodings for each token position.
Explore the transformer architecture with a focus on multi-head attention, enabling the model to attend to different input parts simultaneously and boost translation and summarization performance.
See how a single attention head in a transformer links the word dog to other words, using query, keys, and values to weight dependencies and focus on sentence parts.
The transformer generates key vectors from word embeddings using a key weight matrix, enabling comparison with query vectors to compute attention scores.
Process input embeddings with positional encodings through self-attention and feed-forward networks in a transformer encoder to capture complex patterns and pass the final result for further processing.
Explore the transformer encoder architecture, including add and normalize layers, residual connections, and the combined power of self-attention and feedforward networks to capture language dependencies.
Explore transformer models in language processing through code demos and transformers beyond code, with playgrounds from various companies and platforms.
Welcome to the Deep Learning and Generative Artificial Intelligence course! This comprehensive course is designed for anyone interested in diving into the exciting world of deep learning and generative AI, whether you're a beginner with no programming experience or an experienced developer looking to expand your skill set.
What You Will Learn:
Foundations of Deep Learning and Artificial Neural Networks: Gain a solid understanding of the basic concepts and architectures that form the backbone of modern AI.
Convolutional Neural Networks (CNNs): Learn how to implement and train CNNs for image classification and object detection tasks using Python and popular deep learning libraries.
Long Short-Term Memory (LSTM) Networks: Explore the application of LSTM networks to predict and analyze time series data, enhancing your ability to handle sequential data.
Transformer Models: Dive into the world of Transformer models, including GPT-type models, and learn how to construct, fine-tune, and deploy these models for various natural language processing tasks.
Generative Adversarial Networks (GANs): Understand the principles behind GANs and learn how to create and train them to generate realistic synthetic images and data.
Variational Auto-Encoders (VAEs): Discover how to build and utilize VAEs for data compression and generation, understanding their applications and advantages.
Style Transfer and Stable Diffusion: Experiment with style transfer techniques and stable diffusion methods to creatively alter and enhance images.
Course Features:
Interactive Coding Exercises: Engage with hands-on coding exercises designed to reinforce learning and build practical skills.
User-Friendly Demos and Playgrounds: For those who prefer a more visual and interactive approach, our course includes demos and playgrounds to experiment with AI models without needing to write code.
Real-World Examples: Each module includes real-world examples and case studies to illustrate how these techniques are applied in various industries.
Project-Based Learning: Apply what you've learned by working on projects that mimic real-world scenarios, allowing you to build a portfolio of AI projects.
Who Should Take This Course?
Aspiring AI Enthusiasts: Individuals with no prior programming experience who want to understand and leverage AI through intuitive interfaces.
Developers and Data Scientists: Professionals looking to deepen their understanding of deep learning and generative AI techniques.
Students and Researchers: Learners who want to explore the cutting-edge advancements in AI and apply them to their studies or research projects.