Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Google Cloud Machine Learning Engineer Certification Prep

Name: Google Cloud Machine Learning Engineer Certification Prep
Rating: 4.5 (2097 reviews)

Building, Deploying, and Managing Machine Learning Services at Scale

Created byDan Sullivan, Dan Sullivan

Last updated 2/2024

English

What you'll learn

Understand how to use Google Cloud services to build, deploy, and manage machine learning models in production
Use Vertex AI, BigQuery, Cloud Dataflow, and Cloud Dataproc in ML pipelines
Tune training and serving pipelines
Choose appropriate infrastructure, including virtual machines, containers, GPUs and TPUS
How to secure data in ML operations while protecting privacy
Monitor machine learning models in production and know when to retrain models
Explore datasets to identify problems and resolve issues such as class imbalance and insufficient data

Course content

17 sections • 71 lectures • 5h 15m total length

Introduction1:38
Frame machine learning problems from a business perspective, decide when to apply machine learning, design scalable solutions, build data pipelines and features, develop models, test them, and monitor performance.
How to Get Help When You are Stuck1:14
Open the course console, use the overview and QR tabs to search questions, post new ones, and view current or all lectures, or message Dan Sullivan on LinkedIn for answers.

Identifying Business Problems that Benefit from ML7:49
Assess whether a problem suits machine learning by examining data size and task type, from classification and identification to prediction and segmentation, with examples like spam, fraud, and image analysis.
Defining ML Success Criteria5:45
Define the project scope and success criteria, identify the problem type (classification, prediction, or clustering), and establish metrics like accuracy, precision, and recall alongside business goals.
Steps to Building ML Models7:55
Utilizing ML Models in Production3:30
Learn to deploy machine learning models in production using ML ops, CI/CD pipelines, monitoring, and scalable infrastructure. Explore data validation, data preparation, retraining, and model registry for reliable, repeatable deployment.
Quiz

Supervised Learning - Classification8:24
Supervised learning uses labeled data to solve classification and regression, leveraging features from structured data, images, or text, with algorithms like logistic regression, decision trees, neural networks, and ensemble methods.
Supervised Learning - Regression3:25
Use regression algorithms to predict numeric values, including linear, polynomial, and decision tree regressions. Build a baseline model with linear regression and minimize prediction error using MSE, RMSE, and MAE.
Unsupervised Learning5:45
explains unsupervised learning without label data, covering clustering with k-means, association rules, and dimensionality reduction via principal component analysis and auto encoders, with grouping, anomaly detection, and data compression.
Semi-supervised Learning3:10
Discover semi-supervised learning that blends labeled and unlabeled data to build models when labeling is costly, using anchor points and clustering on low-dimensional manifolds.
Reinforcement Learning2:51
Explore reinforcement learning, where an agent learns from trial and error via positive or negative feedback from an environment to maximize rewards, as in games or robotics in complex environments.
ML Model Input Structure5:45
Convert inputs into numeric feature vectors by encoding continuous, discrete, and categorical attributes, flattening images, and using word counts or term frequency inverse document frequency, then train and evaluate.
ML Model Output Structure1:57
Model output structure varies by task; classification yields a category indicator, regression a numeric value, clustering a group of items, and PCA a set of vectors, sometimes including probability distributions.
Risks to Successful ML Model Development3:47
Identify data-related and process risks that threaten successful machine learning model development. Address insufficient data, imbalanced data, data quality issues, bias, labeling consistency, data poisoning, and privacy and confidentiality compliance.
Quiz

3 Categories of Machine Learning Problems3:16
Explore the three categories of machine learning—unsupervised learning, supervised learning, and reinforcement learning—covering clustering, anomaly detection, principal component analysis, regression, classification, and fraudulent credit card transactions.
2 Approaches to Machine Learning1:05
Compare two broad approaches to machine learning: symbolic artificial intelligence and deep learning with neural networks, and trace their dominance and the rise of large neural nets.
Symbolic Machine Learning5:44
Model domains with symbols and apply rule-based inferences in symbolic machine learning, using decision trees and random forests, Naive Bayes, SVM, and kNN on iris classification.
Neural Networks and Machine Learning4:20
Explore how neural networks and deep learning map input features to outputs using weighted inputs, non-linear activation such as sigmoid, tanh, and ReLU, across multiple layers.

Features and Labels2:29
Explore how features are input attributes that describe an entity and how labels are the predicted outputs, illustrated with iris, selling price, fraud detection, and income examples.
Feature Engineering5:17
Enhance model quality by engineering features through transformations, deriving new features and mapping existing ones, including one-hot encoding, scaling, bucketing, and feature crosses, with time series and text data.
Model Building3:48
Define the problem, collect data, and establish an evaluation method, then train, validate, and test a machine learning model through iterative data preparation and hyperparameter tuning.
Evaluating Models4:49
Evaluate models using accuracy, precision, and recall for classification and mean squared error for regression; learn to use a confusion matrix and separate train and test data.
Gradient Descent and Backpropagation7:23
Discover how gradient descent minimizes loss by updating weights with a learning rate and how back propagation efficiently computes gradients to train neural nets.
Troubleshooting Machine Learning Models5:10
Explore common model problems such as underfitting and overfitting, and learn fixes like adjusting complexity, training time, and regularization. Understand bias and variance tradeoffs and their impact on generalization.
Building Models in Google Cloud3:41
Explore practical model building in Google Cloud using AutoML (Tables, vision, language), AI Platform Training, Kubeflow, DataProc with Spark ML, and BigQuery ML for SQL-based training and deployment.
Using Pretrained Models2:38
Explore Google's pre-trained models for vision, language, and conversation, including transfer learning, video metadata, named entities, sentiment analysis, translation, speech to text, text to speech, and Dialogflow.

Overview of ML Pipelines6:11
Explore machine learning pipelines from development to production, using containers, orchestration, feature store, and model registry to enable scalable, repeatable deployments, monitoring, and automated continuous delivery.
3 Steps to Production3:42
Wrap your model as a restful service, containerize it, and deploy it into production with Kubernetes or Cloud Run, enabling health checks and monitoring.
Comprehensive ML Services3:39
Leverage queue flow on Kubernetes for end-to-end ml workflows—scaffolding, pipelines, hyperparameter tuning, and model serving—plus Vertex AI AutoML on Google Cloud for diverse data.
Quiz

Introduction to Vertex AI3:04
Explore Vertex AI as a comprehensive platform for preparing datasets, labeling tasks, pipelines, model training, experiments, and model registry, and delivering predictions via endpoints or batch jobs.
Vetex AI Datasets5:53
Master Vertex AI data sets, abstracting storage details into a single unit for training, with support for tabular, image, text, and video data from cloud sources.
Vertex AI Featurestore4:35
Explore how Vertex AI feature store manages prepared data by storing features from transformed data, using entity types and features for house sale price, and monitor distributions with alerts.
Vertex AI Workbences3:43
Explore Vertex AI workbench notebooks, comparing managed notebooks and user managed notebooks. Learn how to tailor environments with PyTorch, OS choices, GPUs, security, and networking for deep learning workflows.
Vetex AI Training5:23
Train a tabular model in Vertex II with AutoML on a corrected dataset, then evaluate with precision-recall, ROC, and the confusion matrix, and iterate with feature engineering to improve performance.
Introduction to Cloud Storage7:55
Explore cloud storage for machine learning, focusing on object storage for large training data and unstructured data. Learn storage classes, redundancy, lifecycle policies, retention, object holds, and security controls.
Introduction to BigQuery6:11
Explore how BigQuery, a managed Google Cloud analytics database, enables data warehousing with SQL, scales to petabytes, and supports datasets, tables, views, models, and partitioning with clustering to reduce scanning.
Introduction to Cloud Dataflow2:52
Introduction to Cloud Dataproc3:21
Quiz

Virtual Machines and Containers6:12
Explore creating scalable machine learning production environments with Google Cloud Compute Engine virtual machines, managed instance groups, and container orchestration using Cloud Run and Kubernetes Engine.
GPUs and TPUs2:36
Choose between GPUs and TPUs on Google Cloud for deep learning training, considering precision, cost, and scalability across model sizes.
Edge Devices2:26
Deploy ML models to edge devices to reduce latency in industrial settings, vehicle fleets, and remote sensors. Optimize for constrained memory and CPU with TensorFlow Lite.
Securing ML Models5:31
Protecting Privacy in ML Models6:19
Identify sensitive data in datasets, protect it without harming model performance, and establish governance with secure access, encryption in transit and at rest, masking, tokenization, and data coarsening.
Quiz

Basic Statistics for Data Exploration3:19
Explore descriptive statistics to summarize data, using measures like mean, median, variance, and standard deviation, and understand distributions, central tendency, and data spread for both numeric and categorical features.
Encoding Data5:25
Feature Selection4:26
Practice feature selection to train efficient, high-performing models. Apply Pearson's correlation, Spearman's rank coefficient, ANOVA, Kendall's rank coefficient, chi square, and mutual information as described.
Class Imbalance6:15
Feature Crosses4:04
Use feature crosses to create synthetic features by multiplying two features, boosting predictive performance on nonlinear relationships with non neural network algorithms.
TensorFlow Transforms2:34
Quiz

Organizing and Optimizing Training Sets4:39
Organize training sets using Vertex II managed data sets to label, annotate, track data lineage, and generate statistics, while automatically splitting data into training, testing, and validation sets.
Handling Missing Data5:59
Handle missing data in training sets by deleting rows, imputing with mean/median/mode, or carrying forward last observed values; weigh simplicity against bias and data leakage, especially in time series.
Handling Outliers in Data6:01
Identify why outliers occur, from errors to natural minority cases, and apply z-scores, interquartile range, box plots, and DBSCAN to detect and decide how to handle them.
Avoiding Data Leakage3:13
Identify data leakage, where training data include information unavailable at prediction time, and learn examples like future-based imputations, session total counts, and city proxies that bias model performance.
Quiz

Requirements

Familiarity with basic cloud concepts
Understanding of some use cases of machine learning

Description

Machine Learning Engineer is a rewarding, in demand role, and increasingly important to organizations moving building data intensive services in the cloud. The Google Cloud Professional Machine Learning Engineer certification is one of the field's most recognized credentials. This course will help prepare you to take and pass the exam. Specifically, this course will help you understand the details of:

Building and deploying ML models to solve business challenges using Google Cloud services and best practices for machine learning
Aspects of machine learning model architecture, data pipelines structures, optimization, as well as monitoring model performance in production
Fundamental concepts of model development, infrastructure management, data engineering, and data governance
Preparing data, optimizing storage formats, performing exploratory data analysis, and handling missing data
Feature engineering, data augmentation, and feature encoding to maximize the likelihood of building successful models
Understand responsible AI throughout the ML development process and apply proper controls and governance to ensure fairness in machine learning models.

By the end of this course, you will know how to use Google Cloud services for machine learning and just as importantly, you will understand machine learning concepts and techniques needed to use those services effectively.

Unlike courses that set out to teach you how to use particular Google Cloud services, this course is designed to teach you services as well as all the topics covered in the Google Cloud Professional Machine Learning Exam Guide, including machine learning fundamentals and techniques.

The course begins with a discussion of framing business problems as machine learning problems followed by a chapter on the technical framing on ML problems. We next review the architecture of training pipelines and supporting ML services in Google Cloud, such as:

Vertex AI Datasets
AutoML
Vertex AI Workbenches
Cloud Storage
BigQuery
Cloud Dataflow
Cloud Dataproc.

Machine learning and infrastructure and security are reviewed next.

We then shift focus to building and implementing machine learning models starting with managing and preparing data for machine learning, building machine learning models, and training and testing machine learning models. This is followed by chapters on machine learning serving and monitoring and tuning and optimizing both the training and serving of machine learning models.

Machine learning operations, also known as MLOps, borrow heavily from software engineering practices. As a machine engineer, you will use your understanding of software engineering practices and apply them to machine learning. Machine learning engineers know how to use ML tools, build models, deploy to production, and monitor ML services. They also know how to tune pipelines and optimize the use of compute and storage resources.

Machine learning engineers and data engineers complement each other. Data engineers build services and pipelines for collecting, storing, and managing data while machine learning engineers use those data services as a starting point for accessing data and building ML models to solve specific business problems.

Who this course is for:

ML Engineers who wish to pass the Google Cloud Professional Machine Learning certification exam.
Beginner machine learning engineers wanting to understand MLOps
Software developers who want to use ML services to use ML as an alternative to coding solutions
Cloud architects who want to understand how to design for machine learning serivces
Data engineers who want to expand their skillset to include machine learning operations
Data analysts and data scientists who want to use machine learning in their work.

Google Cloud Machine Learning Engineer Certification Prep

What you'll learn

Explore related topics

Course content

Introduction2 lectures • 3min

Framing Business Problems as Machine Learning Problems4 lectures • 25min

Technical Framing of ML Problems8 lectures • 35min

Introduction to Machine Learning4 lectures • 14min

Building Machine Learning Models8 lectures • 35min

Machine Learning Training Pipelines3 lectures • 14min

Machine Learning and Related Google Cloud Services9 lectures • 43min

Machine Learning Infrastructure and Security5 lectures • 23min

Exploratory Data Analysis and Feature Engineering6 lectures • 26min

Managing and Preparing Data for Machine Learning4 lectures • 20min

Requirements

Description

Who this course is for: