Mastering AWS Glue, QuickSight, Athena & Redshift Spectrum
4.3 (959 ratings)
Course Ratings are calculated from individual students’ ratings and a variety of other signals, like age of rating and reliability, to ensure that they reflect course quality fairly and accurately.
5,055 students enrolled

Mastering AWS Glue, QuickSight, Athena & Redshift Spectrum

Master serverless analytics with AWS Glue, QuickSight, Athena, & Redshift Spectrum (includes preview features with labs)
Bestseller
4.3 (959 ratings)
Course Ratings are calculated from individual students’ ratings and a variety of other signals, like age of rating and reliability, to ensure that they reflect course quality fairly and accurately.
5,055 students enrolled
Created by Siddharth Mehta
Last updated 5/2020
English
Current price: $69.99 Original price: $99.99 Discount: 30% off
5 hours left at this price!
30-Day Money-Back Guarantee
This course includes
  • 18.5 hours on-demand video
  • 2 articles
  • 14 downloadable resources
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
Training 5 or more people?

Get your team access to 4,000+ top Udemy courses anytime, anywhere.

Try Udemy for Business
What you'll learn
  • Confidently work with AWS Serverless services to develop Data Catalogue, ETL, Analytics and Reporting on a Data Lake
  • Develop deep knowledge in Glue, Athena, Redshift Spectrum and QuickSight
  • Build a serverless data lake on AWS using structured and unstructured data
  • Architect Serverless Analytics solutions on AWS cloud platform
Course content
Expand all 193 lectures 18:26:56
+ Introduction
4 lectures 14:28

Instructor and Course Introduction

Preview 05:23

Pre-requisites - What you'll need for this course

Pre-requisites - What you'll need for this course
04:20

Course Objectives

Course Objectives
01:46

Course Content, Convention and Resources

Course Content, Convention and Resources
02:59
+ AWS Serverless Analytics and Data Lake Basics
3 lectures 31:33

Section Agenda

Preview 01:45

Learn about basics of Serverless Computing and which AWS Services fits into it

What is Serverless Computing ?
09:54

Learn basics of AWS Serverless Data Lake Architecture

Basics of AWS Serverless Data Lake Architecture
19:54
+ Amazon S3 - Test-Data Setup
3 lectures 08:16

Section Agenda

Preview 01:21

Setup sample data on S3 buckets that would be used throughout this course

Lab: Sample Data Setup on Amazon S3
03:26

Configure S3 Storage Analytics

Lab: Amazon S3 - Analytics Configuration
03:29
+ Amazon Redshift - Cluster and Sample Data Setup
5 lectures 49:07

Section Agenda

Preview 01:48

Introduction to Amazon Redshift

Amazon Redshift - Introduction and Pre-requisites
05:11

Develop Amazon Redshift Cluster

Amazon Redshift - Developing a Redshift Cluster
15:00

Install and setup SQL Client to work with Amazon Redshift

Amazon Redshift - Installing Client Tools
13:24

Load sample data in Redshift cluster

Amazon Redshift - Installing Sample Data
13:44
+ AWS Glue - Architecture and Setup
9 lectures 01:16:49

Section Agenda

Preview 01:40

Learn AWS Glue Architecture with diagrams

AWS Glue - Architecture
14:10

Learn frequently used AWS Glue Terms and their meanings

AWS Glue - Terminology
10:50

Learn about different applications and features of AWS Glue

AWS Glue - Applications
07:42

Learn internal architecture of AWS Glue

AWS Glue - Internals
12:34

Learn about the cost economics of AWS Glue

AWS Glue - Cost
04:22

Setup IAM Role and policies to use with AWS Glue

Lab: AWS Glue - Security and Privileges Setup
10:34

Learn about the networking concepts and settings required for AWS Glue

AWS Glue - Advance Network Configuration
09:33

Configure network settings for AWS Glue

Lab: AWS Glue - Advance Network Configuration
05:24
+ AWS Glue - Database Objects
5 lectures 30:54

Section Agenda

Preview 01:13

Learn about the concept of Data Catalog in AWS Glue

AWS Glue - Data Catalog
04:46

Learn to develop databases in AWS Glue

Lab: AWS Glue - Databases
07:36

Learn to develop tables in AWS Glue

AWS Glue - Tables
06:04

Develop tables manually in AWS Glue

AWS Glue - Designing Tables
11:15
+ AWS Glue - Crawlers
10 lectures 01:15:42

Section Agenda

Preview 01:44

Learn about the concept of Crawler in AWS Glue

AWS Glue - Introduction to Crawlers
09:26

Learn about the concept of classifiers in AWS Glue

Lab - Introduction to AWS Glue Classifiers
04:17

Develop crawlers in AWS Glue - Lab 1

Lab 1 - AWS Glue - Developing Data Catalog with Crawlers
18:00

Develop crawlers in AWS Glue - Lab 2

Lab 2 - AWS Glue - Developing Data Catalog with Crawlers
02:52

Develop crawlers in AWS Glue - Lab 3

Lab 3 - AWS Glue - Developing Data Catalog with Crawlers
10:41

Develop crawlers in AWS Glue - Lab 4

Lab 4 - AWS Glue - Developing Data Catalog with Crawlers
07:36

Develop crawlers in AWS Glue - Lab 5

Lab 5 - AWS Glue - Developing Data Catalog with Crawlers
08:09

Develop crawlers in AWS Glue - Lab 6

Lab 6 - AWS Glue - Developing Data Catalog with Crawlers
05:22

Develop crawlers in AWS Glue - Lab 7

Lab 7 - AWS Glue - Developing Data Catalog with Crawlers
07:35
+ AWS Glue - ETL Jobs
10 lectures 01:15:36

Section Agenda

Preview 01:05

Learn to develop serverless ETL jobs with AWS Glue

Introduction to AWS Glue Jobs
06:22

Learn to develop serverless ETL jobs with AWS Glue

Lab 1 - Developing AWS Glue Jobs
18:40

Learn about different ETL job properties in AWS Glue

AWS Glue Job Properties
05:25

Learn to develop serverless ETL jobs with AWS Glue

Lab 2 - Developing AWS Glue Jobs
02:26

Learn to develop serverless ETL jobs with AWS Glue with Redshift as data source

Lab 3 - Assignment : Importing Data from Redshift
04:03

Learn to develop serverless ETL jobs with AWS Glue

Lab 4 - Developing AWS Glue Jobs
05:45

Learn to develop Python scripts and properties for serverless ETL jobs using AWS Glue

AWS Glue Job Scripts and Properties
04:54

Learn to develop Python scripts and properties for serverless ETL jobs using AWS Glue

Lab 5 - Developing AWS Glue Jobs
13:33

Learn about built-in ETL Transformations in AWS Glue

AWS Glue - Built-in ETL Transformations and Job Bookmarks
13:23
+ AWS Glue - Triggers
4 lectures 18:52

Section Agenda

Preview 01:08

Learn about Triggers in AWS Glue

Introduction to AWS Glue Triggers
04:58

Learn about Triggers in AWS Glue

Lab 1 - Developing AWS Glue Triggers
07:25

Learn about Triggers in AWS Glue

Lab 2 - Developing AWS Glue Triggers
05:21
+ AWS Glue - Dev Ops Setup
6 lectures 31:37

Section Agenda

Preview 01:21

Learn about AWS Glue Development Endpoints

Lab: Creating a AWS Glue Development Endpoint
07:32

Learn to install and setup Apache Zeppelin

Lab: Installing and configuring Apache Zeppelin
06:58

Learn to install Git and setup Port Forwarding

Lab: Port Forwarding Configuration
06:13

Learn to integrate AWS Glue Development Endpoint with Apache Zeppelin Notebook

Lab: Integrating AWS Glue Development Endpoint with Apache Zeppelin
05:22

Learn monitoring options available for AWS Glue

AWS Glue Monitoring
04:11
Requirements
  • Basic knowledge of database and data warehouse concepts
  • Working knowledge of AWS Concepts and Tools like AWS Console, S3, VPC, Security Group, AZ, IAM, Role, Policy etc
  • Basic working knowledge of any SQL style query language
  • Working knowledge of Redshift would be an advantage, but is not mandatory. Course covers Redshift cluster development
  • Course includes demo of all the labs. An AWS Account would be required to try labs hands-on.
Description

PS:

  1. Please do NOT join the course if you do NOT have any basic working knowledge of AWS Console and AWS Services like S3, IAM, VPC, Security Groups etc. AWS Beginners may struggle understanding some of the topics.

  2. Course explains all the labs. If you want to practice labs, it would require AWS Account and may cost $$.

  3. Basic working knowledge of Redshift is recommended, but not a must.

  4. This course has been designed for intermediate and expert AWS Developers / Architects / Administrators.

  5. Course covers each and every feature that AWS has released since 2018 for AWS Glue, AWS QuickSight, AWS Athena, and Amazon Redshift Spectrum, and it regularly updated with every new feature released for these services.

Serverless is the future of cloud computing and AWS is continuously launching new services on Serverless paradigm. AWS launched Athena and QuickSight in Nov 2016, Redshift Spectrum in Apr 2017, and Glue in Aug 2017. Data and Analytics on AWS platform is evolving and gradually transforming to serverless mode.


Businesses have always wanted to manage less infrastructure and more solutions. Big data challenges are continuously challenging the infrastructure boundaries. Having Serverless Storage, Serverless ETL, Serverless Analytics, and Serverless Reporting, all on one cloud platform had sounded too good to be true for a very long time. But now its a reality on AWS platform. AWS is the only cloud provider that has all the native serverless components for a true Serverless Data Lake Analytics solution.


It's not a secret that when a technology is new in the industry, professionals with expertise in new technologies command great salaries. Serverless is the future, Serverless is the industry demand, and Serverless is new. It's the perfect time and opportunity to jump into Serverless Analytics on AWS Platform.


In this course, we would learn the following:

1) We will start with Basics on Serverless Computing and Basics of Data Lake Architecture on AWS.

2) We will learn Schema Discovery, ETL, Scheduling, and Tools integration using Serverless AWS Glue Engine built on Spark environment.

3) We will learn to develop a centralized Data Catalogue too using Serverless AWS Glue Engine.

4) We will learn to query data lake using Serverless Athena Engine build on the top of Presto and Hive.

5) We will learn to bridge the data warehouse and data lake using Serverless Amazon Redshift Spectrum Engine built on the top of Amazon Redshift platform.

6) We will learn to develop reports and dashboards, with a powerpoint like slideshow feature, and mobile support, without building any report server, by using Serverless Amazon QuickSight Reporting Engines.

7) We will finally learn how to source data from data warehouse, data lake, join data, apply row security, drill-down, drill-through and other data functions using the Serverless Amazon QuickSight Reporting Engines.


This course understands your time is important, and so the course is designed to be laser-sharp on lecture timings, where all the trivial details are kept at a minimum and focus is kept on core content for experienced AWS Developers / Architects / Administrators. By the end of this course, you can feel assured and confident that you are future-proof for the next change and disruption sweeping the cloud industry.

I am very passionate about AWS Serverless computing on Data and Analytics platform, and am covering A-to-Z of all the topics discussed in this course.

So if you are excited and ready to get trained on AWS Serverless Analytics platform, I am ready to welcome you in my class !

Who this course is for:
  • Anyone who wants to learn AWS Serverless technologies for data and analytics should take this course
  • Data Professionals seeking to learn Serverless Storage, Serverless ETL, Serverless Data Analysis and Serverless Reporting should take this course