AWS Data Engineer Bootcamp: The Complete Guide

Rating: 4.3 (35 reviews)

Master the AWS Data Engineer course with real-time projects

Bestseller

Created by Manish Tiwari

Last updated 3/2026

English

What you'll learn

Master the core concepts of AWS Data Engineering and understand how modern data platforms are built on AWS
Design and build end-to-end data pipelines on AWS from data ingestion to analytics
Create scalable Data Lakes using Amazon S3 for storing structured and semi-structured data
Perform ETL (Extract, Transform, Load) using AWS Glue with real-world practical examples
Build serverless data pipelines using AWS Lambda
Query datasets using Amazon Athena without managing servers
Build event-driven architectures using Amazon SNS
Monitor and troubleshoot pipelines using AWS CloudWatch
Gain hands-on experience with AWS services used by professional Data Engineers

Course content

8 sections • 225 lectures • 29h 22m total length

Introduction • 8:02

INTRODUCTION • 10:10
CREATE AWS FREE ACCOUNT • 9:02
Amazon S3 Introduction • 5:57
AWS S3 Benefits & Use Case • 9:09
AWS S3 PRACTICAL LAB • 8:23
AWS S3 Storage Classes • 11:13
AWS S3 Practical VERSIONING & Access Policy • 8:00
AWS IAM INTRODUCTION - ROLE POLICY USER GROUP • 10:03
AWS IAM - CREATE USERS, GROUP & POLICY • 9:33
AWS IAM Role • 5:24
Enable Multi Factor Authentication on AWS • 3:18
Enable multi-factor authentication on AWS by registering a virtual MFA device, installing an authenticator app, scanning a QR code, and using MFA codes to log in securely.
AWS Lambda Introduction • 9:45
AWS Lambda Practical Lab -1 - First Lambda Function • 9:17
AWS Lambda Trigger Lab • 9:13
AWS Data Engineering Project 1 (Lambda + S3) • 15:19
AWS Glue Introduction • 12:07
AWS Crawler & Data Catalog • 14:23
AWS Glue Practical Lab-1 (TXT TO JSON) • 16:57
AWS Glue Practical Lab - 2 - Transformation • 10:45
AWS Glue Practical - Filter Transformation • 8:17
Use AWS Glue to read data from S3, apply a filter where payment status equals success, and write the result to S3 in Parquet format.
AWS GLUE - JOB BOOKMARK • 4:33
AWS GLUE - JOB BOOKMARK Practical | Incremental Load • 11:18
AWS GLUE - SQL QUERY TRANSFORMATION • 6:10
AWS GLUE - CHANGE SCHEMA TRANSFORMATION • 5:51
AWS GLUE - CONDITIONAL ROUTER TRANSFORMATION • 7:27
AWS GLUE - AGGREGATE TRANSFORMATION • 4:35
Learn how to use AWS Glue aggregate transformation to group data by country and compute sums, averages, maximum values, and counts, with a practical CSV example.
AWS GLUE - Derived Column Transformation • 11:31
Master the derived column transformation in AWS Glue to modify existing columns or derive new ones with expressions. Apply load date, year, month, and case-based logic in your ETL pipelines.
AWS GLUE - Concatenate Transformation • 2:38
AWS GLUE - EXPLODE ARRAY Transformation • 4:36
AWS GLUE - Flatten Transformation • 2:58
AWS Athena Introduction • 11:42
AWS ATHENA Practical - Lab (AWS Glue Catalog) • 9:28
Create an S3 bucket, upload an employee CSV, and build a Glue catalog using a crawler. Then query the data in AWS Athena and set a query output location.
AWS Athena - Query S3 without Crawler • 7:03
EC2 - Introduction • 9:59
EC2 (Create Windows) • 9:55
EC2 Type And Linux Machine • 4:39
Launch and connect to an Ubuntu Linux EC2 instance, configure the free tier, set up a key pair and network, and practice basic Linux commands.
EC2 Instance Type • 3:56
AWS SNS (Simple Notification Service) • 13:24
AWS SNS Practical • 6:24
AWS SNS Practical - Send Notification based on S3 file upload • 7:08
Set up an S3 bucket to trigger an SNS notification on file upload, create an SNS topic with an email subscription, and verify email delivery of the notification.
AWS CLOUDWATCH - INTRODUCTION • 9:59
AWS CLOUDWATCH - DEMO • 7:44
AWS EMR - Elastic MapReduce • 9:59
AWS EMR CLUSTER CREATION - PRACTICAL • 18:07
AWS EMR STUDIO SETUP - ATTACH TO NOTEBOOK PRACTICAL • 5:33
AWS RDS - Introduction • 7:18
AWS RDS CREATION - SQL SERVER SETUP PRACTICAL • 13:41
AWS Kinesis • 5:54
AWS Kinesis Lab-1 • 7:00
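
The S3-to-SNS lab listed above (an upload to a bucket triggers an SNS email notification) comes down to attaching a notification configuration to the bucket. The following is a minimal sketch of what that configuration could look like, not the course's own code. The topic ARN, the helper name `build_notification_config`, and the `.csv` suffix filter are illustrative assumptions.

```python
import json

# Hypothetical placeholder: substitute the ARN of your own SNS topic.
TOPIC_ARN = "arn:aws:sns:us-east-1:123456789012:file-upload-topic"


def build_notification_config(topic_arn, suffix=None):
    """Build an S3 bucket notification configuration that publishes
    to an SNS topic whenever an object is created in the bucket."""
    topic_config = {
        "TopicArn": topic_arn,
        "Events": ["s3:ObjectCreated:*"],
    }
    if suffix:
        # Optional: only notify for keys ending in the given suffix.
        topic_config["Filter"] = {
            "Key": {"FilterRules": [{"Name": "suffix", "Value": suffix}]}
        }
    return {"TopicConfigurations": [topic_config]}


config = build_notification_config(TOPIC_ARN, suffix=".csv")
print(json.dumps(config, indent=2))
```

With boto3, a dictionary of this shape is what the S3 client's `put_bucket_notification_configuration` call accepts as `NotificationConfiguration`. The SNS topic's access policy must also allow the S3 service to publish to it, otherwise S3 rejects the configuration.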

What is Databricks • 8:48
Databricks Advantage • 5:06
Databricks Free Edition • 2:35
Databricks Overview • 8:18
Lakehouse in Databricks • 10:41
The Databricks Lakehouse unifies the data lake and the data warehouse in a single open-format platform, enabling ACID transactions, schema enforcement, time travel, and fast analytics on all data types.
Unity Catalog in Databricks • 12:54
Create Unity Catalog in Databricks • 3:15
Managed vs External Tables in Databricks • 9:47
Explains the difference between managed and external tables in Databricks: for a managed table, Databricks manages both the metadata and the data; for an external table, the data stays in your own cloud storage and only the metadata is managed by Databricks. Also covers how DROP behaves for each.
Volumes in Databricks • 8:04
Create & Upload Volumes in Databricks • 8:01
PySpark Setup & Intro in Databricks • 5:57
Set up a PySpark workspace in Databricks, create a catalog, database, and volume, upload a CSV, and prepare a notebook to practice reading files and building a DataFrame.
Create DataFrame Using PySpark • 9:40
Create DataFrame Using JSON Or Parquet • 5:58
Select Transformation - PYSPARK • 4:52
With Column & With Column Renamed - PYSPARK • 8:20
Filter Data - PYSPARK • 9:13
Distinct vs DropDuplicates - PYSPARK • 4:58
Sort & OrderBy - PYSPARK • 3:57
GroupBy - PYSPARK • 4:35
Learn to use the PySpark group by transformation to group rows by state and apply aggregations such as sum, average, max, min, and count to calculate totals and statistics.
Join - PYSPARK • 12:19
Union - PYSPARK • 3:26
How to Handle Null Values - PYSPARK • 6:41
Collect() in PySpark • 0:51
Struct Type & Struct Field - PYSPARK • 6:12
Pivot & Unpivot - PYSPARK • 9:00
UDF - PYSPARK • 8:56
Temp View - PYSPARK • 7:07
Window Function - PYSPARK • 15:38
PartitionBy, Repartition And Coalesce - PYSPARK • 11:09
Date Format - PYSPARK • 5:03
Different Date Functions • 11:28
Explode Function - PYSPARK • 5:30
Databricks Files - PYSPARK • 8:14
Lakehouse Federation - PYSPARK • 5:04
Foreign Catalog & Tables - PYSPARK • 4:32
Learn to connect to external data sources by configuring a catalog, creating a foreign connection catalog, and establishing credentials for MySQL or other systems, then query external tables via Unity Catalog.
Databricks SQL - PYSPARK • 8:47
Databricks SQL Overview - PYSPARK • 9:56
Databricks SQL Parameters - PYSPARK • 8:19
Databricks SQL Snippets - PYSPARK • 8:39
Databricks SQL Revision (Join, ROW_NUMBER, CTE) - PYSPARK • 10:02
Scheduling - Query • 4:33
Monitoring - Query • 4:48
Databricks Cache - Query Performance • 2:41
Databricks Alert • 8:00
Dashboard & Visualization - DATABRICKS • 10:45
Genie in Databricks • 8:05
Lakehouse Jobs in Databricks • 2:50
Discover how lakehouse jobs in Databricks automate notebooks, SQL queries, and pipelines, scheduling tasks at specific times and enabling ETL-like orchestration with monitoring and retries.
Lakehouse Jobs practical in Databricks • 18:42
If-Else Jobs Practical in Databricks • 8:02
For-Each Jobs Practical in Databricks • 3:58
Schedule and Trigger in Databricks • 3:10
Compute Jobs In Databricks • 0:59
Notification Jobs In Databricks • 1:31
Set up Databricks job notifications to alert on failures due to schema changes or cluster issues, with email or Microsoft Teams alerts for success, failure, or warnings.
Monitoring Jobs In Databricks • 3:24
Spark Structured Streaming • 6:19
AUTO LOADER INTRO - DATABRICKS • 7:04
AUTO LOADER - ARCHITECTURE • 10:40
AUTO LOADER - PRACTICAL INTRO • 2:00
AUTO LOADER - PRACTICAL SETUP • 4:17
AUTO LOADER - SCRIPT PRACTICAL • 8:26
AUTO LOADER - INCREMENTAL LOAD • 2:19
AUTO LOADER - SCHEMA HANDLE • 1:42
COPY INTO - Databricks • 3:33
COPY INTO PRACTICAL - Databricks • 7:43
Delta Live Tables - Databricks • 7:13
DLT - Problem It Solves • 4:54
DLT - ADVANTAGES • 2:24
KEY USE CASE - DLT • 4:14
DLT dataset - streaming table, view, materialized view • 4:35
DLT PRACTICAL OVERVIEW • 7:41
Create Streaming Table - DLT • 6:33
Materialized view - DLT • 3:09
DLT PRACTICAL - View • 2:07
DLT PRACTICAL - STREAMING, MATERIALIZED VIEW, VIEW • 12:53
SPARK SQL - 1 • 28:24
SPARK SQL - 2 (like, distinct, limit) • 4:30
SPARK SQL Joining & Union • 24:02
SPARK SQL - WINDOW FUNCTION - Rank, Dense Rank, Row number • 10:03
SPARK SQL - WINDOW FUNCTION - LEAD & LAG • 5:03
Databricks GitHub integration - CI CD • 5:58
Databricks GitHub Integration practical • 10:20
Databricks Branches Creation and Steps to follow in Production • 3:34
Run production Job in Databricks pointing to GitHub • 2:50

SQL Introduction • 4:50
What is a Database • 8:06
Explore what a database is, compare relational and non-relational databases, and learn SQL basics, plus how to create a database using SQL Server Management Studio.
Database Practical • 6:05
Insert Statement in SQL • 7:02
SQL Constraints • 5:58
Not Null & Unique Constraints • 5:12
Check & Default Constraints • 4:10
Primary Key in SQL • 4:51
Foreign Key in SQL • 4:41
Filter & Sort in SQL • 5:34
Master filtering and sorting data in SQL by using WHERE and ORDER BY clauses, filtering for salary above 70,000 and null locations to drive analysis.
Delete vs Drop vs Truncate in SQL • 5:53
Update in SQL • 4:24
Learn to perform SQL update operations: set null locations to India, set Siam's salary to 1 lakh (100,000), and update null salaries, using UPDATE, SET, and WHERE clauses.
Conditional Statement in SQL • 7:03
Aggregate Function in SQL • 3:22
Group by in SQL • 5:18
Like Operator in SQL • 4:50
Having Clause in SQL • 5:56
Top or Limit in SQL • 5:43
Distinct in SQL • 1:57
Coalesce in SQL • 6:27
Joining in SQL • 13:32
Union vs Union All in SQL • 6:14
Alter in SQL • 5:40
Window Function in SQL • 9:46
Rank, Dense Rank, Row Number in SQL • 10:02
Lead and Lag in SQL • 8:18
Explore LEAD and LAG window functions in SQL to access previous and next row values, using OVER and ORDER BY transaction date, with practical sales table examples.
CTE in SQL • 8:01
Views in SQL • 5:55
Learn how views in SQL act as virtual tables that generate data on demand, simplify complex queries, secure data by limiting columns, and maintain consistency with underlying data.
Stored Procedure in SQL • 10:24
Subquery in SQL • 5:47
Triggers in SQL • 8:12
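
Several of the SQL lectures above describe concrete queries: filtering on salary with WHERE and ORDER BY, updating null locations, and reading previous and next rows with LAG and LEAD. The sketch below tries them out with Python's built-in `sqlite3` module. The course itself uses SQL Server Management Studio, and SQLite is swapped in here only so the example runs without a server. Table names, column names, and sample rows are invented for illustration. Window functions need SQLite 3.25 or newer, which recent Python builds normally bundle.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
cur = conn.cursor()

# Hypothetical sample data for illustration only.
cur.execute("CREATE TABLE employees (name TEXT, salary INTEGER, location TEXT)")
cur.executemany(
    "INSERT INTO employees VALUES (?, ?, ?)",
    [("Asha", 90000, "Pune"), ("Ravi", 65000, None), ("Meera", 80000, None)],
)

# Filter and sort: salaries above 70,000, highest first.
high_paid = cur.execute(
    "SELECT name, salary FROM employees WHERE salary > 70000 ORDER BY salary DESC"
).fetchall()

# Update: fill in missing locations using UPDATE, SET, and WHERE.
cur.execute("UPDATE employees SET location = 'India' WHERE location IS NULL")
locations = cur.execute(
    "SELECT name, location FROM employees ORDER BY name"
).fetchall()

# LAG and LEAD: previous and next amount, ordered by transaction date.
cur.execute("CREATE TABLE sales (transaction_date TEXT, amount INTEGER)")
cur.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("2024-01-01", 100), ("2024-01-02", 150), ("2024-01-03", 120)],
)
lead_lag = cur.execute(
    """
    SELECT transaction_date,
           amount,
           LAG(amount)  OVER (ORDER BY transaction_date) AS previous_amount,
           LEAD(amount) OVER (ORDER BY transaction_date) AS next_amount
    FROM sales
    ORDER BY transaction_date
    """
).fetchall()

print(high_paid)
print(locations)
for row in lead_lag:
    print(row)

conn.close()
```

The first and last rows of the LAG/LEAD result contain NULL (Python `None`) because there is no previous or next row to read from.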

Introduction To Python Playlist • 4:41
What is Python and Its Features • 7:17
Python Data Type And Variables • 3:56
Python Input() Function • 6:26
Python Operators • 5:05
Python If Else Statements • 14:59
Python For Loop • 10:38
While Loop in Python • 4:44
List in Python • 8:56
List Method in Python • 8:25
List Coding Problem -1 (lowest & highest) • 4:22
List Coding Problem -2 (swap two elements in list) • 3:25
List Coding Problem -3 (reverse list) • 2:24
Learn how to reverse a Python list using built-in methods and without them, including using list.reverse and a for loop, with practical examples.
List Coding Problem -4 (sum and average of given list) • 2:46
List Coding Problem -5 (even and odd from list) • 2:07
List Coding Problem -6 (remove duplicates from list) • 1:21
List Comprehension • 6:21
Tuples in Python • 10:07
Sets in Python • 8:14
Dictionary in Python • 10:31
String in Python • 8:54
String Coding Questions in Python • 7:04
Functions in Python • 11:35
Args and Kwargs in Python • 5:26
Lambda in Python • 4:54
OOPS in Python • 8:09
Map, Filter and Reduce in Python • 10:43
Constructor and Method in Python • 8:35
Inheritance in Python • 5:52
Encapsulation in Python • 6:47
Iterable vs Iterator • 4:32
Module in Python • 8:29
OS Module in Python • 6:58
Date and Datetime in Python • 8:21
Try-Except in Python • 6:42
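
The list-reversal problem above mentions two approaches: the built-in `list.reverse` method and a plain for loop. A minimal sketch of both follows. The function name `reverse_with_loop` and the sample list are illustrative, not taken from the course.

```python
def reverse_with_loop(items):
    """Return a new list with the elements of `items` in reverse order,
    walking the indexes from last to first instead of calling reverse()."""
    reversed_items = []
    for index in range(len(items) - 1, -1, -1):
        reversed_items.append(items[index])
    return reversed_items


numbers = [1, 2, 3, 4]

# Loop-based version: builds a new list, the original is untouched.
looped = reverse_with_loop(numbers)

# Built-in version: list.reverse() reverses in place and returns None,
# so work on a copy when the original order is still needed.
in_place = list(numbers)
in_place.reverse()

print(looped)
print(in_place)
print(numbers)
```

The difference worth remembering in an interview is that `list.reverse()` mutates the list it is called on, while the loop version here leaves its input unchanged.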

DATA ENGINEER INTERVIEW - 1 • 14:54
PYTHON DATA ENGINEER INTERVIEW • 14:10
TOP SQL DATA ENGINEER INTERVIEW • 17:38
TOP PYSPARK DATA ENGINEER INTERVIEW • 11:49
Master duplicate handling in PySpark by removing duplicates with distinct, and identifying duplicates via group by and count on customer id and transaction id.
TOP PYTHON CODING DATA ENGINEER INTERVIEW • 11:27
Practice solving common Python coding questions for data engineer interviews, including finding the second largest element, reversing numbers, finding missing numbers, reversing words, checking anagrams and palindromes, and merging lists.
TOP SQL DATA ENGINEER INTERVIEW - 2 • 9:48
PYSPARK CODING INTERVIEW QUESTION - ANSWER • 8:45
PYTHON CODING - SORTING, DUPLICATE • 6:06
Solve Python coding interview questions: remove duplicates from a list by building a unique result, and sort a list without built-in functions using a swap-based loop.
PYTHON CODING - TWO SUM • 5:14
PYTHON CODING - MOVE ZERO TO END, SECOND LARGEST • 6:44
PYTHON CODING - GROUP ANAGRAM • 5:09
PYTHON CODING - FIND AVG MOVING RATE • 6:15
SQL INTERVIEW QUESTION - STOCK PRICE • 12:05
DATA ENGINEER INTERVIEW - SQL LEAST FARE • 8:01
SQL - JOINING INTERVIEW • 7:01
STAR SCHEMA VS SNOWFLAKE SCHEMA • 9:40
DATA WAREHOUSE VS DATA LAKE • 8:31
Compare data warehouse and data lake to understand structured data versus unstructured and semi-structured data, ETL vs ELT processing, and how each supports reporting and raw data storage.
NORMALIZATION VS DENORMALIZATION • 9:01
Normalization reduces redundancy by splitting data into multiple tables, while denormalization stores data in a single table with duplication to speed up queries.
DIFFERENT FILE FORMATS • 11:45
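
Three of the Python interview problems named above (removing duplicates by building a unique result, sorting with a swap-based loop, and finding the second largest element) can be sketched as follows. These are illustrative solutions under assumed function names, not the instructor's own answers. The swap-based sort shown is a bubble sort, chosen here as one common reading of "swap-based loop".

```python
def remove_duplicates(items):
    """Keep the first occurrence of each value, preserving order."""
    unique = []
    for item in items:
        if item not in unique:
            unique.append(item)
    return unique


def swap_sort(items):
    """Sort ascending without sorted() or list.sort(), using a bubble
    sort: repeatedly swap adjacent elements that are out of order."""
    result = list(items)  # copy so the caller's list is not modified
    n = len(result)
    for i in range(n):
        for j in range(0, n - 1 - i):
            if result[j] > result[j + 1]:
                result[j], result[j + 1] = result[j + 1], result[j]
    return result


def second_largest(items):
    """Return the second largest distinct value, or None if there is none."""
    largest = second = None
    for value in items:
        if largest is None or value > largest:
            largest, second = value, largest
        elif value != largest and (second is None or value > second):
            second = value
    return second


print(remove_duplicates([3, 1, 3, 2, 1]))
print(swap_sort([5, 2, 9, 1]))
print(second_largest([5, 1, 5, 3]))
```

`remove_duplicates` is quadratic because of the `in` check on a list; for hashable values, tracking seen items in a set gives the same result in linear time.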

Requirements

Basic understanding of computers and the internet
No prior AWS experience is required — everything will be explained from scratch
A computer with an internet connection to access AWS services and practice the labs

Description

Master AWS Data Engineering – Build Real World Data Pipelines on AWS

Become a professional AWS Data Engineer by mastering the most important AWS data engineering services used by companies worldwide. This course is designed to help you build real-world data pipelines, data lakes, ETL workflows, streaming pipelines, and analytics solutions using AWS.

If you want to become an AWS Data Engineer, Cloud Data Engineer, Big Data Engineer, or prepare for AWS Data Analytics and AWS Data Engineer roles, this course will give you practical hands-on experience with the most important AWS services.

You will learn how to design and build end-to-end data engineering pipelines using AWS services like S3, Glue, Lambda, Kinesis, Redshift, Athena, EMR, SNS, CloudWatch and more.

This course focuses heavily on real-world projects and hands-on labs, so you will gain the practical skills needed to work as an AWS Data Engineer in production environments.

What You Will Learn

Build end-to-end AWS Data Engineering pipelines
Create Data Lakes using Amazon S3
Perform ETL using AWS Glue
Process big data using Amazon EMR
Build real-time streaming pipelines using Amazon Kinesis
Run serverless data pipelines using AWS Lambda
Query data using Amazon Athena
Build Data Warehouses using Amazon Redshift
Implement event-driven architectures using SNS and SQS
Monitor pipelines using AWS CloudWatch
Design production-grade AWS data architecture
Understand best practices for AWS Data Engineering
Work with structured and semi-structured data
Build batch and streaming data pipelines
Learn data lake architecture on AWS
Implement data ingestion, transformation, and analytics

AWS Services Covered in this Course

This course covers the most important AWS services used in Data Engineering and Big Data pipelines.

Data Storage

Amazon S3
Data Lake Architecture

ETL & Data Processing

AWS Glue
AWS Lambda

Streaming Data

Amazon Kinesis

Big Data Processing

Amazon EMR
Spark on AWS

Data Analytics

Amazon Athena

Monitoring & Automation

AWS CloudWatch
Event Driven Pipelines

Messaging & Notifications

Amazon SNS

Real-World AWS Data Engineering

In this course you will build multiple real-world AWS data engineering projects such as:

• Build a Data Lake on AWS S3
• Create ETL pipelines using AWS Glue
• Query data using Amazon Athena
• Build serverless pipelines using AWS Lambda
• Create event-driven architectures using SNS
• Monitor pipelines using CloudWatch

These projects simulate real production scenarios used by modern data engineering teams.

Why Learn AWS Data Engineering?

Data Engineering is one of the highest-paying roles in cloud and big data.

Companies are rapidly moving their data platforms to AWS, and they need skilled AWS Data Engineers who can design scalable data lakes, ETL pipelines, and analytics systems.

By learning AWS Data Engineering, you can open career opportunities like:

AWS Data Engineer
Cloud Data Engineer
Big Data Engineer
Data Platform Engineer
Analytics Engineer

Who This Course is For

Data Engineers
Cloud Engineers
Software Engineers
Big Data Engineers
Python Developers
ETL Developers
Anyone who wants to become an AWS Data Engineer

Requirements

Basic understanding of:

SQL
Cloud concepts
Data engineering basics (helpful but not required)

No prior AWS experience is required — everything is explained from beginner to advanced level.

Who this course is for:

Aspiring AWS Data Engineers who want to build real-world data pipelines on AWS
Data Engineers who want to learn or upgrade their skills in AWS Data Engineering services
Cloud Engineers who want to specialize in Data Engineering on AWS
ETL Developers who want to transition into modern cloud data engineering

Course content

Introduction • 1 lecture • 8min

AWS DATA ENGINEER SERVICES • 49 lectures • 7hr 7min

COMPLETE DATABRICKS & PYSPARK • 83 lectures • 9hr 48min

COMPLETE SQL • 31 lectures • 3hr 19min

COMPLETE PYTHON • 35 lectures • 4hr

PROJECTS • 3 lectures • 1hr 25min

DATA ENGINEER INTERVIEW PREPARATION • 19 lectures • 3hr 4min

GITHUB - CI CD • 4 lectures • 32min
