Data Engineering Bootcamp

Rating: 4.5 (289 reviews)

Learn Data Engineering basics: data architecture, ETL vs ELT, cloud pipelines, and workflow orchestration

Created by Navid Shirzadi, PhD, P.Eng.

Last updated 5/2026

English

What you'll learn

Understand core data engineering concepts
Apply data architecture principles
Build cloud-based data pipelines
Process and transform data for analytics
Orchestrate and automate workflows
Complete an end-to-end data engineering project

Course content

8 sections • 50 lectures • 9h 20m total length

Course Content (6:14)
Master end-to-end data engineering with a hands-on ETL project, learning ETL vs ELT, data ingestion with AWS S3, data lake vs data warehouse, and orchestration with Prefect and Docker.
Course Information (3:34)
Explore data engineering fundamentals in a structured bootcamp, coding along in Python from requirements and installation to basic concepts and a final exercise, with practical debugging steps.
Source Files (1:03)

Data Ingestion - Cloud Connection (15:24)
Data Ingestion - Define the Data Path (6:40)
Data Ingestion - Get the List of Data Files (6:12)
Data Ingestion - Upload Data (21:30)
Upload CSV files to a structured S3 raw data folder in the data lake, track progress with a counter, handle errors without stopping the pipeline, and report final upload results.
Data Ingestion - Verification (16:05)
Data Ingestion - Execution, Debugging, Evaluation (6:33)
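
The upload step described above (progress counter, errors that do not stop the run, final report) could be sketched roughly as follows. This is not the course's code: the names upload_all, make_s3_uploader, and the "raw/" prefix are assumptions for illustration. The uploader is passed in as a function so the loop can be tried without AWS credentials; boto3 is only imported if the S3 uploader is actually built.

```python
from pathlib import Path
from typing import Callable, Iterable


def upload_all(
    files: Iterable[Path],
    upload_one: Callable[[Path, str], None],
    prefix: str = "raw/",  # hypothetical raw-data folder in the data lake
) -> dict:
    """Upload every file, counting successes and collecting failures."""
    files = list(files)
    uploaded = 0
    failed = []
    for index, path in enumerate(files, start=1):
        key = f"{prefix}{path.name}"
        try:
            upload_one(path, key)
            uploaded += 1
            print(f"[{index}/{len(files)}] uploaded {path.name} -> {key}")
        except Exception as exc:
            # Record the failure and continue with the next file.
            failed.append((path.name, str(exc)))
            print(f"[{index}/{len(files)}] FAILED {path.name}: {exc}")
    print(f"Done: {uploaded} uploaded, {len(failed)} failed")
    return {"uploaded": uploaded, "failed": failed}


def make_s3_uploader(bucket: str) -> Callable[[Path, str], None]:
    """Build an uploader backed by boto3 (assumes AWS credentials are configured)."""
    import boto3  # imported lazily so upload_all can be tested without boto3

    client = boto3.client("s3")

    def upload_one(path: Path, key: str) -> None:
        client.upload_file(str(path), bucket, key)

    return upload_one
```

With real credentials, a call might look like upload_all(Path("data").glob("*.csv"), make_s3_uploader("my-bucket")), where both the folder and the bucket name are placeholders.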

Introduction (5:30)
Explore data processing in a data engineering bootcamp by building ETL pipelines and comparing ETL with ELT, using Python for local extract, transform, and load from an AWS data lake.
ETL Pipeline - Structure (5:58)
ETL Pipeline - Extracting Function (24:07)
Build an ETL pipeline extract function that downloads CSV files from S3, loads them into DataFrames, and organizes them in a datasets dictionary for later transformation.
ETL Pipeline - Transforming Part 1 (25:20)
ETL Pipeline - Transforming Part 2 (13:59)
ETL Pipeline - Transforming Part 3 (17:05)
ETL Pipeline - Transforming Part 4 (11:26)
ETL Pipeline - Business Metrics Part 1 (23:48)
Exercise - Product Performance Metric (1:55)
Exercise Solution (5:52)
Exercise - Sales Revenue Metric (1:35)
Exercise Solution (5:03)
ETL Pipeline - Loading to S3 Part 1 (15:45)
ETL Pipeline - Loading to S3 Part 2 (9:24)
ETL Pipeline - Final Execution (15:54)
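
The extracting function described above (download CSV files, load them, collect them in a dictionary keyed by dataset name) might look something like the sketch below. It is an illustration under assumptions, not the course's implementation: extract, make_s3_fetcher, and the key layout are hypothetical, and csv.DictReader from the standard library stands in for pandas so the sketch has no third-party dependencies. With pandas installed, pd.read_csv(io.StringIO(text)) would produce a DataFrame instead of a list of rows.

```python
import csv
import io
from pathlib import PurePosixPath
from typing import Callable, Dict, Iterable, List


def extract(
    keys: Iterable[str],
    fetch_text: Callable[[str], str],
) -> Dict[str, List[dict]]:
    """Fetch each CSV object and store its rows under the file's base name."""
    datasets: Dict[str, List[dict]] = {}
    for key in keys:
        name = PurePosixPath(key).stem  # "raw/orders.csv" -> "orders"
        text = fetch_text(key)
        # Each row becomes a dict of column name -> string value.
        datasets[name] = list(csv.DictReader(io.StringIO(text)))
        print(f"extracted {len(datasets[name])} rows from {key}")
    return datasets


def make_s3_fetcher(bucket: str) -> Callable[[str], str]:
    """Build a fetcher backed by boto3 (assumes AWS credentials are configured)."""
    import boto3  # imported lazily so extract can be tested without boto3

    client = boto3.client("s3")

    def fetch_text(key: str) -> str:
        body = client.get_object(Bucket=bucket, Key=key)["Body"]
        return body.read().decode("utf-8")

    return fetch_text
```

Passing the fetcher in as a parameter keeps the S3 access separate from the parsing, so the later transform steps can be developed against small in-memory CSV strings.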

Introduction (13:12)
Flow - Simple Example (11:57)
Explore how Prefect orchestrates a simple hello world function using flow decorators, automatic logging, and a dashboard to monitor flows, deployments, and tasks.
Task - Simple Example (11:36)
Deployment - Simple Example (15:01)
Work Pool - Simple Example (14:48)
Orchestration - Hands-On Project Part 1 (38:10)
Orchestration - Hands-On Project Part 2 (16:25)
Build and deploy a data orchestration workflow by creating a work pool, defining workers and deployments, and automating ETL tasks with a Prefect flow and YAML configuration.
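
As a rough idea of how flow and task decorators wrap ETL steps, here is a minimal sketch. The function names and the toy data are made up for illustration and are not taken from the course. It uses Prefect's flow and task decorators when Prefect is installed; otherwise it falls back to pass-through decorators (an assumption added here purely so the control flow can still be run and read). Work pools, workers, and the YAML deployment configuration are deliberately left out.

```python
try:
    from prefect import flow, task
except ImportError:
    # Fallback for environments without Prefect: decorators that do nothing.
    def _passthrough(fn=None, **_options):
        if fn is None:
            return lambda inner: inner
        return fn

    flow = task = _passthrough


@task(retries=2)  # with Prefect, a failing extract is retried up to twice
def extract() -> list:
    return [3, 1, 2]  # toy data standing in for files read from the data lake


@task
def transform(values: list) -> list:
    return sorted(values)


@task
def load(values: list) -> int:
    print(f"loading {len(values)} values: {values}")
    return len(values)


@flow(log_prints=True)  # with Prefect, print output is captured in the flow logs
def etl_flow() -> int:
    """Run the three ETL steps in order and return the number of loaded values."""
    return load(transform(extract()))


if __name__ == "__main__":
    etl_flow()
```

Calling etl_flow() directly runs the steps in the current process; scheduling and remote execution are what deployments and work pools add on top.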

Requirements

No prior data engineering experience required
Basic programming knowledge
Fundamental understanding of data

Description

Data is the new oil—but without the right systems to collect, store, and process it, data quickly becomes unusable. That’s where data engineering comes in. This Data Engineering Bootcamp is designed to take you from foundational concepts to a complete, hands-on project where you’ll build and deploy an end-to-end data pipeline.

We’ll start with the basics of data engineering, exploring what it is, how it differs from roles like analysts and scientists, and why it’s such a critical skill in today’s data-driven world. You’ll learn about the data engineering workflow, data roles, and real-world scenarios through interactive quizzes and activities.

Next, we’ll dive into data architecture—comparing traditional vs. modern approaches, understanding data storage paradigms, and exploring ETL vs. ELT and batch vs. streaming pipelines. You’ll put your knowledge into practice with worksheets and design exercises that reinforce key concepts.

The highlight of the course is the hands-on project, where you’ll:

Ingest raw data into an AWS S3 data lake
Process and transform datasets for analytics
Organize and store results in multiple formats
Orchestrate workflows with Prefect for automation, scheduling, and monitoring

By the end of this course, you’ll not only understand the theory but also gain practical, job-ready experience in building cloud-based data pipelines. Whether you’re an aspiring data engineer, a data analyst looking to level up, or a career changer entering the data field, this bootcamp will give you the confidence and skills to succeed.

Who this course is for:

Aspiring data engineers who want to break into the field and learn modern tools and workflows from scratch.
Data analysts or scientists who want to strengthen their knowledge of data pipelines, architecture, and cloud data workflows.
Software engineers or IT professionals looking to transition into data engineering roles.
Students and career changers eager to gain hands-on experience with real-world projects in data engineering.


Course content

Introduction • 3 lectures • 11min
Introduction to Data Engineering • 6 lectures • 37min
Required Installations • 4 lectures • 24min
Hands-On Project Introduction • 6 lectures • 1hr 8min
Hands-On Project - Data Ingestion • 6 lectures • 1hr 12min
Hands-On Project - ETL Pipeline Development • 15 lectures • 3hr 3min
Hands-On Project - Orchestration • 7 lectures • 2hr 1min
Hands-On Project - Containerization • 3 lectures • 44min
