
Get a clear overview of the course, the pipeline you’ll build, and how Declarative Pipelines and Lakeflow change the way you work in Databricks.
Set up your Databricks environment, get familiar with the interface, and explore the raw e-commerce dataset using SQL and basic visualizations.
Understand how Delta Lake works and why features like ACID transactions, schema enforcement, and time travel are critical for reliable pipelines.
Learn how to organize data using catalogs, schemas, and tables, and understand governance, access control, and data lineage in Databricks.
Learn the core idea behind Spark Declarative Pipelines and how the Bronze → Silver → Gold structure organizes your data workflows.
Ingest raw data into a Bronze table using declarative pipelines while preserving the original structure for traceability.
Clean and transform the data by fixing data types, standardizing fields, handling null values, and preparing it for downstream use.
Create analytics-ready tables with business metrics like revenue and customer activity using aggregations on the refined data.
Run and schedule your declarative pipeline in Lakeflow Designer and verify how data flows through Bronze, Silver, and Gold.
Start building a new pipeline visually in Lakeflow Designer using a canvas-based approach and guided configuration.
Extend the visual pipeline by adding transformations, combining datasets, and generating the underlying SQL logic.
Understand how streaming works in declarative pipelines and the difference between triggered and continuous execution modes.
Set up a streaming data source by connecting Databricks to an AWS message queue and configuring ingestion.
Build a streaming pipeline that ingests real-time data into Bronze and processes it into Silver with basic transformations.
Recap the key concepts from the course and understand how to apply declarative pipelines in real-world projects.
Building data pipelines in Databricks used to mean a lot of notebook logic, Spark code, and manual orchestration.
But with Spark Declarative Pipelines and Lakeflow Designer, this changes.
In this course, you’ll learn how to build end-to-end data pipelines by defining what you want with SQL, while Databricks handles the execution, dependencies, and orchestration for you.
You’ll build a complete Bronze → Silver → Gold pipeline using a real e-commerce dataset. Starting from raw data, you’ll ingest, clean, transform, and aggregate it into analytics-ready tables.
Along the way, you’ll work with Delta Lake to ensure reliability and reproducibility, and use Unity Catalog to organize and govern your data.
Once the pipeline is built, you’ll schedule, run, and monitor it, turning it into a real, operational workflow.
Then, we will go one step further.
With Lakeflow Designer and Genie, you’ll learn how pipelines can be built visually, almost no-code, while still generating the same underlying logic.
As a bonus, we will also explore streaming pipelines using AWS Kinesis so you understand how the same declarative model works for near real-time data.
By the end of this course, you’ll understand a modern way of building data pipelines in Databricks, from raw data to production-ready workflows.