Databricks: Spark Declarative Pipelines & Lakeflow Designer

Name: Databricks: Spark Declarative Pipelines & Lakeflow Designer
Rating: 4.2 (10 reviews)

Build end-to-end data pipelines in Databricks using SQL, Spark Declarative Pipelines, and Lakeflow Designer

New

Created byAndreas Kretz

Last updated 4/2026

English

What you'll learn

Build full data pipelines in Databricks using Spark Declarative Pipelines and SQL
Design structured Bronze → Silver → Gold (Medallion) pipelines from raw data to analytics-ready outputs
Understand how Databricks handles execution, dependencies, and orchestration automatically
Work with Delta Lake and Unity Catalog in a real pipeline setup
Schedule, monitor, and operate pipelines using Databricks Jobs and Lakeflow
Build and extend pipelines visually using Lakeflow Designer and Genie
Understand how streaming pipelines work with triggered and continuous modes

Course content

6 sections • 15 lectures • 1h 59m total length

Course Overview & What You Will Build2:50
Get a clear overview of the course, the pipeline you’ll build, and how Declarative Pipelines and Lakeflow change the way you work in Databricks.
Getting Started in Databricks & Exploring the Dataset13:15
Set up your Databricks environment, get familiar with the interface, and explore the raw e-commerce dataset using SQL and basic visualizations.

Introduction to Declarative Pipelines & Medallion Architecture5:30
Learn the core idea behind Spark Declarative Pipelines and how the Bronze → Silver → Gold structure organizes your data workflows.
Building the Bronze Layer11:11
Ingest raw data into a Bronze table using declarative pipelines while preserving the original structure for traceability.
Building the Silver Layer (Cleaning & Modeling)13:13
Clean and transform the data by fixing data types, standardizing fields, handling null values, and preparing it for downstream use.
Building the Gold Layer (Business Aggregations)7:29
Create analytics-ready tables with business metrics like revenue and customer activity using aggregations on the refined data.
Scheduling & Running the Chapter 3 Pipelines in Lakeflow Designer6:30
Run and schedule your declarative pipeline in Lakeflow Designer and verify how data flows through Bronze, Silver, and Gold.

Streaming Modes in Declarative Pipelines2:35
Understand how streaming works in declarative pipelines and the difference between triggered and continuous execution modes.
Connecting Databricks to an AWS Message Queue5:52
Set up a streaming data source by connecting Databricks to an AWS message queue and configuring ingestion.
Building a Simple Declarative Streaming Pipeline14:42
Build a streaming pipeline that ingests real-time data into Bronze and processes it into Silver with basic transformations.

Requirements

Basic SQL knowledge
Basic understanding of data concepts (tables, transformations, pipelines)
No prior Databricks experience required
No Spark knowledge required
A Databricks Free Edition account (sufficient for most of the course)

Description

Building data pipelines in Databricks used to mean a lot of notebook logic, Spark code, and manual orchestration.

But with Spark Declarative Pipelines and Lakeflow Designer, this changes.

In this course, you’ll learn how to build end-to-end data pipelines by defining what you want with SQL, while Databricks handles the execution, dependencies, and orchestration for you.

You’ll build a complete Bronze → Silver → Gold pipeline using a real e-commerce dataset. Starting from raw data, you’ll ingest, clean, transform, and aggregate it into analytics-ready tables.

Along the way, you’ll work with Delta Lake to ensure reliability and reproducibility, and use Unity Catalog to organize and govern your data.

Once the pipeline is built, you’ll schedule, run, and monitor it, turning it into a real, operational workflow.

Then, we will go one step further.

With Lakeflow Designer and Genie, you’ll learn how pipelines can be built visually, almost no-code, while still generating the same underlying logic.

As a bonus, we will also explore streaming pipelines using AWS Kinesis so you understand how the same declarative model works for near real-time data.

By the end of this course, you’ll understand a modern way of building data pipelines in Databricks, from raw data to production-ready workflows.

Who this course is for:

Data Engineers who want to build structured pipelines in Databricks
Analytics Engineers who want to move beyond notebook-based workflows
Data Analysts who want to understand how modern pipelines are built
Anyone learning Databricks and looking for a practical, hands-on project
People who want to learn a simpler, more structured way of building pipelines

Databricks: Spark Declarative Pipelines & Lakeflow Designer

What you'll learn

Explore related topics

Course content

Orientation, Setup & Dataset Exploration2 lectures • 16min

Delta Lake & Unity Catalog Foundations2 lectures • 17min

Building Declarative Pipelines (Medallion Architecture)5 lectures • 44min

Orchestrating Declarative Pipelines with Lakeflow Designer2 lectures • 16min

Bonus: Streaming with Declarative Pipelines3 lectures • 23min

Summary & Next Steps1 lecture • 3min

Requirements

Description

Who this course is for: