Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Databricks Certified Data Engineer Associate - Preparation
Bestseller
Rating: 4.5 out of 5(17,634 ratings)
116,113 students

Databricks Certified Data Engineer Associate - Preparation

Complete preparation for Databricks Data Engineer Associate certification + hands-on training
Last updated 6/2026
English

What you'll learn

  • Understand how to use Databricks Lakehouse Platform and its tools
  • Build ETL pipelines using Apache Spark SQL and Python
  • Process data incrementally in batch and streaming mode
  • Orchestrate production pipelines
  • Understand and follow best security practices in Databricks

Course content

8 sections59 lectures5h 19m total length
  • Course Overview1:36

    This course overview outlines the Databricks certified data engineer associate preparation, covering lakehouse platform, etl with spark sql and python, incremental data processing, production pipelines, and data governance.

  • New Exam Version (Update in Progress)0:55
  • What is Databricks5:05

    Learn how Databricks combines a multi-cloud lakehouse built on Apache Spark, with the cloud service, runtime, and workspace, plus DBFS and Delta Lake support for batch and streaming analytics.

  • Free trial on Azure3:46

    Learn to sign up for a 14-day free trial of Databricks on Azure, including creating a resource group, selecting the 14-day free premium option, and launching a workspace.

  • Exploring Workspace4:00

    Navigate the Databricks workspace interface, including the left sidebar and workspace explorer, to organize notebooks, folders, and data assets across SQL, data engineering, and machine learning.

  • Course Materials1:43

    Import notebooks into the Databricks workspace via git folders with a GitHub repository; clone, access the course materials, and follow along to recreate solutions.

  • Creating Cluster6:51

    Create a Databricks cluster by navigating to compute, naming it, and choosing a single or multi-node setup with a driver and workers, runtime version 13.3 LTS, and auto termination.

  • Notebooks Fundamentals12:04

    Create and manage Databricks notebooks, switch languages with magic commands, use markdown for notes, run cells, and export, import, or revert revisions for modular workflows.

  • New Notebook Features0:36
  • Git folders8:03

    Explore git folders (Databricks Repos) for source control by integrating with git providers like GitHub, then clone, commit, push, and manage branches and pull requests.

Requirements

  • Basic SQL knowledge will be required
  • Basic Python programming experience will be required
  • Knowledge of cloud fundamentals will be beneficial, but not necessary

Description

If you are interested in becoming a Certified Data Engineer Associate from Databricks, you have come to the right place! This study guide will help you with preparing for this certification exam.


By the end of this course, you should be able to:

  • Understand how to use and the benefits of using the Databricks Lakehouse Platform and its tools, including:

    • Data Lakehouse (architecture, descriptions, benefits)

    • Data Science and Engineering workspace (clusters, notebooks, data storage)

    • Delta Lake (general concepts, table management and manipulation, optimizations)

  • Build ETL pipelines using Apache Spark SQL and Python, including:

    • Relational entities (databases, tables, views)

    • ELT (creating tables, writing data to tables, cleaning data, combining and reshaping tables, SQL UDFs)

    • Python (facilitating Spark SQL with string manipulation and control flow, passing data between PySpark and Spark SQL)

  • Incrementally process data, including:

    • Structured Streaming (general concepts, triggers, watermarks)

    • Auto Loader (streaming reads)

    • Multi-hop Architecture (bronze-silver-gold, streaming applications)

    • Delta Live Tables (benefits and features)

  • Build production pipelines for data engineering applications and Databricks SQL queries and dashboards, including:

    • Jobs (scheduling, task orchestration, UI)

    • Dashboards (endpoints, scheduling, alerting, refreshing)

  • Understand and follow best security practices, including:

    • Unity Catalog (benefits and features)

    • Entity Permissions (data objects Privileges)


With the knowledge you gain during this course, you will be ready to take the certification exam.

I am looking forward to meeting you!

Who this course is for:

  • Anyone aiming to pass the Databricks Data Engineer Associate certification exam
  • University students looking for a career in Data Engineering
  • Data Engineers moving from other technologies and aiming to apply their skills to Databricks
  • Data Engineers/ Data Warehouse Developers currently working on on-premises technologies
  • Anyone new to Databricks and want to save time by learning Databricks fundamentals