Azure Databricks Data Engineering: Build a Lakehouse
Rating: 4.7 out of 5 (36 ratings)
2,690 students
Created by Sakibul Hasan
Last updated 3/2026
English

What you'll learn

  • Design and implement a professional Medallion Architecture (Bronze, Silver, Gold) using Azure Databricks.
  • Build and deploy scalable ETL pipelines using Azure Databricks, PySpark, and Spark SQL to transform raw API data.
  • Connect Native BI directly to Databricks Gold tables to create interactive dashboards and visualize business insights.
  • Implement professional DevOps practices by integrating Databricks with GitHub for version control and CI/CD.
  • Orchestrate automated data workflows using Databricks Workflows to schedule daily data ingestion and processing.

Course content

6 sections, 26 lectures, 1h 6m total length
  • Course Overview - 2:31

    Build a live, industrial-grade lakehouse with Azure Databricks: automate a data factory that pulls and cleans government data and refreshes an interactive dashboard daily.

  • The Architecture: Why Medallion? - 5:24
  • Technical Blueprint: The Tools & The Flow - 3:31
  • Unlocking Cloud Access (Free Tier Strategy) - 2:52

    Learn to unlock cloud access with the Azure free account, deploy Databricks workspaces and storage, and manage costs with auto-termination, free credits, and 12 months of free services.

Requirements

  • No prior Azure cloud experience is required; the course guides you through setting up a free enterprise lab environment.
  • Basic familiarity with SQL concepts (SELECT, FROM, WHERE) is recommended but not mandatory.
  • A computer (Windows or Mac) with internet access to log into the Azure Portal.

Description

Stop watching tutorials. Start building.

Are you tired of "Hello World" examples that don't reflect the real world? Do you want to build a portfolio-ready Data Engineering project that you can actually show to hiring managers?

Welcome to the Dubai Real Estate Lakehouse Project.

In this course, we won't just learn syntax; we will build a production-grade data platform. You will take on the role of a Lead Data Engineer tasked with analyzing the chaotic, high-volume property market of Dubai. Your mission? To build an automated "Daily Pulse" system that ingests, cleans, and visualizes market trends every single morning, entirely in the cloud.

What makes this course different?

  • 100% Cloud-Native: No Power BI. No Tableau. No expensive licenses. You will learn the modern way to visualize data using Databricks Native Dashboards.

  • Zero Cost Strategy: I will show you how to build this entire enterprise architecture using the Azure Free Account ($200 credit) and smart cost-management techniques. You don't need a corporate email or a paid subscription.

  • End-to-End Pipeline: We don't skip steps. You will build everything from the Raw Data Ingestion to the final Executive Dashboard.

What you will learn:

  • Azure Infrastructure: Deploy Data Lakes (Gen2), Key Vaults, and Databricks Workspaces using the Azure Portal.

  • The Medallion Architecture: Architect a professional Bronze (Raw), Silver (Clean), and Gold (Aggregated) data flow.

  • PySpark & SQL Mastery: Write robust transformations to handle messy JSON data, enforce schemas, and deduplicate records.

  • Delta Lake Internals: Master "Time Travel," ACID transactions, and Schema Enforcement to treat files like database tables.

  • Orchestration: Replace manual runs with automated Databricks Workflows (Jobs) that run on a Cron schedule.

  • Native BI: Build stunning, auto-updating dashboards directly inside Databricks using SQL visualizations.
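
To make the Orchestration item above more concrete, here is a minimal sketch of what a scheduled three-task job definition might look like. It is not taken from the course. The dictionary follows the general shape of Databricks Jobs API settings (a Quartz cron expression, a timezone, and tasks linked by `depends_on`), but the job name, task keys, notebook paths, run time, and timezone are all assumptions chosen for illustration. The two helpers run locally and only inspect the dictionary; nothing here calls Databricks.

```python
from graphlib import TopologicalSorter

# Hypothetical job settings. All names, paths, and the 06:00 run time are
# illustrative assumptions, not values from the course.
job_settings = {
    "name": "daily-pulse",
    "schedule": {
        # Quartz cron fields: second minute hour day-of-month month day-of-week
        "quartz_cron_expression": "0 0 6 * * ?",  # every day at 06:00:00
        "timezone_id": "Asia/Dubai",
        "pause_status": "UNPAUSED",
    },
    "tasks": [
        {
            "task_key": "bronze_ingest",
            "notebook_task": {"notebook_path": "/Repos/demo/01_bronze"},
        },
        {
            "task_key": "silver_clean",
            "depends_on": [{"task_key": "bronze_ingest"}],
            "notebook_task": {"notebook_path": "/Repos/demo/02_silver"},
        },
        {
            "task_key": "gold_model",
            "depends_on": [{"task_key": "silver_clean"}],
            "notebook_task": {"notebook_path": "/Repos/demo/03_gold"},
        },
    ],
}


def describe_quartz(expr):
    """Label the fields of a Quartz cron expression (6 fields, optional 7th year)."""
    names = ["second", "minute", "hour", "day_of_month", "month", "day_of_week", "year"]
    parts = expr.split()
    if len(parts) not in (6, 7):
        raise ValueError(f"expected 6 or 7 fields, got {len(parts)}")
    return dict(zip(names, parts))


def run_order(tasks):
    """Return task keys in an order that respects every depends_on link."""
    graph = {
        task["task_key"]: {dep["task_key"] for dep in task.get("depends_on", [])}
        for task in tasks
    }
    return list(TopologicalSorter(graph).static_order())


fields = describe_quartz(job_settings["schedule"]["quartz_cron_expression"])
order = run_order(job_settings["tasks"])
print(fields["hour"], order)
```

Chaining the tasks with `depends_on` means the Silver step starts only after Bronze succeeds, and Gold only after Silver, which mirrors the Medallion flow described in this course.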

The Project: "The Dubai Pulse"

You will build a system that tracks millions of dollars in property transactions.

  1. Ingest: Fetch raw JSON transaction logs into the Bronze layer.

  2. Clean: Fix data quality issues, handle missing values, and standardize currency formats in the Silver layer.

  3. Model: Create "Star Schema" fact tables in the Gold layer for high-performance analytics.

  4. Visualize: Launch a live URL dashboard showing "Top 10 Investment Areas" and "Monthly Sales Trends."
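
The four steps above can be sketched in a few lines of plain Python. This is only an illustration of the Bronze, Silver, Gold idea, not the course's PySpark or Delta Lake code: the sample records, the field names (`txn_id`, `area`, `amount`, `date`), and the "AED 1,200,000" amount format are assumptions, and the Gold step here is a simple aggregation rather than a full star schema.

```python
import json
from collections import defaultdict

# Hypothetical raw payload; the field names and values are made up for illustration.
RAW_JSON = """
[
  {"txn_id": "T1", "area": "Dubai Marina", "amount": "AED 1,200,000", "date": "2026-01-15"},
  {"txn_id": "T1", "area": "Dubai Marina", "amount": "AED 1,200,000", "date": "2026-01-15"},
  {"txn_id": "T2", "area": "Downtown Dubai", "amount": "2500000", "date": "2026-01-20"},
  {"txn_id": "T3", "area": null, "amount": "AED 900,000", "date": "2026-02-03"},
  {"txn_id": "T4", "area": "Dubai Marina", "amount": "AED 800,000", "date": "2026-02-10"}
]
"""


def ingest_bronze(raw_text):
    """Bronze: keep the records exactly as received."""
    return json.loads(raw_text)


def parse_amount(value):
    """Standardize amounts such as 'AED 1,200,000' or '2500000' to a float."""
    cleaned = str(value).replace("AED", "").replace(",", "").strip()
    return float(cleaned)


def clean_silver(bronze_rows):
    """Silver: drop rows with missing values, deduplicate, standardize amounts."""
    seen = set()
    silver = []
    for row in bronze_rows:
        if not row.get("area") or row.get("amount") is None:
            continue  # skip rows with a missing area or amount
        if row["txn_id"] in seen:
            continue  # skip duplicate transaction IDs
        seen.add(row["txn_id"])
        silver.append({
            "txn_id": row["txn_id"],
            "area": row["area"],
            "amount_aed": parse_amount(row["amount"]),
            "month": row["date"][:7],  # 'YYYY-MM'
        })
    return silver


def aggregate_gold(silver_rows, top_n=10):
    """Gold: totals per area (top N) and per month, ready for a dashboard."""
    by_area = defaultdict(float)
    by_month = defaultdict(float)
    for row in silver_rows:
        by_area[row["area"]] += row["amount_aed"]
        by_month[row["month"]] += row["amount_aed"]
    top_areas = sorted(by_area.items(), key=lambda kv: kv[1], reverse=True)[:top_n]
    return {"top_areas": top_areas, "monthly_sales": dict(sorted(by_month.items()))}


bronze = ingest_bronze(RAW_JSON)
silver = clean_silver(bronze)
gold = aggregate_gold(silver)
print(gold)
```

Keeping the Bronze copy untouched is the point of the layering: if a cleaning rule in the Silver step turns out to be wrong, the raw records are still available to reprocess.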

Who is this course for?

  • Aspiring Data Engineers who need a flagship project for their portfolio/resume.

  • SQL/Python Developers looking to move into Big Data and Spark.

  • Data Analysts who want to learn how to build their own pipelines and move up the stack.

Requirements:

  • No prior Cloud experience required (we start from zero).

  • Basic familiarity with SQL or Python is helpful but not mandatory.

  • A Google/Microsoft email address to sign up for the Free Azure Account.

  • Note: A Corporate/Work email is NOT required for this course.

Enroll today and build the Data Lakehouse of tomorrow.

Who this course is for:

  • Aspiring Data Engineers looking to build a portfolio-ready project using the industry-standard Modern Data Stack.
  • Data Analysts who want to transition into engineering by learning how to build the pipelines that feed their reports.
  • Python or SQL Developers interested in learning Big Data processing with Apache Spark and cloud architecture.