Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Build the Perfect Data Stack for Analytics Engineering
18 students
Last updated 10/2025
English

What you'll learn

  • Build a production-ready DBT project and understanding everything about DBT set up
  • Having best practices about data modeling and SQL code convention
  • How to automate everything that is too time consuming : testing, documentation and cleaning
  • Monitor and optimize data warehouse costs

Course content

14 sections51 lectures4h 3m total length
  • Introduction4:01

    This course teaches you how to build a robust analytics data stack to monitor business performance, identify issues, evaluate ROI, and make data-driven decisions.

    We'll explore why modern analytics has evolved from Excel (prone to inconsistency and errors) to SQL-based workflows that provide structure, readability, and reliability. You'll learn the Extract-Load-Transform (ELT) framework for data manipulation, focusing specifically on the Transform stage using DBT (Data Build Tool).

    Prerequisites: Basic SQL knowledge is required. The course assumes you understand how to write SELECT queries and use WHERE clauses.

    What you'll build: A production-ready data transformation pipeline following software engineering best practices, enabling your team to create clean, documented, and cost-optimized analytics datasets.

  • Understanding DBT: Core Concepts Through a Practical Example4:33

    This section introduces DBT using a supermarket revenue tracking example. You'll learn how DBT transforms raw transaction data (purchases and customer loyalty information) into actionable daily revenue reports.

    Key concepts covered:

    • Design-first approach: Always visualize your desired output before writing SQL

    • What DBT adds beyond SQL: Organization, table referencing, scheduling, and software engineering best practices

    • DBT's folder structure: Separate folders for raw data (using source()) and transformed data (using ref())

    • Referencing system: source() for raw data not created by DBT, ref() for tables created by other DBT models

    • Why organize raw data separately: Even with thousands of tables, creating one staging query per raw table improves readability and maintainability

    By the end of this section, you'll understand how DBT brings software development principles (version control, modularity, documentation) to data analytics workflows.

  • Running DBT Models: Command-Line Execution and Dependencies3:41

    This section explains how to execute your DBT models using command-line interface (CLI), just like software developers run code in a terminal.

    Key concepts covered:

    • What is a terminal? A command-line interface to navigate folders and execute files without clicking

    • Basic DBT commands:

      • dbt run --select revenue_daily - Runs a specific model

      • dbt run --select +revenue_daily - Runs a model plus all its upstream dependencies

    • Why command-line execution matters: Enables scheduling and automation of your data pipelines

    • Software engineering parallel: DBT brings developer workflows to analytics, making data transformation schedulable and reproducible

    What's next: You'll learn how to set up a complete DBT project from scratch, following best practices for building a robust data stack.

Requirements

  • Know how to code in SQL
  • Maybe an idea of what DBT is

Description

Master the complete analytics engineering workflow by building a production-ready data stack from scratch using DBT (Data Build Tool), the industry-standard transformation framework trusted by data teams worldwide.

This comprehensive course takes you from zero to advanced DBT practitioner, covering everything needed to build, deploy, and maintain scalable data pipelines in real-world production environments. You'll learn the exact methodologies and best practices I've developed over 12+ years working across data analyst, data scientist, and analytics engineer roles in fast-growing startups.

What you'll build:

  • Complete three-layer data architecture (staging, intermediate, mart) following software engineering principles

  • Automated CI/CD pipelines with DBT Cloud for pull request testing and production deployments

  • Cost monitoring system to track and optimize data warehouse expenses

  • Self-healing testing framework with automated failure remediation

  • Production-grade incremental models for efficient data processing

Key topics covered:

  • DBT project setup with development/production environment separation

  • Granularity-based data modeling that scales from thousands to billions of rows

  • Version control workflows with Git and automated quality enforcement via pre-commit hooks

  • SQL linting with SQLFluff and automated documentation generation

  • Workflow automation using Makefiles and GitHub Actions

  • Query cost attribution and optimization strategies

  • Advanced DBT features: seeds, macros, snapshots, and custom tests

Who this is for: Data analysts transitioning to analytics engineering, data engineers building transformation layers, or anyone responsible for maintaining data pipelines serving hundreds of employees and millions of rows.

By the end, you'll have a battle-tested, production-ready data stack that actually works at scale—not just theory, but proven practices from real company environments.

Who this course is for:

  • anyone interested in analytics engineering or in data platform construction