
Introduce Marco Marti, an Airflow expert and data engineer, highlighting his Udemy courses, YouTube channel data with Mark, and his role at Astronomer promoting Airflow at scale in production.
Explore the course goals for Apache Airflow 3: advanced dag authoring, covering taskflow api foundations, dag scheduling, dynamic task mapping, and ai integration with airflow.
Set up a local Apache Airflow development environment using the astro CLI or a Docker Compose file. Install Docker, start Airflow, and access the UI at localhost:8080 with login airflow.
Rewrite a two-task dag using the Taskflow API with dag and task decorators, showcasing data passing via returns and Xcom, and comparing simplicity to the old approach.
Switch from the traditional bash operator to the task bash decorator to run bash commands from Python functions, enabling complex logic and streamlined dependencies.
Apply default arguments in Airflow DAGs to set retries for all tasks, and use task group default_args with a pool named fetch crypto prices to serialize those tasks.
Explore running Airflow DAGs continuously with the continuous preset, and learn how data intervals and max active runs differ from true streaming in Airflow 3.
Learn to schedule dag runs in airflow 3 by modeling assets that trigger downstream dags on asset updates, enabling cross-dag dependencies without sensors or trigger operators.
Use the or (pipe) and end operators with parentheses to schedule a downstream dag on multiple assets, enabling one or more assets to trigger the downstream dag.
Discover how to dynamically generate assets in Airflow 3 using asset aliases, replacing static assets with runtime-created files and downstream dags.
Learn to access asset metadata in downstream dags with inlets, outlets, and asset aliases, read the asset URI and content from events, and store extra information in the metadata object.
Explore a different approach in Airflow 3 by defining assets with the asset decorator, turning outputs into dependencies instead of tasks; note this method is not very mature.
Explore dynamic task mapping in Airflow 3, where the scheduler creates task copies at runtime from data inputs using expand and partial, enabling map and reduce across mapped outputs.
Transform and filter dynamic task outputs with map in Apache Airflow 3. Learn to map over extracted file lists, apply a filter by extension, and skip zip files.
Learn to map tasks on dictionaries with expand to give each file a distinct path by returning dictionaries from extract files and filtering by extension.
discover how to combine multiple upstream outputs in Airflow using zip to create file-path tuples, by adjusting tasks to return lists and applying zip for downstream processing.
Explore the Airflow AI SDK and its decorators, and review the repository and DAGs folder for dag examples that illustrate the decorators you can use.
Airflow 3: Advanced DAG Authoring
Take your Apache Airflow skills to the next level. This course dives deep into the powerful features of Airflow 3 that separate beginners from production-ready data engineers.
You'll master the TaskFlow API—from the basics to advanced patterns like dynamic DAG generation, task groups, pools, and resource management. Learn how to build flexible, scalable pipelines using dynamic task mapping with advanced techniques like reduce, expand, and more. Explore modern scheduling strategies including assets, conditional scheduling, and event-driven pipelines with services like AWS SQS. Plus, discover how to integrate AI into your workflows using LLMs, the AI SDK, and human-in-the-loop approval patterns.
What you'll learn:
Write clean, Pythonic DAGs using the TaskFlow API
Generate DAGs dynamically and reuse tasks like a pro
Master dynamic task mapping for flexible, data-driven workflows
Schedule pipelines using assets, event-driven triggers, and continuous scheduling
Integrate AI and LLMs directly into your Airflow tasks
Implement human-in-the-loop workflows for AI approvals
Every video has its corresponding source code so it's easy for you to follow along.
Who this course is for: Data engineers and developers with foundational Airflow knowledge who want to write more efficient, maintainable, and production-grade DAGs using Airflow 3's latest features.
I hope you are ready for the course. Let's do it!
Marc Lamberti