
Explore Apache Airflow with Google Cloud Composer, detailing use cases, architecture, and cloud vs custom setups, plus hands-on workflow concepts like tasks, variables, and BigQuery integration.
Explore how data pipelines coordinate tasks with dependencies—from extraction and transformation to loading—using Apache Airflow, with use cases like weather dashboards, predictions, and external deliveries.
Identify tasks and dependencies in a data pipeline and explain how orchestration with Apache Airflow coordinates various program units: shell scripts, Python programs, or database procedures for automated, reliable processing.
Understand what Apache Airflow is and how Google Cloud Composer offers environment options to run it, with Python-based directed acyclic graph workflows for automating data pipelines.
Airflow uses a directed acyclic graph to define tasks and dependencies, with operators like bash, python, and sensors that run scripts or functions without cycles.
Explore how Apache Airflow architecture orchestrates workflows via the web server dashboard, scheduler, queue, and worker nodes, driving DAG execution and task visualization.
Explain the single-node and multinode Airflow architectures in Google Cloud Composer, detailing the scheduler and worker roles, and highlighting high availability and scalability.
Provision a Google Cloud Composer environment by enabling the API and creating the environment. Explore auto scaling and the differences between Composer versions 1 and 2.
Provision a Google Cloud Composer environment by configuring name, location, resources, image version, and Python version. The VM provisioning installs Python and Apache Airflow and takes about 20–25 minutes.
Navigate the Google Cloud Composer Apache Airflow web UI to monitor environment health, view dag runs, trigger tasks, view logs, and manage connections, dag files, and data.
Discover how Apache Airflow program structure organizes imports, arguments, task setups, and schedules, with graph view and execution status indicators guiding task dependencies.
Create and submit an Apache Airflow DAG using python and bash operators, define tasks with dependencies, schedule and monitor runs, and understand failure handling and logging in the UI.
Master templating in Apache Airflow using macros and dynamic parameters, including the execution date, to drive task execution and control branching.
Explore templating in Apache Airflow, including upstream dependencies, template IDs, and dynamic dates, and see how they drive task execution and logging in Google Cloud Composer.
Discover how Apache Airflow variables store configuration values in the metadata database, avoiding hard-coded settings, and learn how to define read, update, and use them in Google Cloud Composer.
Demonstrates defining and importing Airflow variables in a DAG, retrieving values by keys, and using bash and python operators to display and compute results.
Learn how to use Airflow with the bash operator and templating to run a Python script from another folder or different machine, enabling cross‑mission scripting in Google Cloud Composer.
Learn how to call a bash script from Airflow in Google Cloud Composer, manage script uploads, load data from the data bucket, and execute across different folders and machines.
Apache Airflow is an open-source platform to programmatically author, schedule and monitor workflows.
Cloud Composer is a fully managed workflow orchestration service that empowers you to author, schedule, and monitor pipelines that span across clouds and on-premises data centers. Built on the popular Apache Airflow open source project and operated using the Python programming language, Cloud Composer is free from lock-in and easy to use.
With Apache Airflow hosted on cloud ('Google' Cloud composer) and hence,this will assist learner to focus on Apache Airflow product functionality and thereby learn quickly, without any hassles of having Apache Airflow installed locally on a machine.
Cloud Composer pipelines are configured as directed acyclic graphs (DAGs) using Python, making it easy for users of any experience level to author and schedule a workflow. One-click deployment yields instant access to a rich library of connectors and multiple graphical representations of your workflow in action, increasing pipeline reliability by making troubleshooting easy.
This course is designed with beginner in mind, that is first time users of cloud composer / Apache airflow. The course is structured in such a way that it has presentation to discuss the concepts initially and then provides with hands on demonstration to make the understanding better.
Note : This course also has AI enabled role play on "Interviewing for a Data Engineering Role: Apache Airflow Proficiency" - An interactive Chat.
The python DAG programs used in demonstration source file (9 Python files) are available for download toward further practice by students.
Happy learning!!!