
What will be covered in this course and an introduction about my self.
Description of the workflows, What are workflows? This video also explains different types of workflows. What is the relationship of workflows with Apache Airflow?
You will find an explanation of the Medallion Architecture in this lecture.
I have explained the problem statement that Apache Airflow will solve for us.
First Glance of Apache Airflow.
I have explained about creating a free trial account on GCP, Google Cloud Platform, and then creating a VM (Virtual Machine) and let the machine ready for next lecture in which we will install airflow.
I have provided scripts as resources for this lecture. In this lecture, I have explained about how to install plain vanilla airflow.
I have explained how to install Apache Airflow in Docker Container. I have also attached docker-compose script along with this lecture.
This is part one of the Airflow Config file explanation.
In this lecture, I have explained about rest of the airflow configuration.
How different components of Airflow work together, is explained in this lecture. Those components are Scheduler, Webserver and Meta Database.
I have explained about Dags, and Control Flow in this lecture.
Tasks, Operators, and Sensors are described in this lecture.
Continued explanation about Scheduler, XComs, Variables, Connections, and Hooks.
Explanation about Dag Page in Airflow UI
I have Presented the top navigation bar explanation in this Lesson.
UI elements explained on the Dag Details Page and multiple Views are also explained in this lesson.
Explanation of the branching operator Dag in which I have make use of Airflow Variables
I have explained how XComs can be used to transmit information between Tasks.
Explanation about the Medallion Dag Code, which is written in Scala. I have attached the source code along with this vide.
Batch Use Case of Medical Devices Dataset. I have attached the source code along with this video.
I have explained how we can use JDBC and MySQL Operators which are provided in Airflow. I have attached the source code along with this video.
You will find an explanation about custom operators in this lecture, How to write code for Custom Operators, and how to deploy Custom Operators.
Faker Python Package is being used to generate a dummy dataset, and then on top of that, I have explained how to write ETL Pipeline and orchestrate in Apache Airflow.
I have shared the code for how we can create E-commerce ETL Pipeline using Apache Spark. There is an assignment associated with this lecture.
Explanation about Snowflake Dag
Explanation about Elasticsearch Dag, How we can create a connection with Elasticsearch.
Airflow is an extensive tool and has many different features. I will explain in details how we can take benefit from Airflow in our Big Data-related projects. In this Apache Airflow I will start from basic concepts, then move to Installation, Later will deep dive into advanced concepts. In the end, I will create a few big data-related projects and create Airflow Dag to give you a flavor of how all components of Airflow work together.
Please note that Apache Airflow is not a Data Flow tool rather its a Workflow Orchestration tool, With such a nice and beautiful User Interface and a bundle of rich components available. When data is flowing through different processes it is difficult to monitor each and every process, if we are working in CLI based environment. Rather using Apache Airflow we can get notified by color scheming of tasks, and we can start the task right from where it has failed.
I will show you different use cases of Big Data universe, in which we can take leverage from Apache Airflow.
After completing this course Part 1, you will be find your self equipped with a new tool just to solve your Big Data use cases.