Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Data Engineering Project SQL, Python, Airflow, Docker, CI/CD
Rating: 4.5 out of 5(603 ratings)
5,627 students

Data Engineering Project SQL, Python, Airflow, Docker, CI/CD

Become a Data Engineer by Learning APIs, SQL, Python, Docker, Airflow, CI/CD, Functional/ Data Quality Tests and more!
Last updated 2/2026
English

What you'll learn

  • Build Python scripts for data extraction by interacting with APIs using Postman, loading into the data warehouse and transforming (ELT)
  • Use PostgreSQL as a data warehouse. Interact with the data warehouse using both psql & DBeaver
  • Discover how to containerize data applications using Docker, making your data pipelines portable and easy to scale.
  • Master the basics of orchestrating and automating your data workflows with Apache Airflow, a must-have tool in data engineering.
  • Understand how to perform unit, integration & end-to-end (E2E) tests using a combination of pytest and Airflow's DAG tests to validate your data pipelines.
  • Implement data quality tests using SODA to ensure your data meets business and technical requirements.
  • Learn to automate deployment pipelines using GitHub Actions to ensure smooth, continuous integration and delivery.

Course content

7 sections66 lectures5h 12m total length
  • Welcome!0:50
  • Prerequisties0:39
  • Tools Installation for Course - [IMPORTANT]2:29
  • Project Overview4:24

    Extract YouTube data via YouTube API and load it into a Postgres data warehouse via ELT with Python, then perform data quality checks with soda and enable CI/CD with Docker.

  • Building the Code0:40

    Build the code from the ground up with the class, using GitHub to store and version the project, and reference the final code as you move into data extraction.

  • APPENDIX0:02

Requirements

  • At least 8 GB of RAM, though 16 GB is better for smoother performance
  • Python, Docker & Git installation to run/access the code course
  • Beginner-level SQL knowledge is required
  • Intermediate-level Python knowledge is required
  • Basic understanding of Docker is needed
  • Knowledge of CI/CD is a plus but not necessary

Description

Data Engineering is the backbone of modern data-driven companies. To excel, you need experience with the tools and processes that power data pipelines in real-world environments. This course gives you practical, project-based learning with the following tools PostgreSQL, Python, Docker, Airflow, Postman, SODA and Github Actions. I will guide you as to how you can use these tools.


What you will learn in the course:


  1. Python for Data Engineering: Build Python scripts for data extraction by interacting with APIs using Postman, loading into the data warehouse and transforming (ELT). In this course we use Python version 3.10.

  2. SQL for Data Pipelines: Use PostgreSQL as a data warehouse. Interact with the data warehouse using both psql & DBeaver

  3. Docker for Containerized Deployments: Discover how to containerize data applications using Docker, making your data pipelines portable and easy to scale.

  4. Airflow for Workflow Automation: Master the basics of orchestrating and automating your data workflows with Apache Airflow, a must-have tool in data engineering. In this course we use Airflow version 2.9.2.

  5. Testing and Data Quality Assurance: Understand how to perform unit, integration & end-to-end (E2E) tests using a combination of pytest and Airflow's DAG tests to validate your data pipelines. Implement data quality tests using SODA to ensure your data meets business and technical requirements.

  6. CI/CD for Automated Testing & Deployment: Learn to automate deployment pipelines using GitHub Actions to ensure smooth, continuous integration and delivery.

Who this course is for:

  • Aspiring Data Engineers: If you already have basic SQL & intermediate-level Python and want to learn Data Engineering by working with real tools and projects, this course will help you build strong foundational skills and practical experience to start your career.
  • Early-Career Data Professionals: If you have some experience in data-related roles (Data Analyst, Junior Data Engineer, Data Scientist) and want to deepen your understanding of essential tools like Docker, CI/CD, and automated testing, this course will help you level up your engineering skills.