Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Automated CI/CD in Azure Databricks using GitHub Actions
Rating: 4.5 out of 5(1 rating)
23 students

Automated CI/CD in Azure Databricks using GitHub Actions

Master the end-to-end automation of CI/CD pipelines in Azure Databricks using GitHub Actions.
Created byAbiola Abiola
Last updated 6/2025
English

What you'll learn

  • Gain a solid understanding of Continuous Integration (CI) and Continuous Deployment (CD) principles, and how they can simplify and enhance deployment workfllow
  • Learn how to set up both development and production environments in Azure, and how to integrate GitHub with Azure Databricks effectively.
  • Explore branch protection rules and merging strategies to promote safe, efficient collaboration within your development team.
  • Dive into the process of creating CI pipelines, including the setup and coding of YAML files to automate builds.
  • Build and implement a complete CI pipeline, ensuring your Azure Databricks notebooks are validated and seamlessly integrated before deployment.
  • Understand the steps for setting up a Continuous Deployment pipeline to automatically promote changes from development to production environments.
  • Perform comprehensive end-to-end testing of the CI/CD pipeline to confirm all components function correctly before go-live.

Course content

1 section11 lectures1h 18m total length
  • Introduction - What is CI/CD?1:23
  • Azure Services Setup - Resource Groups2:18
  • Provision Dev and Prod Databricks Workspaces4:38
  • Create GitHub Repository3:15
  • Clone GitHub Repository in Azure Databricks Workspaces9:06
  • Git-based Feature Development Workflow9:59
  • Set up GitHub Secrets for CI/CD6:13
  • Create GitHub Actions Workflow19:21
  • Visualize Data in Power BI Desktop using DirectQuery6:13
  • Test and Verify CI/CD for Resilience14:36
  • Conclusion0:59

Requirements

  • Microsoft Azure Account
  • GitHub Account
  • Basic Programming skills

Description

This hands-on course offers a comprehensive, step-by-step guide to building a fully automated CI/CD pipeline tailored for Azure Databricks using GitHub Actions. You'll begin by understanding the core principles of continuous integration and continuous deployment in a data engineering context, with a strong focus on real-world applications, scalable development practices, and modern DevOps methodologies. As the course progresses, you'll learn how to configure GitHub Actions to securely connect with your Databricks workspace, leveraging access tokens and environment secrets to enable robust automation and secure, repeatable deployments.

Through practical demonstrations, you’ll discover how to validate notebooks automatically, enforce coding standards, and sync changes directly into your Databricks repositories. The course will show you how to structure your workflows to handle deployments across multiple environments—such as development, staging, and production—ensuring code quality, consistency, and traceability at every stage. You’ll also gain hands-on experience in triggering Databricks jobs programmatically, managing workspace assets via the Databricks CLI, and setting up reusable GitHub Actions to streamline future automation efforts.

By the end of the course, you’ll have the skills and confidence to implement enterprise-ready CI/CD pipelines that reduce manual intervention, improve deployment accuracy, boost productivity, and accelerate your team's data delivery lifecycle within the Azure Databricks ecosystem.

Who this course is for:

  • This course is for beginners and professionals interested in becoming an Azure Data Engineer and DevOps Engineer.