Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

MS SQL to Databricks Spark ETL Training for Data Engineers

Name: MS SQL to Databricks Spark ETL Training for Data Engineers
Rating: 5.0 (9 reviews)

Learn to integrate MS SQL & Databricks and Spark to design reliable ETL workflows that prepare data for modern analytics

Created byOak Academy, OAK Academy Team, Ali̇ CAVDAR

Last updated 7/2026

English

What you'll learn

Understand how Databricks works and why it is a leading platform for modern data engineering
Set up, navigate, and manage your Databricks workspace and user interface
Work confidently with notebooks, files, and Databricks compute clusters
Improve development speed using productivity shortcuts and essential notebook commands
File and Notebook ManLearn Lakehouse Architecture and the Medallion (Bronze–Silver–Gold) data design patternagement in Databricks
Master Delta Lake fundamentals, including ACID transactions and Delta Log operations
Use Unity Catalog for centralized governance, permissions, and data organization
Create and manage catalogs, schemas, tables, and volumes
Build ETL pipelines using Apache Spark and apply them to real datasets
Explore and transform the Olist dataset from raw Bronze to clean Silver
Detect duplicates, missing data, schema issues, and apply data quality checks
Clean and enrich Customers, Sellers, Products, Orders, Order Items, Payments & Reviews data
Deduplicate and validate geolocation and reference tables in Silver
Perform analytical transformations for Gold-layer reporting
Conduct customer distribution, seller metrics, and product category analysis
Build unified gold-level order analytics and high-quality analytical joins
Start learning from scratch and learn about every MS SQL Server topic with examples
Learn SQL basics with SSMS (SQL Server Management Studio)
Use SQL commands to filter, sort and manipulate strings, dates numerical data from different sources
User privileges, permission commands and roles
Learn how to create, alter and drop tables

Course content

28 sections • 125 lectures • 18h 26m total length

What is database?3:19
Data is any sort of information which is stored in computer memory. This information can later be used for a website, an application or any other client to store for future purpose.ms sql sql server mssql sql
RDBMS (Relational Database Management System)2:24
DBMS is a collection of programs which enables its users to access database, manipulate data and help in representation of data. It also helps control access to the database by various users.ms sql sql server mssql sql
What is SQL/Query?2:49
SQL or SEQUEL is Structured Query Language, which is a computer language for storing, manipulating and retrieving data stored in relational database.sql server management studio ms sql server 2019 ms sql database ssms
FAQ about MS SQL5:16
SQL Server is designed for functionality. Whether you’re interested in using SQL Server for integration services, data analysis, or even machine learning applications, Oak Academy has a variety of courses to show you how it’s done.

Starting To SSMS (SQL Server Management Studio)5:08
In this part we learn how to connect MS SQL Server Management Studio (SSMS) and we will learn about interface of the SSMS such as Object Explorer.ms sql sql server mssql sql ms sql server mssql server
Create, Alter and Drop Database8:46
In this part we will learn how to create a database, alter and delete database with graphically in SSMS and using SQL query
Create, Rename and Drop Table13:42
In this part we will learn how to create a table in a database alter and delete the table with graphically in SSMS and using SQL query
Data Types and Constraints5:52
SQL data type is an attribute that specifies type of data of any object. Each column, variable and expression has related data type in SQL.oak academy learn sql from scratch sql server management studio ms sql server 2019 ms sql database ssms
Foreign Key Constraint5:14
A foreign key is a key used to link two tables together. This is sometimes called as a referencing key.mssql server dba sql query server learn ms sql server from scratch database administration mssql sql queries
Unique & Check Constraint5:31
The UNIQUE Constraint prevents two records from having identical values in a particular column. In the tableEmployees table, for example, you might want to prevent inserting multiple phone number.ms sql server 2017 sql server management studio ssms administering ms sql server ms sql query sql database sql from scratch sql server database
Quiz

Insert Data to Table10:05
In this part we will learn how to insert data to a table in two different ways; graphically with SSMS and using SQL query
Update Data in A Table5:42
In this part we will learn how to update data in a table in two different ways; graphically with SSMS and using SQL querySQL isn’t just for software engineers and system administrators. SQL also helps business analysts gain important insights into new marketplaces and make more informed strategic decisions. SQL instructors on Udemy have experience with all levels and applications of SQL programming.
Delete Data and AdventureWorks6:17
AdventureWorks database is a sample database provided by Microsoft that comes with SQL Server. We will learn how to instal and setup AdventureWorks database on our SQL Server.SQL is a special-purpose programming language used for the interaction and manipulation of data in relational databases. In DZone’s 2019 Guide to Databases, 98% of developer survey respondents said their organizations use relational databases regularly, which frequently rely on SQL to query the data. With Udemy courses on SQL, you can learn one of the most in-demand skills for professions ranging from product managers to marketers to software engineers.

Select Statement and Alias7:09
In SQL, Select statements are used to retrieve data from tables in database. These kinds of queries begin with the word ‘SELECT’SQL stands for Structured Query Language. It is a language used to manage data in a relational database. It allows you to query, manipulate, and define the data in a relational database. It has become a standard language, which means that the language the database uses will be based on SQL despite which relational database you use. This makes it a very versatile language and an important skill to have if you work with databases or data. The language closely resembles English, making it easy for English-speakers to learn and read at a glance, even for a beginner. Statements consist of clauses, identifiers, operators, and constants. Together, you can create statements such as “SELECT customer_name FROM Purchase_Table WHERE order_number = 253”. The resulting statement is structured so a computer can read, yet also easy for English-speakers with little programming
Where Clause and Operators10:49
Retrieving the all data from a table may cause speed problems. OR Sometimes, we may need to get specific rows from a table. In this case we will need to use Where clauseMany career paths benefit from understanding SQL. Almost any industry that uses data can benefit from someone that specializes in relational databases. Some jobs specifically focus on working with a database, such as a Database Manager or Database Developer role. However, more and more jobs that do not solely concentrate on database maintenance require knowledge of SQL. Mostly all websites and applications use a database, so various developer positions require knowledge of SQL. Similarly, careers that involve working with data require skills in SQL. These careers can include Data Scientists, Database Developer, DB Admins, Business Analysts, Data Analysts, Big Data Experts, and Big Data Architects. Occupations that involve a database are growing. As the amount of data we collect increases, careers around the management, analysis.
Quiz

Sorting and Grouping Data13:28
There may be a situation when you have multiple duplicate records in a table. While fetching such records, it makes more sense to fetch only unique records instead of fetching duplicate records.You should expect to learn the basics of the language in less than a month, or even as little as 2 weeks, if you are dedicated. But to be comfortable and perform non-trivial operations, it takes a few months to get your bearings in SQL. Because SQL is close to the English language, it is not very difficult to read, write, and understand if you're familiar with the English language. Even if you have no programming background, you should be able to learn SQL quickly. It does take a lot of practice to become a good SQL developer. Reading and writing SQL is just one component of being a SQL programmer. It is even more important to know how to structure data and manage relationships between tables. You need a deep understanding of relational databases to create a data structure that will work well with the type of data you have gathered
Quiz

Requirements

Just you, your keyboard, and your passion for becoming a data engineer!
No prior experience with Databricks, Spark, or the Lakehouse required
Motivation to build complete end-to-end pipelines using Databricks & Apache Spark
Curiosity about modern cloud platforms and large-scale ETL workflows
Interest in data engineering and real-world data pipelines
Basic understanding of Python (functions, loops, variables — just the essentials)
A stable internet connection to access Databricks
A working computer (Windows, Mac, or Linux)

Description

Welcome to "MS SQL to Databricks Spark ETL Training for Data Engineers" course.

Learn to integrate MS SQL & Databricks and Spark to design reliable ETL workflows that prepare data for modern analytics

MS SQL Server is one of the most widely used relational database systems in the world. It provides a powerful environment for data storage, querying, optimization, and enterprise-level analytics. With T-SQL, you can write complex queries, manage relational structures, and prepare data for downstream ETL workloads.

Databricks is a unified analytics and data engineering platform built on Apache Spark, designed for large-scale data processing, ETL workflows, and collaborative development. It enables efficient data transformation, Delta Lake storage, and enterprise-grade governance via Unity Catalog.

In this course, we will take you through everything you need to know to master data engineering using MS SQL, Databricks, and Apache Spark, supported by diagrams, hands-on examples, and real ETL pipeline development.

Designed for all skill levels, this course takes you step-by-step from beginner concepts to advanced data engineering techniques. With practical demonstrations, clear explanations, and engaging projects, you'll master the essential components of modern ETL workflows.

This course will empower you to build reliable, production-ready data pipelines by fully leveraging MS SQL and Databricks. You’ll gain the skills to clean, extract, transform, validate and optimize data, along with the problem-solving techniques required to tackle real-world ETL challenges—giving you a strong competitive edge in the data engineering field.

Ready to build powerful ETL pipelines with MS SQL and Databricks? This course is the perfect starting point!

What You Will Learn:

ETL Pipeline Architecture (MS SQL & Databricks):Understand how modern ETL workflows operate. Learn SQL-based preprocessing, Databricks notebook logic, and Spark job execution flow.

MS SQL Foundations for Data Engineering:Master SQL queries, joins, subqueries, views, stored procedures, triggers, constraints, indexing, and performance tuning.

Databricks Workspace & Notebooks:Learn how to navigate the Databricks interface, manage databases, use collaborative notebooks, and configure clusters.

Apache Spark Fundamentals:Understand Spark DataFrames, lazy evaluation, transformations, actions, distributed processing, and optimized execution methods.

Delta Lake & Modern Storage Concepts:Learn Delta Lake features such as ACID transactions, Delta Log, schema evolution, upserts, and time travel.

Unity Catalog & Data Governance:Hands-on experience with secure data management, catalogs, schemas, tables, permissions, and lineage.

Data Cleaning & Transformation (Bronze → Silver → Gold):Master medallion architecture using real datasets. Perform deduplication, missing data handling, normalization, validation, and enrichment operations.

SQL + Spark Data Processing:Combine MS SQL preprocessing with Spark transformations for scalable ETL across large datasets.

Performance Optimization (SQL & Spark):Learn SQL indexing, query tuning, execution plans, Spark partitioning, caching, broadcast joins, and optimization best practices.

Deploying ETL Workflows:Understand job scheduling, Databricks Jobs, cluster policies, and automation techniques.

By the end of this course, you'll be confident in building robust and scalable ETL pipelines with MS SQL and Databricks, fully prepared to tackle real-world data engineering projects.

What is Databricks?

Databricks is a cloud-based unified environment built on Apache Spark, designed for large-scale data processing, ETL, and analytics. It offers collaborative notebooks, scalable compute, Delta Lake storage, and strong governance tools.

What is MS SQL Server?

MS SQL Server is a relational database management system used to store structured data, write complex queries, optimize performance, and support enterprise-level analytics and ETL workflows.

What is Apache Spark?

Apache Spark is a distributed data processing engine built for fast and scalable ETL, analytics, streaming and machine learning workloads. Databricks enhances Spark with optimized execution and enterprise-ready features.

Why would you want to take this course?

Our answer is simple: The quality of teaching

OAK Academy based in London is an online education company OAK Academy gives education in the field of IT, Software, Design, development in Turkish, English, Portuguese, and a lot of different language on Udemy platform where it has over 2000 hours of video education lessons.

When you enroll, you will feel the OAK Academy`s seasoned developers' expertise

Video and Audio Production Quality

All our content is created/produced as high-quality video/audio to provide you the best learning experience

You will be,

Seeing clearly
Hearing clearly
Moving through the course without distractions

You'll also get:

Lifetime Access to The Course
Fast & Friendly Support in the Q&A section
Udemy Certificate of Completion Ready for Download

We offer full support, answering any questions

Dive in now into the "MS SQL to Databricks Spark ETL Training for Data Engineers" course.

Learn to integrate MS SQL & Databricks and Spark to design reliable ETL workflows that prepare data for modern analytics

Who this course is for:

Anyone who wants to learn data engineering through real, end-to-end Databricks workflows
Aspiring data engineers looking to gain industry-ready experience with Spark,Unity Catalog, and the Databricks ecosystem
Learners who want to strengthen their Python and SQL skills through practical data engineering projects
Anyone curious about how large-scale data systems work in real-world organizations
Those seeking a hands-on guide to building ETL pipelines using the Lakehouse and Medallion (Bronze–Silver–Gold) Architecture
Students, analysts, or professionals interested in Databricks, Apache Spark, or modern data platforms
Anyone who wants to learn data engineering through real, end-to-end Databricks workflows

MS SQL to Databricks Spark ETL Training for Data Engineers

What you'll learn

Explore related topics

Course content

Introduction to sql, sql server, microsoft sql server, microsoft sql4 lectures • 14min

Database Normalization2 lectures • 11min

Installation for MS SQL1 lecture • 5min

Starting to SSMS6 lectures • 44min

Manipulating Data for MS SQL3 lectures • 22min

Basic Select Statement2 lectures • 18min

Sorting and Grouping Data1 lecture • 13min

Wildcard Operations1 lecture • 8min

IF-THEN Statement1 lecture • 5min

Conversion Functions1 lecture • 9min

Requirements

Description

Who this course is for: