Microsoft Fabric - Apache Spark and Notebooks

Name: Microsoft Fabric - Apache Spark and Notebooks
Rating: 4.7 (71 reviews)

Learn the Microsoft Fabric Spark environment, notebooks, dataframes, filtering, aggregating, joining dataframes and more

Created by Randy Minder

Last updated 10/2023

English

What you'll learn

Apache Spark and Notebook Integration into Microsoft Fabric
Overview of the Microsoft Fabric Notebooks Environment
Dataframe Basics
Common Operations with Dataframes
Filtering data using Dataframes
Aggregating data using Dataframes
Joining Dataframes
Spark Libraries
Working with Dates
SQL Statements and Spark
Spark Configuration and Tuning
Exercises and Questions for Practice

Course content

5 sections • 37 lectures • 5h 15m total length

Introduction (2:59)
Introduction to the course
Important! You May Not Need to Purchase This Course! (1:49)
Important. Please listen to this brief lecture. It may not be necessary to purchase this course.
Getting Started with Fabric (3:08)
Before getting started

Before Getting Started (3:27)
Listen to this short lecture before the next lecture
Introduction to Apache Spark (8:54)
An introduction to Apache Spark in Fabric
Spark Libraries (3:46)
In this lecture we discuss the four major components or libraries that make up Spark
Dataframe Basics - Part 1 (9:47)
Part 1 of a discussion on Spark dataframes
Dataframe Basics - Part 2 (19:20)
Part 2 of a discussion on Spark dataframes
Dataframe Basics - Part 3 (10:39)
Part 3 of a discussion on Spark dataframes
Common Operations with Dataframes (15:16)
Common operations when using dataframes
Filtering Data (22:21)
In this lecture we discuss various ways to filter data in Spark
Aggregating Data (15:20)
In this lecture we talk about ways to aggregate data in Spark.
Joining Dataframes (8:33)
In this lecture we talk about how to join Spark dataframes
Working with Dates (11:38)
In this lecture we demonstrate how Spark can deal with dates
SQL Statements and Spark (8:56)
In this lecture we talk about using the spark sql function and its interaction with other Spark functions.
Microsoft Fabric 1.1 / Spark Runtime (5:54)
In this lecture we talk about the Microsoft Fabric / Spark runtime
Spark Configuration (10:43)
In this lecture we talk about configuring Spark
Spark Autotuning (4:25)
In this lecture we talk a bit about Spark autotuning

Exercise #1 (5:11)
Exercise using notebooks
Exercise #2 (16:59)
Exercise using notebooks
Question #1 (0:33)
Question to test your knowledge
Question #2 (0:38)
Question to test your knowledge
Exercise #3 (10:20)
Exercise using notebooks
Exercise #4 (6:33)
Exercise using notebooks
Question #3 and #4 (0:39)
Questions to test your knowledge
Exercise #5 (8:33)
Exercise using notebooks
Exercise #6 (7:38)
Exercise using notebooks
Exercise #7 (6:24)
Exercise using notebooks
Exercise #8 (4:50)
Exercise using notebooks
Exercise #9 (4:04)
Exercise using notebooks
Exercise #10 (5:52)
Exercise using notebooks

Requirements

Some working experience with Microsoft Fabric and Spark would be useful but not absolutely necessary

Description

This course takes you into the Microsoft Fabric experience of Apache Spark and Notebooks, where the power of distributed computing meets the elegance and fun of interactive coding in Python, R, Scala or SQL.

In this comprehensive course, you will be introduced to Spark's PySpark library, empowering you to tackle common data challenges with remarkable ease. You will wrangle data in pandas dataframes and learn the most common dataframe operations. In short, you will acquire the skills necessary to manipulate your data in many different ways.

As you move deeper into Spark, you will master the art of filtering data, joining dataframes, and manipulating dates and times with ease. You will also be exposed to using SQL in Spark, allowing you to use SQL queries to extract data from Delta Parquet files and store it in dataframes.

This course will also equip you with the knowledge you need to optimize Spark performance through effective configuration and autotuning. You will learn that there is very little you have to do to fully optimize the Spark environment in which your notebooks run.

The course consists of over 30 lectures, complemented by hands-on practice exercises that will challenge your understanding of the concepts being taught and perhaps your research ability. By the end of this course, you will be a confident and proficient Spark notebook user, capable of harnessing its immense power to solve real-world data scenarios.

Here are just some of the compelling benefits you will gain from this course:

Master the fundamentals of Spark libraries, dataframes, and SQL
Learn how to filter, aggregate, and join data using dataframes
Gain expertise in manipulating dates and times
Data Wrangling
Learn how to optimize Spark performance through configuration and autotuning
Solve real-world data scenarios with Spark notebooks
Enhance your resume with in-demand Fabric Spark notebook skills
Position yourself for success in the booming field of Big Data

If you are ready to take your data skills to the next level, then enroll in this course today and learn how you can use Apache Spark and notebooks, integrated in Microsoft Fabric, in your day-to-day work.

Who this course is for:

Anyone wanting to learn how Apache Spark and Notebooks are integrated into Microsoft Fabric

Course content

Introduction (3 lectures • 8min)

Introduction to Microsoft Fabric Notebooks (3 lectures • 41min)

Apache Spark and Notebooks (15 lectures • 2hr 39min)

Data Wrangler (3 lectures • 29min)

Test Your Knowledge (13 lectures • 1hr 18min)