
Introduction to the course
Important. Please listen to this brief lecture. It may not be necessary to purchase this course.
Before getting started
A discussion of what Microsoft Fabric Notebooks are
In this lecture we do a quick tour of the notebook environment
Part 2 of our quick tour of the notebook environment
Listen to this short lecture before the next lecture
An introduction to Apache Spark in Fabric
In this lecture we discuss the four major components or libraries that make up Spark
Part 1 of a discussion on Spark dataframes
Part 2 of a discussion on Spark dataframes
Part 3 of a discussion on Spark dataframes
Common operations when using dataframes
In this lecture we discuss various ways to filter data in Spark
In this lecture we talk about ways to aggregate data in Spark.
In this lecture we talk about how to join Spark dataframes
In this lecture we demonstrate how Spark can deal with dates
In this lecture we talk about using the spark sql function and its interaction with other Spark functions.
In this lecture we talk about the Microsoft Fabric / Spark runtime
In this lecture we talk about configuring Spark
In this lecture we talk a bit about Spark autotuning
Part 1 of our discussion of Data Wrangler
Part 2 of our discussion of Data Wrangler
Part 3 of our discussion of Data Wrangler
Exercise using notebooks
Exercise using notebooks
Question to test your knowledge
Question to test your knowledge
Exercise using notebooks
Exercise using notebooks
Questions to test your knowledge
Exercise using notebooks
Exercise using notebooks
Exercise using notebooks
Exercise using notebooks
Exercise using notebooks
Exercise using notebooks
This course takes you into the Microsoft Fabric experience of Apache Spark and Notebooks, where the power of distributed computing meets the elegance and fun of interactive coding in Python, R, Scala or SQL.
In this comprehensive course, you will be introduced to Spark's pyspark library, empowering you to tackle common data challenges with remarkable ease. You will wrangle data in pandas dataframes and learn the most common dataframe operations. In short, you will acquire the skills necessary to manipulate your data in many different ways.
As you move deeper into Spark, you will master the art of filtering data, joining dataframes, and manipulating dates and times with ease. You will also be exposed to using SQL in Spark, allowing you to use SQL queries to extract data from Delta Parquet files and storing them in dataframes.
This course will also equip you with the knowledge you need to optimize Spark performance through effective configuration and autotuning. You will learn that there is very little you have to do to fully optimize the Spark environment in which your notebooks run.
The course consists of over 30 lectures, complemented by hands-on practice exercises that will challenge your understanding of the concepts being taught and perhaps your research ability. By the end of this course, you will be a confident and proficient Spark notebook user, capable of harnessing its immense power to solve real-world data scenarios.
Here are just some of the compelling benefits you will gain from this course:
Master the fundamentals of Spark libraries, dataframes, and SQL
Learn to how to filter, aggregate, and join data using dataframes
Gain expertise in manipulating dates and times
Data Wrangling
Learn how to optimize Spark performance through configuration and autotuning
Solve real-world data scenarios with Spark notebooks
Enhance your resume with in-demand Fabric Spark notebook skills
Position yourself for success in the booming field of Big Data
If you are ready to take your data skills to the next level, then enroll in this course today and learn how you can use Apache Spark and notebooks, integrated in Microsoft Fabric, in your day-to-day work.