Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Microsoft Fabric - Apache Spark and Notebooks
Rating: 4.7 out of 5(71 ratings)
476 students

Microsoft Fabric - Apache Spark and Notebooks

Learn the Microsoft Fabric Spark environment, notebooks, dataframes, filtering, aggregating, joining dataframes and more
Created byRandy Minder
Last updated 10/2023
English

What you'll learn

  • Apache Spark and Notebook Integration into Microsoft Fabric
  • Overview of the Microsoft Fabric Notebooks Environment
  • Dataframe Basics
  • Common Operations with Dataframes
  • Filtering data using Dataframes
  • Aggregating data using Dataframes
  • Joining Dataframes
  • Spark Libraries
  • Working with Dates
  • SQL Statements and Spark
  • Spark Configuration and Tuning
  • Exercises and Questions for Practice

Course content

5 sections37 lectures5h 15m total length
  • Introduction2:59

    Introduction to the course

  • Important! You May Not Need to Purchase This Course!1:49

    Important. Please listen to this brief lecture. It may not be necessary to purchase this course.

  • Getting Started with Fabric3:08

    Before getting started

Requirements

  • Some working experience with Microsoft Fabric and Spark would be useful but not absolutely necessary

Description

This course takes you into the Microsoft Fabric experience of Apache Spark and Notebooks, where the power of distributed computing meets the elegance and fun of interactive coding in Python, R, Scala or SQL.

In this comprehensive course, you will be introduced to Spark's pyspark library, empowering you to tackle common data challenges with remarkable ease. You will wrangle data in pandas dataframes and learn the most common dataframe operations. In short, you will acquire the skills necessary to manipulate your data in many different ways.

As you move deeper into Spark, you will master the art of filtering data, joining dataframes, and manipulating dates and times with ease. You will also be exposed to using SQL in Spark, allowing you to use SQL queries to extract data from Delta Parquet files and storing them in dataframes.

This course will also equip you with the knowledge you need to optimize Spark performance through effective configuration and autotuning. You will learn that there is very little you have to do to fully optimize the Spark environment in which your notebooks run.

The course consists of over 30 lectures, complemented by hands-on practice exercises that will challenge your understanding of the concepts being taught and perhaps your research ability. By the end of this course, you will be a confident and proficient Spark notebook user, capable of harnessing its immense power to solve real-world data scenarios.

Here are just some of the compelling benefits you will gain from this course:


  • Master the fundamentals of Spark libraries, dataframes, and SQL

  • Learn to how to filter, aggregate, and join data using dataframes

  • Gain expertise in manipulating dates and times

  • Data Wrangling

  • Learn how to optimize Spark performance through configuration and autotuning

  • Solve real-world data scenarios with Spark notebooks

  • Enhance your resume with in-demand Fabric Spark notebook skills

  • Position yourself for success in the booming field of Big Data

If you are ready to take your data skills to the next level, then enroll in this course today and learn how you can use Apache Spark and notebooks, integrated in Microsoft Fabric, in your day-to-day work.

Who this course is for:

  • Anyone wanting to learn how Apache Spark and Notebooks are integrated into Microsoft Fabric