Apache Spark Core and Structured Streaming 3.0 In-Depth
What you'll learn
- Strong focus on practicality through hands-on work with plenty of examples
- Develop an in-depth understanding of the underlying concepts at the core of Apache Spark
- Know the ways to get the best performance from Spark in production
- Avoid the common pitfalls when writing Spark applications
- In-depth exploration of Spark Structured Streaming 3.0 using the Python API
- Get a high-level introduction to Apache Kafka along the way
- Understand the nuances of Stream Processing in Apache Spark
- Discover various features Spark provides out of the box for Stream Processing
Requirements
- We'll be using the Python API for Spark programming. All programs are explained in detail, but a fundamental knowledge of Python will be beneficial.
Description
Apache Spark has turned out to be the most sought-after skill for any big data engineer. An evolution of the MapReduce programming paradigm, Spark provides unified data processing, from writing SQL to performing graph processing to implementing machine learning algorithms. It uses cluster nodes effectively and manages memory better, spreading the load across a cluster of nodes to deliver faster results. Apache Spark drives the mission of data-driven decision-making in thousands of organizations.
In order to fully appreciate the benefits of Apache Spark's libraries, it is essential to get the foundations right. This course aims at exactly that. It starts at the beginner level and gradually explains all the complex concepts in an easy-to-follow manner. It gives a thorough description of the features and inner workings of the framework through five different use cases with detailed hands-on implementations. In fact, some hands-on sessions and use-case solutions are explained in full classroom mode, with videos extending over 40 minutes. After taking this course, you will have gained expertise in Spark Core, and further libraries such as Spark SQL, Structured Streaming, Spark ML and GraphX will be much easier to visualize, implement and optimize.
This illustrative course builds your foundational knowledge. You will learn the differences between batch and stream processing, the programming model, the APIs, and the challenges specific to stream processing. We'll then move on to the concepts of stream processing with a wide variety of examples and hands-on exercises, examining the inner workings and tackling a use case towards the end. All of this activity will be on the cloud using Spark 3.0.
Who this course is for:
- Data engineers and developers who wish to leverage fast analytics with Apache Spark in production environments.
Instructor
With over a decade of industry experience in distributed computation, Amit has been involved in a variety of big data engineering projects across different domains. He is a certified Hadoop Developer with skills in architecting, designing, developing, programming, administering and instructing.
Currently working as a Senior Data Engineer, he deals with data, its processing challenges and optimizations on a daily basis. In the past he has worked with multiple companies, including one of the biggest e-commerce companies, the largest on-demand cab provider, the biggest social networking product and the largest on-demand music and video provider. He has been a mentor to hundreds of professionals, and his videos have helped hundreds of thousands of learners.