Apache Spark and Scala
3.9 (68 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
401 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Apache Spark and Scala to your Wishlist.

Add to Wishlist

Apache Spark and Scala

A complete Guide for Processing Big Data with Spark
3.9 (68 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
401 students enrolled
Last updated 4/2017
English
Current price: $10 Original price: $200 Discount: 95% off
4 days left at this price!
30-Day Money-Back Guarantee
Includes:
  • 7.5 hours on-demand video
  • 1 Article
  • 39 Supplemental Resources
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Understand the limitations of Hadoop mapreduce and how Spark overcomes these limitations
  • Gain expertise in Scala programming language and its characteristics
  • Able to work with RDDs' and create applications in Spark
  • A thorough understanding about Spark SQL by using SQL queries in Spark
View Curriculum
Requirements
  • Prior knowledge of Apache Hadoop will be an added advantage, but not compulsory
  • Fundamental understanding of any programming language
Description

This course on Apache Spark and Scala aims at providing an advanced expertise in big data Hadoop ecosystem. This course will provide a standard skillset which helps one become a specialist on the top of Big data Hadoop developer. 

The course starts with a detailed description on limitations of mapreduce and how Spark can help overcome them. Further it covers a deeper dive into the Scala programming language.

Moving on it covers Spark as a standalone cluster and an understanding of Resiliient Distributed Datasets.

The course also covers concepts of Spark SQL using SQL queries through SQL context and Hive Queries through Hive context.

This course certainly provides material required for building a career path from Big data Hadoop developer to BIg data Hadoop architect.


Who is the target audience?
  • Students who aspire to gain a deep understanding of Apache Spark
  • Professionals looking for a career in real time big data analytics
Students Who Viewed This Course Also Viewed
Curriculum For This Course
67 Lectures
07:39:50
+
Module-1 Introduction to Big data, Hadoop and Spark
12 Lectures 43:35






1.7 Users and Use Cases of Apache Spark
07:45

1.8 Job Execution Flow and Spark Execution
01:12

1.9 Spark Unified Stack
01:08

1.10 Complete Picture of Apache Spark
06:37

1.11 Why Spark with Scala
02:12

1.12 Apache spark Architecture
02:16
+
Module 2: Introduction to Scala Programming Language
5 Lectures 37:32
2.1 Introduction to Scala
10:15

2.2 Scala Basic Syntax
05:11

2.3 Scala Class and Objects
04:03

2.4 If else Statements in Scala
08:31

2.5 Loops in Scala
09:32
+
Module 3: Advanced Scala Programming
9 Lectures 01:09:57
3.1 Functions and Procedures in Scala
08:19

3.2 Access Modifiers
06:25

3.3 Strings and Arrays
10:45

3.4 Scala Collections
14:29

3.5 Scala Traits
03:58

3.6 Pattern Matching
07:25

3.7 Scala Extractors
05:41

3.8 Scala Exception Handling
03:29

3.9 Scala Files IO
09:26
+
Module 4: Apache Spark RDDs
5 Lectures 12:16
4.1 Programming with RDDs
02:33

4.2 Starting with Spark
02:16

4.3 Creating RDDs
02:10

4.4 RDD Operations
03:36

4.5 Lifecycle of Spark
01:41
+
Module 5: Apache Spark RDDs II
4 Lectures 33:26
5.1 Spark Caching
03:12

5.2 Common Transformations and Actions
06:03

5.3 Spark Functions
13:56

5.4 Some more Spark functions
10:15
+
Module 6: Working with Key-Value pairs
5 Lectures 01:02:42
6.1 Key Value Pairs
05:24

6.2 Aggregate Functions
10:55

6.3 Working with Aggregate Functions
20:01

6.4 Joins in Spark
18:44

6.5 Practical on Word count example
07:38
+
Module 7: Advanced Spark Programming
5 Lectures 41:37
7.1 Spark Shared Variables
13:47

7.2 Spark and Fault Tolerance
01:53

7.3 Broadcast variables
11:16

7.4 Numeric RDD Operations
03:18

7.5 Per-Partition Operations
11:23
+
Module 8: Running Spark jobs on Cluster
5 Lectures 16:58
8.1 Spark Runtime Architecture
02:45

8.2 Spark Driver
03:41

8.3 Executors
01:07

8.4 Cluster Managers
06:11

8.5 Cluster Managers II
03:14
+
Module 9: Spark SQL
4 Lectures 33:29
9.1 Introduction to Spark SQL
04:59

9.2 Starting Point-SQL Context
07:42

9.3 Hive with Spark SQL
10:20

9.4 Spark SQL Caching
10:28
+
Module 10: Spark Streaming
1 Lecture 00:01
People.json, Employee.json
00:01
2 More Sections
About the Instructor
Digitorious Technologies
3.9 Average rating
193 Reviews
1,732 Students
10 Courses
Make Learning Smarter

Digitorious technologies is a leading publisher of development courses which provide in-depth knowledge and high quality training. Digitorious technologies is serving with a mission of providing right direction to people who are looking for a career in IT/software industry. Digitorious is the best place for learning new technologies and making things easy to understand virtually.