Apache Spark 2.0 + Python : DO Big Data Analytics & ML

Project Based, Hands-on Practices, Spark SQL, Spark Streaming, Real life Full cycle Project
4.4 (31 ratings) Instead of using a simple lifetime average, Udemy calculates a
course's star rating by considering a number of different factors
such as the number of ratings, the age of ratings, and the
likelihood of fraudulent ratings.
440 students enrolled
$19
$80
76% off
Take This Course
  • Lectures 58
  • Length 7.5 hours
  • Skill Level Beginner Level
  • Languages English
  • Includes Lifetime access
    30 day money back guarantee!
    Available on iOS and Android
    Certificate of Completion
Wishlisted Wishlist

How taking a course works

Discover

Find online courses made by experts from around the world.

Learn

Take your courses with you and learn anywhere, anytime.

Master

Learn and practice real-world skills and achieve your goals.

About This Course

Published 8/2016 English

Course Description

Welcome to our course. Looking to learn Apache Spark 2.0, practice end-to-end projects and take it to a job interview? You have come to the RIGHT course! This course teaches you Apache Spark 2.0 with Python, trains you in building Spark Analytics and machine learning programs and helps you practice hands-on with an end-to-end real life application project. Our goal is to help you and everyone learn, so we keep our prices low and affordable.

Apache Spark is the hottest Big Data skill today. More and more organizations are adapting Apache Spark for building their big data processing and analytics applications and the demand for Apache Spark professionals is sky rocketing. Learning Apache Spark is a great vehicle to good jobs, better quality of work and the best remuneration packages.

The goal of this project is provide hands-on training that applies directly to real world Big Data projects. It uses the learn-train-practice-apply methodology where you

  • Learn solid fundamentals of the domain
  • See demos, train and execute solid examples
  • Practice hands-on and validate it with solutions provided
  • Apply knowledge you acquired in an end-to-end real life project

Taught by an expert in the field, you will also get prompt response to your queries and excellent support from Udemy.

What are the requirements?

  • Python programming
  • Have a laptop/desktop to setup Spark

What am I going to get from this course?

  • Acquire Knowledge of Apache Spark 2.0 fundamentals and architecture
  • Write Spark 2.0 scripts for Transformations, actions, Spark SQL and Spark Streaming
  • Execute Machine Learning / Data Science algorithms
  • Solve real world data problems with Apache Spark 2.0
  • Handle interviews for Apache Spark 2.0 confidently and get jobs

What is the target audience?

  • Software Professionals
  • Big Data Architects
  • Data Engineers

What you get with this course?

Not for you? No problem.
30 day money back guarantee.

Forever yours.
Lifetime access.

Learn on the go.
Desktop, iOS and Android.

Get rewarded.
Certificate of completion.

Curriculum

Section 1: Kick-start your learning
02:54

About the mentor and setting expectations.

03:37

Introduction to the Learn-Train-Practice-Apply methodology

Your Course Guide - Pathway to success
Preview
06:59
Resource Bundle - Code and Data
Article
Section 2: Introduction to Apache Spark
07:58

Introduction to Apache Spark

11:29

Download Apache Spark and setup your Python environment to use Spark

05:36

Run your first Spark program - get your hands dirty !

06:11

Overview of Spark and its various modules and libraries

05:15

Overview of Resilient Distributed Data Sets (RDD)

11:09

Spark cluster architecture and scalability

04:01

Various steps/stages in a Spark project and how things happen.

Spark Architecture
5 questions
Section 3: Spark Programming with Python
08:41

How you load external data into Spark and how data gets saved.

<train/> Loading and Storing Data
07:46
@Practice() Loading and Storing Data
Article
Transformations - Change how data looks
09:41
<train/> Transformations
14:33
@Practice() Transformations
Article
Actions - Extract insights from Data
08:39
<train/> Actions
07:30
@Practice() Actions
Article
Key-Value RDDs
04:16
<train/> Key-Value RDDs
07:10
@Practice() Key-Value RDDs
Article
10:42

Broadcast variables, accumulators, partitioning and persistence

<train/> Advanced Spark - Enhanced Capabilities
06:37
@Practice() Advanced Spark
Article
Section 4: Spark SQL
Spark SQL Data Frames - the new era
07:53
<train/> SQL Data Frames
11:18
@Practice() SQL Data Frames
Article
Temp Tables / Views - Easy querying
02:42
<train/> Temp Tables / Views
05:04
@Practice() Temp Tables/ Views
Article
Section 5: Spark Streaming
Spark Streaming - real time data processing
05:14
Spark Streaming Architecture - how it works.
06:44
<train/> Spark Streaming
10:35
Section 6: Machine Learning with Spark
Types of Analytics - simple to predictive
12:08
Types of Machine Learning
17:16
Analyzing results and Errors
13:46
Spark ML Concepts - new data types
08:09
Linear Regression - fit to a line
19:00
<train/> Linear Regression Use Case
17:15
Decision Trees Classification
10:42
<train/> Decision Trees Use Case
12:20
Principal Component Analysis
07:28
Random Forest Classification
10:31
<train/> Random Forests and PCA Use Case
14:00
Text Pre-processing with TF-IDF
14:53
Naive Bayes Classification
19:21
<train/> Naive Bayes and Text Pre-processing Use Case
07:32
K-Means Clustering - grouping similar items
11:53
<train/> K-Means Clustering Use Case
09:22
Recommendation Engines
11:55
<train/> Recommendation Engines Use Case
05:03
Section 7: APPLY : Your Course Challenge Project
Real world problem Statement - Credit Card defaulters
Article
Hints to help you with the project
Article
12:30

The final solution is available in the resource bundle ( APPLY Project *.py)

Section 8: Conclusion
Closing Remarks
01:15
BONUS Lecture - Your next steps & Discount coupons
Article

Students Who Viewed This Course Also Viewed

  • Loading
  • Loading
  • Loading

Instructor Biography

V2 Maestros, Big Data Science / Analytics Experts | 10K+ students

V2 Maestros is dedicated to teaching big data / data science at affordable costs to the world. Our instructors have real world experience practicing big data and data science and delivering business results. Big Data Science is a hot and happening field in the IT industry. Unfortunately, the resources available for learning this skill are hard to find and expensive. We hope to ease this problem by providing quality education at affordable rates, there by building data science talent across the world.

Ready to start learning?
Take This Course