Big Data with Apache Spark PySpark: Hands on PySpark, Python
What you'll learn
- Basic overview of Spark technology
- End to end Installation of Apache spark in Windows machine
- End to end Installation of Apache spark in Linux machine
- Setup Apache Spark Cluster on Microsoft azure HDInsight
- Learn Spark SQL
- Learn Spark DataFrame API
- Spark Structured Streaming
Requirements
- Experience with Programming
Description
Welcome to the Apache Spark : PySpark Course.
Have you ever thought about How big company like Google, Microsoft, Facebook, Apple or Amazon Process Petabytes of data on thousands of machine.
This course starting point to learn about in memory big data analysis tool Apache Spark.
==============================================
What previous students have said:
"Very good introduction. Ideal for beginners to obtain a big picture as a starting point. The course should be further developed and supplemented with further practical examples. But overall I would highly recommend."
"I like the pace at which the instructor is going. I like the fact that he quickly dives into the practical. For me, this helps to put subsequent learning into perspective. He tends to have quite a few typos, but I can overlook those and still give him a 5 star rating. I am still quite early in the. Hope to update my review as I go along."
"Great course, knowledgeable author."
"Curso excelente para quem deseja aprender sobre Big Data e Spache Spark com PySpark."
==================================================
Apache Spark can perform up to 100x faster than Hadoop MapReduce Data processing framework, Which makes apache spark one of most demanded skills.
The top companies like Google, Facebook, Microsoft, Amazon, Airbnb using Apache Spark to solve their big data problems!. Data analysis, on huge amount of data is one of the most valuable skills now a days and This course will teach such kind of skills to complete in big data job market.
This course will teach
Introduction to big data and Apache spark
Getting started with databricks
Detailed installation step on ubuntu - linux machine
Python Refresh for newbie
Apache spark Dataframe API
Apache spark structured streaming with end to end example
Basics of Machine Learning and feature engineering with Apache spark.
This course is not complete, will be adding new content related to Spark ML.
Note : This course will teach only Spark 2.0 Dataframe based API only not RDD based API. As Dataframe based API is the future of spark.
Regards
Ankit Mistry
Who this course is for:
- Anyone who wants to learn advance big data skill
- Anyone who knows Hadoop and wants to move ahead in faster data processing
- Anyone wants to make career as data Engineer, Data analyst, Machine Learning Engineer
- Interested in learning Apache spark and pyspark for big data analysis
- Anyone wants learn cutting edge technology in Data processing
Course content
- Preview01:04
- 00:22Course FAQ
- Preview10:18
- Preview03:32
- 04:37Time Line of Big data and Hadoop based Eco-Systems
- 07:03What is Apache Spark
- 03:55Spark API Overview
Instructor
I am Ankit Mistry, completed my master from IIT Kharagpur in area of machine learning, Artificial intelligence.Now working as Software Developer, Big Data Engineer in one of leading private investment bank with 8+ years of experience in software industry.
Over the time I developed interest related to data discipline and learned about data analysis, machine learning model development.
Created course in area of Python, Data Science, Data analysis, Machine Learning.
I am so excited to be on Udemy online learning platform and want to make big impact on your software career.
I hope you will like my course offering.