PySpark - Build DataFrames with Python, Apache Spark and SQL
What you'll learn
- Create big data streaming pipelines with Spark using Python
- Learn how to apply Spark on distributed Dataframes
- Use Python with Big Data on a distributed framework (Apache Spark)
Requirements
- No necessary experience needed
Description
This course covers all the fundamentals about Apache Spark streaming with Python and teaches you everything you need to know about developing Spark streaming applications using PySpark, the Python API for Spark. At the end of this course, you will gain in-depth knowledge about Spark streaming and general big data manipulation skills to help your company to adapt Spark Streaming for building big data processing pipelines and data analytics applications. This course will be absolutely critical to anyone trying to make it in data science today.
Spark can perform up to 100x faster than Hadoop MapReduce, which has caused an explosion in demand for this skill! Because the Spark 2.0 DataFrame framework is so new, you now have the ability to quickly become one of the most knowledgeable people in the job market!
This course will teach the basics with a crash course in Python, continuing on to learning how to use Spark DataFrames with the latest Spark 2.0 syntax! Once we've done that we'll go through how to use the MLlib Machine Library with the DataFrame syntax and Spark. All along the way, you'll have exercises and Mock Consulting Projects that put you right into a real-world situation where you need to use your new skills to solve a real problem!
We also cover the latest Spark Technologies, like Spark SQL, Spark Streaming, and advanced models like Gradient Boosted Trees! After you complete this course you will feel comfortable putting Spark and PySpark on your resume! This course also has a full 30-day money-back guarantee and comes with a LinkedIn Certificate of Completion!
If you're ready to jump into the world of Python, Spark, and Big Data, this is the course for you!
Who this course is for:
- Someone who knows Python and would like to learn how to use it for Big Data
- Someone who is very familiar with another programming language and needs to learn Spark
- Developers transferring from other languages
- Python Developers looking to get better at Data Streaming
- Spark Developers eager to expand their skills.
Instructors
Mammoth Interactive is a leading online course provider in everything from learning to code to becoming a YouTube star. Mammoth Interactive courses have been featured on Harvard’s edX, Business Insider and more.
Over 11 years, Mammoth Interactive has built a global student community with 3.3 million courses sold. Mammoth Interactive has released over 300 courses and 3,000 hours of video content.
Founder and CEO John Bura has been programming since 1997 and teaching since 2002. John has created top-selling applications for iOS, Xbox and more. John also runs SaaS company Devonian Apps, building efficiency-minded software for technology workers like you.
"I absolutely love this course. This is such a comprehensive course that was well worth the money I spent and a lot more. Will definitely be looking at more Mammoth Interactive courses when I finish this." – Student Matt W.
"Very good at explaining the basics then building to more complex features." – Student Kevin L.
Try a course today.
John Bura is has been a successful Udemy instructor since 2011. John Bura has been programming games since 1997 and teaching since 2002. John is the owner of the game development studio Mammoth Interactive. This company produces XBOX 360, iPhone, iPad, android, HTML 5, ad-games and more. Mammoth Interactive recently sold a game to Nickelodeon! John has been contracted by many different companies to provide game design, audio, programming, level design and project management. To this day John has 40 commercial games that he has contributed to. Several of the games he has produced have risen to the top 10 in the Apple's app store. In his spare time John likes to play ultimate Frisbee, cycle and work out.