Big Data with Apache Spark and AWS
3.3 (2 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
32 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Big Data with Apache Spark and AWS to your Wishlist.

Add to Wishlist

Big Data with Apache Spark and AWS

Learn the latest Big Data technology - Build, and execute real-world Big Data solutions using Spark and AWS.
New
3.3 (2 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
32 students enrolled
Created by Skillbox, LLC
Last updated 8/2017
English
Current price: $12 Original price: $195 Discount: 94% off
3 days left at this price!
30-Day Money-Back Guarantee
Includes:
  • 2.5 hours on-demand video
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion

Training 5 or more people?

Get your team access to Udemy's top 2,000 courses anytime, anywhere.

Try Udemy for Business
What Will I Learn?
  • Start a project using Apache Spark
  • Understand how Spark SQL lets you work with structured data
  • Install and run Apache Spark on a desktop computer or on a cluster
  • Gain hands-on experience setting up Spark clusters on AWS cloud services platform
  • Understand how to control a cloud instance on AWS using SSH or PuTTY
  • Understand how to access data from the CSV, Json, HDFS, and S3 formats
View Curriculum
Requirements
  • A PC or Mac
  • Basic understanding and functional knowledge of Apache Spark and big data
Description

Welcome to this course: Big Data with Apache Spark and AWS.

Every year we have a big increment of data that we need to store and analyze. AWS is a web service used to process and store vast amount of data, and it is one of the largest Hadoop operators in the world. We will teach you how to create Spark clusters on the Amazon Web Services (AWS) platform; With the increase in the amount of data generated and collected by many businesses and the arrival of cost-effective cloud-based solutions for distributed cloud computing, the feasibility to crunch large amounts of data to get deep insights within a short span of time has increased greatly.

This course will get you started with AWS so that you can quickly create your own account and explore the services provided, many of which you might be delighted to use. You'll learn to perform cluster based data modeling using Gaussian generalized linear models, binomial generalized linear models, Naive Bayes, and K-means modeling; access data from S3 Spark DataFrames and other formats like CSV, Json, and HDFS; and do cluster based data manipulation operations with tools like SparkR and SparkSQL.

By the end of this course, you will have a thorough understanding of Spark and AWS, and you will be able to perform full-stack data analytics with a feel that no amount of data is too big.

Who is the target audience?
  • Software Engineer
  • Application developers
  • Data scientists
  • Big data architects
Compare to Other Apache Spark Courses
Curriculum For This Course
18 Lectures
02:17:18
+
Welcome
1 Lecture 04:12
+
Creating Clusters
6 Lectures 44:28
Creating an AWS Instance
09:39

Connecting to AWS Instance with SSH
06:18

Connecting to AWS Instance with PuTTY
08:37

Spark Clusters
09:01

Spark Clusters in depth
09:55

Learn How to Terminate Your Clusters
00:58
+
Data and Modeling Basics
4 Lectures 38:39
Data Basics
08:33

Modeling with Gaussian Generalized Linear Models
11:19


Naive Bayes and K-Means Modeling
09:14
+
Data Sources and Data Manipulation
4 Lectures 28:41
Bigger Data and AWS S3
07:27

Accessing S3 Spark Dataframes
04:57

SparkR Dataframe Operations
11:01

Intro to SparkSQL
05:16
+
Various
2 Lectures 19:18
Intro to HDFS
10:59

Databricks Community Edition
08:19
+
Course Summary
1 Lecture 02:00
Summary
02:00
About the Instructor
Skillbox, LLC
4.2 Average rating
313 Reviews
8,434 Students
4 Courses
High Quality Courses from Expert Instructors

Skillbox, LLC specializes in technical training via on-demand streaming. We are committed to providing students professional development, networking, and learning opportunities through our educational content. 

Our instructors are industry leading experts in the subjects they teach, because they continue to spend time working on real-world industry applications. They are expert trainers – able to communicate complex concepts clearly, understandably and with the enthusiasm that will inspire you to learn.

Today Skilbox, LLC is the world's most trusted provider of mentioned services and training along with web security aspects, and open source technology.