Big Data Science with Apache Hadoop, Pig and Mahout
3.2 (69 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
2,034 students enrolled
Learn to execute Big Data Science Projects and deliver results using Apache Hadoop, MapReduce, Pig, Hive and Mahout
Created by V2 Maestros, LLC
Last updated 1/2017
English
Current price: $10 Original price: $100 Discount: 90% off
30-Day Money-Back Guarantee
Includes:
  • 9.5 hours on-demand video
  • 2 Articles
  • 1 Supplemental Resource
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Appreciate what Data Science really is
  • Understand the Data Science Life Cycle
  • Learn to use Apache Hadoop, Map Reduce, Pig and Mahout for executing Data Science projects
  • Master the application of Analytics and Machine Learning techniques
Requirements
  • Linux experience
  • Java Programming experience preferred
  • SQL experience preferred
Description

"Data Science is the sexiest job of the 21st century: it has exciting work and incredible pay."

Learning Data Science, though, is not an easy task. The field spans Computer Science, Programming, Information Theory, Statistics and Artificial Intelligence. College and university courses in this field are expensive. Becoming a Data Scientist through self-study is challenging: it requires working through multiple books, websites, searches and exercises, and you may still end up feeling "not complete" at the end of it. So how do you acquire full-stack Data Science skills that will get you a job and give you the confidence to execute it?

Big Data Science with Hadoop addresses this problem. The course provides extensive, end-to-end coverage of all activities performed in a Data Science project. It teaches the application of the latest techniques in data acquisition, transformation and predictive analytics to solve real-world business problems. The goal of this course is to teach practice rather than theory. Rather than diving deep into formulas and derivations, it focuses on using existing libraries and tools to produce solutions. It also keeps things simple and easy to understand.

Through this course, we strive to equip you to become a developer who can execute full-fledged Data Science projects. By taking this course, you will:

  • Appreciate what Data Science really is
  • Understand the Data Science Life Cycle
  • Learn Apache Hadoop, Map Reduce, Pig, Mahout and Hive
  • Apply Hadoop technologies for executing Data Science Projects
  • Master the application of Analytics and Machine Learning techniques
Big Data and Data Science go hand in hand, and this is a great course to learn both!

Please note: This course covers Hadoop components only as required for Data Science. It does not provide exhaustive coverage.

Who is the target audience?
  • IT Professionals aspiring to be Data Scientists
  • Students who want to learn about the Data Science domain
  • Statisticians and Project Managers who want to expand their horizons into Data Science
Curriculum For This Course
52 Lectures 09:29:54
Introduction
3 Lectures 12:51
What is Data Science?
5 Lectures 52:48
Basic Elements of Data Science
11:51

The Dataset
10:44


Modeling and Predictions
09:31

Use Cases for Data Science
07:47
Data Science Life Cycle
3 Lectures 42:59
Stage 1 - Setup
11:46

Stage 2 - Data Engineering
11:57

Stage 3 - Analysis and Production
19:16
Statistics for Data Science
4 Lectures 52:53
Types of Data
07:29

Summary Statistics
16:10


Correlations
10:09
Data Engineering
4 Lectures 52:53
Data Acquisition
16:01

Data Cleansing
10:50

Data Transformations
11:09

Text Processing - TF-IDF
14:53
Apache Hadoop
8 Lectures 01:21:55
Hadoop Overview
10:06

Setting up the Cloudera VM
06:51

About HDFS
14:46

HDFS Usage Examples
06:01

Introduction to Map Reduce
17:24

A Map Reduce example in Java
16:46


Hadoop tools for Data Science
03:34
Apache Sqoop and Hive
3 Lectures 28:00
Sqoop Overview and examples
07:32

Hive Overview
14:18

Hive Examples
06:10
Apache Pig
6 Lectures 01:09:32
Apache Pig Overview
08:31

Pig Latin Basics
10:25

Pig Latin Operations
15:29

Data Engineering with Pig
07:55

Examples - Pig Latin Operations
14:57

Examples - Data Engineering with Pig
12:15
Machine Learning with Apache Mahout
13 Lectures 02:38:01
Types of Analytics
12:08

Types of Learning
17:16

Analyzing results and errors
13:46

Apache Mahout Overview
03:30

Decision Trees
10:42

Random Forests
10:31

Mahout example - Random Forests
14:40

Naive Bayes Classifier
19:21

Mahout example - Naive Bayes
11:14

K Means Clustering
11:53

Mahout example - K Means Clustering
10:17

Recommendation Systems
11:55

Mahout example - User Based Recommender
10:48
Case Studies
1 Lecture 15:56
Use Case: Predicting Heart Disease
15:56
1 More Section
About the Instructor
V2 Maestros, LLC
4.2 Average rating
2,126 Reviews
22,230 Students
13 Courses
Big Data Science / Analytics Experts | 10K+ students

V2 Maestros is dedicated to teaching big data and data science to the world at affordable costs. Our instructors have real-world experience practicing big data and data science and delivering business results. Big Data Science is a hot and happening field in the IT industry. Unfortunately, resources for learning this skill are hard to find and expensive. We hope to ease this problem by providing quality education at affordable rates, thereby building data science talent across the world.