Hadoop Developer In Real World
4.6 (449 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
1,188 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Hadoop Developer In Real World to your Wishlist.

Add to Wishlist

Hadoop Developer In Real World

Free Cluster Access * HDFS * MapReduce * YARN * Pig * Hive * Flume * Sqoop * AWS * EMR * Optimization * Troubleshooting
4.6 (449 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
1,188 students enrolled
Last updated 9/2016
English
English
Price: $200
30-Day Money-Back Guarantee
Includes:
  • 15.5 hours on-demand video
  • 4 Articles
  • 40 Supplemental Resources
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Understand what is Big Data, the challenges with Big Data and how Hadoop propose a solution for the Big Data problem
  • Work and navigate Hadoop cluster with ease
  • Install and configure a Hadoop cluster on cloud services like Amazon Web Services (AWS)
  • Understand the difference phases of MapReduce in detail
  • Write optimized Pig Latin instruction to perform complex data analysis
  • Write optimized Hive queries to perform data analysis on simple and nested datasets
  • Work with file formats like SequenceFile, AVRO etc
  • Understand Hadoop architecture, Single Point Of Failures (SPOF), Secondary/Checkpoint/Backup nodes, HA configuration and YARN
  • Tune and optimize slowing running MapReduce jobs, Pig instructions and Hive queries
  • Understand how Joins work behind the scenes and will be able to write optimized join statements
  • Wherever possible, students will be introduced to difficult questions that are asked in real Hadoop interviews
View Curriculum
Requirements
  • Although you don't have to be an expert in Java, basic knowledge in Java programming is required as we will be looking at programs in Java.
  • Basic Linux commands
Description

From the creators of the successful Hadoop Starter Kit course hosted in Udemy, comes Hadoop In Real World course. This course is designed for anyone who aspire a career as a Hadoop developer. In this course we have covered all the concepts that every aspiring Hadoop developer must know to SURVIVE in REAL WORLD Hadoop environments.

The course covers all the must know topics like HDFS, MapReduce, YARN, Apache Pig and Hive etc. and we go deep in exploring the concepts. We just don’t stop with the easy concepts, we take it a step further and cover important and complex topics like file formats, custom Writables, input/output formats, troubleshooting, optimizations etc.

All concepts are backed by interesting hands-on projects like analyzing million song dataset to find less familiar artists with hot songs, ranking pages with page dumps from wikipedia, simulating mutual friends functionality in Facebook just to name a few.

Who is the target audience?
  • This course is for anyone who aspire a career as a Hadoop Developer
  • This course is for anyone who want to learn and understand in depth about Hadoop and Big Data
Students Who Viewed This Course Also Viewed
Curriculum For This Course
76 Lectures
15:36:56
+
Thank You and Let's Get Started
3 Lectures 27:54

Tools & Setup (Windows)
09:09

Tools & Setup (Linux)
07:42
+
Introduction To Big Data
3 Lectures 35:57
What is Big Data?
17:47


History of Hadoop
03:46

Test your understanding of Big Data
6 questions
+
HDFS
6 Lectures 56:04

Blocks
07:50

Working With HDFS
16:09

HDFS - Read & Write
09:31

HDFS - Read & Write (Program)
08:38

Test your understanding of HDFS
5 questions

HDFS Assignment
00:36
+
MapReduce
9 Lectures 01:42:42


Dissecting MapReduce Program (Part 1)
12:00

Dissecting MapReduce Program (Part 2)
16:09

Combiner
06:20

Counters
06:43

Facebook - Mutual Friends
17:38

New York Times - Time Machine
15:43

Test your understanding of MapReduce
12 questions

MapReduce Assignment
01:15
+
Apache Pig
11 Lectures 02:28:36
Introduction to Apache Pig
12:52

Loading & Projecting Datasets
13:41

Solving a Problem
13:32

Complex Types
21:12

Pig Latin - Joins
19:53

Million Song Dataset (Part 1)
10:29

Million Song Dataset (Part 2)
15:01

Page Ranking (Part 1)
08:11

Page Ranking (Part 2)
19:26

Page Ranking (Part 3)
12:17

Test your understanding of Apache Pig
13 questions

Apache Pig Assignment
02:02
+
Apache Hive
12 Lectures 01:50:25
Introduction to Apache Hive
09:58

Dissect a Hive Table
10:14

Loading Hive Tables
11:17

Simple Selects
06:07

Managed Table vs. External Table
06:20

Order By vs. Sort By vs. Cluster By
09:44

Partitions
19:31

Buckets
07:27

Hive QL - Joins
09:21

Twitter (Part 1)
09:33

Twitter (Part 2)
08:43

Test your understanding of Apache Hive
18 questions

Apache Hive Assignment
02:10
+
Architechture
5 Lectures 55:09
HDFS Architechture
12:46

Secondary Namenode
11:24

Highly Available Hadoop
08:48

MRv1 Architechture
10:49

YARN
11:22

Test your understanding of Hadoop Architechture
10 questions
+
Cluster Setup
5 Lectures 01:29:40
Vendors & Hosting
06:35

Cluster Setup (Part 1)
23:43

Cluster Setup (Part 2)
25:35

Cluster Setup (Part 3)
18:01

With Amazon EMR we can start a brand new Hadoop cluster and run MapReduce jobs in matter of minutes. This lecture will walk through step by step how to set up a Hadoop cluster and run MapReduce jobs in it.

Amazon EMR
15:46

Test your understanding of Cluster Setup
7 questions
+
Hadoop Administrator In Real World (Upcoming Course)
2 Lectures 37:15

In this lecture we will learn about the benefits of Cloudera Manager, differences between Packages and Parcels and lifecycle of Parcels.

Cloudera Manager - Introduction
13:08

In this lecture we will see how to install a 3 node Hadoop cluster on AWS using Cloudera Manager

Cloudera Manager - Installation
24:07
+
File Formats
5 Lectures 01:21:46
Compression
14:55

Sequence File
18:32

AVRO
19:08

File Formats - Pig
18:08

File Formats - Hive
11:03

Test your understanding of File Formats
10 questions
4 More Sections
About the Instructor
Hadoop In Real World
4.5 Average rating
6,173 Reviews
55,774 Students
3 Courses
Expert Big Data Consultants

We are a group of Senior Hadoop Consultants who are passionate about Hadoop and Big Data technologies. We have experience across several key domains from finance and retail to social media and gaming. We have worked with Hadoop clusters ranging from 50 all the way to 800 nodes.

We have been teaching Hadoop for several years now. Check out our FREE and successful Hadoop Starter Kit course at Udemy.