Build Big Data Pipelines w/ Hadoop, Flume, Pig, MongoDB
4.2 (120 ratings)
Course Ratings are calculated from individual students’ ratings and a variety of other signals, like age of rating and reliability, to ensure that they reflect course quality fairly and accurately.
1,349 students enrolled

Build Big Data Pipelines w/ Hadoop, Flume, Pig, MongoDB

Learn how to combine Hadoop, MongoDB, Pig, Sqoop and Flume to Architect and Build Big Data Pipelines and Data lakes.
4.2 (120 ratings)
Course Ratings are calculated from individual students’ ratings and a variety of other signals, like age of rating and reliability, to ensure that they reflect course quality fairly and accurately.
1,349 students enrolled
Created by V2 Maestros, LLC
Last updated 1/2018
English
English [Auto-generated]
Current price: $139.99 Original price: $199.99 Discount: 30% off
5 hours left at this price!
30-Day Money-Back Guarantee
This course includes
  • 3.5 hours on-demand video
  • 8 articles
  • 1 downloadable resource
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
Training 5 or more people?

Get your team access to 4,000+ top Udemy courses anytime, anywhere.

Try Udemy for Business
What you'll learn
  • Learn how to stitch individual Big Data Technologies together to solve business problems
  • Build and deploy Big Data Pipelines
Requirements
  • Hadoop
  • Linux / CDH 5
  • Java Programming preferred
Description

How do you build end-to-end Big Data Pipelines using multiple Big Data Technologies? You have seen courses and books that teach you individual technologies, but how do you combine and apply them to solve your business problems? This course teach you exact that !

Building Big Data Solutions require you to acquire data from multiple sources, transport them, process them and store them in Big Data repositories. You have to do that with scalability and reliability. Big Data Technologies like Hadoop, Sqoop, Pig, Flume etc. solve individual problems, but building an end-to-end solution requires stitching them together. This course teaches you how to do that. You solve complete business problems by building end-to-end pipelines in this course.

Who this course is for:
  • Big Data Engineers
  • Big Data Architects
Course content
Expand all 46 lectures 03:22:21
+ Introduction
5 lectures 12:15
Your Learning Process - Learn, Train, Practice, Apply
03:37
Setting up CDH Quickstart VM
00:14
Resource Bundle - Code Examples and Practice Exercises
00:01
+ Building Big Data Pipelines
4 lectures 09:41
Platform vs Application for Big Data
01:58
Big Data Pipelines - Recommended Strategy
02:45
Introduction to Use Cases
02:10
+ Apache Sqoop - an Introduction
5 lectures 18:01
What is Apache Sqoop?
05:01
Sqoop Command Line Overview
03:37
<train/> Simple Import Command
02:53
<train/> Setting up a job and Executing
06:19
@Practice() : Sqoop Practice Exercise
00:11
+ Apache Flume : An introduction
8 lectures 25:57
What is Apache Flume?
02:49
Installing and Running Flume
01:37
Chaining Flume Agents
02:33
<train/> Use Case 1 : Netcat to Console
05:57
<train/> Use Case 2 : Spool Directory to Destination Directory
03:24
<train/> Use Case 3 : Chaining with Directory, Avro and HDFS
04:37
@Practice() : Flume Practice Exercise
00:08
+ Apache Pig - An introduction
7 lectures 01:09:43
What is Apache Pig?
08:31
Pig Latin Basics
10:25
Pig Operations
15:29
Data Engineering with Pig
07:55
<train/> Pig Data Flows
14:57
<train/> Data Engineering with Pig
12:15
@Practice() : Pig Practice Exercises
00:11
+ Mongo DB - An introduction
7 lectures 21:08
What is Mongo DB?
03:30
Mongo DB Organization
02:52
Installation
01:05
<train/> Inserting Data
04:22
<train/> Querying Data
04:39
<train/> Updating and Deleting Data
04:20
@Practice() : Mongo DB Practice Exercise
00:20
+ Pipeline Use Case 1 : RDBMS to Big Data
2 lectures 10:43
Use Case 1 - Problem and Solution Overview
02:59
<train/> Use Case Solution - Building the Pipeline
07:44
+ Pipeline Use Case 2 : Web Logs to Mongo DB
2 lectures 09:23
Use Case 2 - Problem and Solution Overview
02:54
<train/> Use Case Solution - Building the Pipeline
06:29
+ Pipeline Use Case 3 : Multiple Sources to Monthly Summary
2 lectures 19:52
Use Case 3 - Problem and Solution Overview
04:14
<train/> Use Case Solution - Building the Pipeline
15:38
+ APPLY Project : Building multi-source pipelines
2 lectures 04:54
Apply Project - Problem Statement
00:57
Apply Project : Solution Review
03:57