Build Big Data Pipelines w/ Hadoop, Flume, Pig, MongoDB
3.8 (38 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
482 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Build Big Data Pipelines w/ Hadoop, Flume, Pig, MongoDB to your Wishlist.

Add to Wishlist

Build Big Data Pipelines w/ Hadoop, Flume, Pig, MongoDB

Learn how to combine Hadoop, MongoDB, Pig, Sqoop and Flume to Architect and Build Big Data Pipelines and Data lakes.
3.8 (38 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
482 students enrolled
Created by V2 Maestros, LLC
Last updated 1/2017
English
Current price: $10 Original price: $200 Discount: 95% off
5 hours left at this price!
30-Day Money-Back Guarantee
Includes:
  • 3.5 hours on-demand video
  • 8 Articles
  • 1 Supplemental Resource
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Learn how to stitch individual Big Data Technologies together to solve business problems
  • Build and deploy Big Data Pipelines
View Curriculum
Requirements
  • Hadoop
  • Linux / CDH 5
  • Java Programming preferred
Description

How do you build end-to-end Big Data Pipelines using multiple Big Data Technologies? You have seen courses and books that teach you individual technologies, but how do you combine and apply them to solve your business problems? This course teach you exact that !

Building Big Data Solutions require you to acquire data from multiple sources, transport them, process them and store them in Big Data repositories. You have to do that with scalability and reliability. Big Data Technologies like Hadoop, Sqoop, Pig, Flume etc. solve individual problems, but building an end-to-end solution requires stitching them together. This course teaches you how to do that. You solve complete business problems by building end-to-end pipelines in this course.

Who is the target audience?
  • Big Data Engineers
  • Big Data Architects
Compare to Other Big Data Courses
Curriculum For This Course
46 Lectures
03:22:21
+
Introduction
5 Lectures 12:17

Your Learning Process - Learn, Train, Practice, Apply
03:37


Setting up CDH Quickstart VM
00:16

Resource Bundle - Code Examples and Practice Exercises
00:01
+
Building Big Data Pipelines
4 Lectures 09:41

Platform vs Application for Big Data
01:58

Big Data Pipelines - Recommended Strategy
02:45

Introduction to Use Cases
02:10
+
Apache Sqoop - an Introduction
5 Lectures 18:01
What is Apache Sqoop?
05:01

Sqoop Command Line Overview
03:37

<train/> Simple Import Command
02:53

<train/> Setting up a job and Executing
06:19

@Practice() : Sqoop Practice Exercise
00:11
+
Apache Flume : An introduction
8 Lectures 25:57
What is Apache Flume?
02:49


Installing and Running Flume
01:37

Chaining Flume Agents
02:33

<train/> Use Case 1 : Netcat to Console
05:57

<train/> Use Case 2 : Spool Directory to Destination Directory
03:24

<train/> Use Case 3 : Chaining with Directory, Avro and HDFS
04:37

@Practice() : Flume Practice Exercise
00:08
+
Apache Pig - An introduction
7 Lectures 01:09:43
What is Apache Pig?
08:31

Pig Latin Basics
10:25

Pig Operations
15:29

Data Engineering with Pig
07:55

<train/> Pig Data Flows
14:57

<train/> Data Engineering with Pig
12:15

@Practice() : Pig Practice Exercises
00:11
+
Mongo DB - An introduction
7 Lectures 21:08
What is Mongo DB?
03:30

Mongo DB Organization
02:52

Installation
01:05

<train/> Inserting Data
04:22

<train/> Querying Data
04:39

<train/> Updating and Deleting Data
04:20

@Practice() : Mongo DB Practice Exercise
00:20
+
Pipeline Use Case 1 : RDBMS to Big Data
2 Lectures 10:43
Use Case 1 - Problem and Solution Overview
02:59

<train/> Use Case Solution - Building the Pipeline
07:44
+
Pipeline Use Case 2 : Web Logs to Mongo DB
2 Lectures 09:23
Use Case 2 - Problem and Solution Overview
02:54

<train/> Use Case Solution - Building the Pipeline
06:29
+
Pipeline Use Case 3 : Multiple Sources to Monthly Summary
2 Lectures 19:52
Use Case 3 - Problem and Solution Overview
04:14

<train/> Use Case Solution - Building the Pipeline
15:38
+
APPLY Project : Building multi-source pipelines
2 Lectures 04:55
Apply Project - Problem Statement
00:58

Apply Project : Solution Review
03:57
1 More Section
About the Instructor
V2 Maestros, LLC
4.1 Average rating
3,296 Reviews
33,360 Students
13 Courses
Big Data Science / Analytics Experts | 25K+ students

V2 Maestros is dedicated to teaching big data / data science at affordable costs to the world. Our instructors have real world experience practicing big data and data science and delivering business results. Big Data Science is a hot and happening field in the IT industry. Unfortunately, the resources available for learning this skill are hard to find and expensive. We hope to ease this problem by providing quality education at affordable rates, there by building data science talent across the world.