Learning Hadoop 2

An introduction to storing, structuring, and analyzing data at scale with Hadoop
4.3 (10 ratings) Instead of using a simple lifetime average, Udemy calculates a
course's star rating by considering a number of different factors
such as the number of ratings, the age of ratings, and the
likelihood of fraudulent ratings.
110 students enrolled
$19
$75
75% off
Take This Course
  • Lectures 19
  • Length 1.5 hours
  • Skill Level Beginner Level
  • Languages English
  • Includes Lifetime access
    30 day money back guarantee!
    Available on iOS and Android
    Certificate of Completion
Wishlisted Wishlist

How taking a course works

Discover

Find online courses made by experts from around the world.

Learn

Take your courses with you and learn anywhere, anytime.

Master

Learn and practice real-world skills and achieve your goals.

About This Course

Published 12/2015 English

Course Description

Hadoop emerged in response to the proliferation of masses and masses of data collected by organizations, offering a strong solution to store, process, and analyze what has commonly become known as Big Data. It comprises a comprehensive stack of components designed to enable these tasks on a distributed scale, across multiple servers and thousands of machines.

Learning Hadoop 2 introduces you to the powerful system synonymous with Big Data, demonstrating how to create an instance and leverage Hadoop ecosystem's many components to store, process, manage, and query massive data sets with confidence.

We open this course by providing an overview of the Hadoop component ecosystem, including HDFS, Sqoop, Flume, YARN, MapReduce, Pig, and Hive, before installing and configuring our Hadoop environment. We take a look at Hue, the graphical user interface of Hadoop.

We will then discover HDFS, Hadoop’s file-system used to store data. We will learn how to import and export data, both manually and automatically. Afterward, we turn our attention toward running computations using MapReduce, and get to grips working with Hadoop’s scripting language, Pig. Lastly, we will siphon data from HDFS into Hive, and demonstrate how it can be used to structure and query data sets.

About The Author

Randal Scott King is the Managing Partner of Brilliant Data, a consulting firm specialized in data analytics. In his 16 years of consulting, Scott has amassed an impressive list of clientele from mid-market leaders to Fortune 500 household names. Scott lives just outside Atlanta, GA, with his children.

What are the requirements?

  • We expect familiarity working at the Linux command line, and a basic understanding of Java. No prior experience with Hadoop is required.

What am I going to get from this course?

  • Install and configure an Hadoop instance of your own
  • Navigate Hue, the GUI for common tasks in Hadoop
  • Import data manually, and automatically from a database
  • Build scripts with Pig to perform common ETL tasks
  • Write and run a simple MapReduce program
  • Structure and query data effectively with Hive, Hadoop’s built-in data warehousing component

What is the target audience?

  • This video course is designed for application and system developers interested in understanding how to manage and analyze large scale data sets with the Hadoop framework.

What you get with this course?

Not for you? No problem.
30 day money back guarantee.

Forever yours.
Lifetime access.

Learn on the go.
Desktop, iOS and Android.

Get rewarded.
Certificate of completion.

Curriculum

Section 1: The Hadoop Ecosystem
01:51

This video will offer the overview of the course.

07:24

This video will introduce you to the basic concepts of Hadoop Distributed File System (HDFS) and Yet Another Resource Negotiator (YARN), which are the two core components of Hadoop.

03:17

An introduction to the basic concepts of Sqoop and Flume, two tools for the automation of data import into Hadoop.

03:38

An introduction to the basic concepts of MapReduce, the computation engine of Hadoop.

03:04

An introduction to the basic concepts of Pig, a scripting language for Hadoop.

06:33

An introduction to the basic concepts of Hive, Hadoop’s data warehousing solution.

Section 2: Installing and Configuring Hadoop
02:59

Put a working Hadoop installation on a laptop or server. You will need Hadoop on your laptop or server in order to continue.

05:24

Exploring the Hue, a GUI for Hadoop, to get familiar with the interface.

Section 3: Data Import and Export
04:33

This video will cover how to get data into HDFS manually.

06:27

This video will explain how to get data from databases into HDFS.

05:07

This video will cover how to import streaming data using the Flume tool.

Section 4: Using MapReduce and Pig
05:55

This video will explore how to build “Word Count” in Eclipse, then save it to a .jar and run it from MapReduce.

02:30

Coding the same word counting program, but this time in Pig.

08:48

This video will discuss how to use Pig to perform common Extract, Transform, and Load functions on data.

05:58

This video will explore how to use predefined code called User Defined Functions (UDFs) in Pig scripts.

Section 5: Using Hive
04:57

Create a database in Hive.

02:23

This video will cover how to get data into Hive from a database without going to HDFS first.

06:58

Using queries in Hive to find information.

02:15

A quick summary of what the viewer has learned in the entire course.

Students Who Viewed This Course Also Viewed

  • Loading
  • Loading
  • Loading

Instructor Biography

Packt Publishing, Tech Knowledge in Motion

Over the past ten years Packt Publishing has developed an extensive catalogue of over 2000 books, e-books and video courses aimed at keeping IT professionals ahead of the technology curve. From new takes on established technologies through to the latest guides on emerging platforms, topics and trends – Packt's focus has always been on giving our customers the working knowledge they need to get the job done. Our Udemy courses continue this tradition, bringing you comprehensive yet concise video courses straight from the experts.

Ready to start learning?
Take This Course