Process Big Data using Apache PIG

Learn analyzing and processing big data Using Apache Pig
4.5 (6 ratings) Instead of using a simple lifetime average, Udemy calculates a
course's star rating by considering a number of different factors
such as the number of ratings, the age of ratings, and the
likelihood of fraudulent ratings.
69 students enrolled
$19
$100
81% off
Take This Course
  • Lectures 49
  • Length 5.5 hours
  • Skill Level Intermediate Level
  • Languages English
  • Includes Lifetime access
    30 day money back guarantee!
    Available on iOS and Android
    Certificate of Completion
Wishlisted Wishlist

How taking a course works

Discover

Find online courses made by experts from around the world.

Learn

Take your courses with you and learn anywhere, anytime.

Master

Learn and practice real-world skills and achieve your goals.

About This Course

Published 2/2016 English

Course Description

Pig is a high-level platform for creating MapReduce programs used with Hadoop. The language for this platform is called Pig Latin. In this course we will go through the PIG data flow platform and the language used by PIG tool. The concepts which are covered in this course are:

Writing complex MapReduce transformations using a simple scripting language.

Basics of Big Data, Hadoop and MapReduce Framework.

PIG Data Model and Different type of operators to operate on datasets.

Built-in Functions as well as User Defined Functions for performing a specific task.

Running PIG Script, Unit Testing and Compression.

Many more advance topics such as Embedding PIG in Java, PIG Macros etc.

All the books and PDFs are included, allowing you to follow along with the author throughout the modules in this course.

What are the requirements?

  • Basic Understanding of Hadoop
  • Basic knowledge of Declarative Language such as SQL
  • Basic knowledge of Java Programming Language

What am I going to get from this course?

  • Overview of Big Data and Hadoop Framework
  • Anatomy of a MapReduce Framework
  • Basics of Apache Pig tool and Where we should use it or not
  • Run Pig in different Modes
  • Use Pig Latin Queries
  • Different types of PIG Operators for analysing the data
  • Understand the architecture of PIG tool
  • Work with PIG data model
  • Different kinds of built-in functions
  • Advanced PIG concepts such as PIG Streaming, PIG scripts and User Defined Functions(UDFs)
  • Compress the input files, final output files and intermediate output files
  • Pig Unit Testing, PIG Macros and Parameter Substitution
  • How to embed PIG in Java

What is the target audience?

  • Students having interest in Big Data and Hadoop Field
  • Database Developers and Administrator
  • Software developers want to build their career in Big Data field
  • Data Analysts
  • Data Scientists and Resesarcher

What you get with this course?

Not for you? No problem.
30 day money back guarantee.

Forever yours.
Lifetime access.

Learn on the go.
Desktop, iOS and Android.

Get rewarded.
Certificate of completion.

Curriculum

Section 1: Module-0 Introduction to the Course
0.1 Introduction to Apache PIG
Preview
01:12
0.2 Prerequisites for Apache PIG
Preview
00:43
0.3 Exploring Use Cases of PIG
Preview
02:42
0.4 History of Apache PIG
Preview
01:46
Section 2: Module-1 Basics of Big Data, Hadoop and Pig
1.1 Big Data Overview
Preview
03:27
1.2 Apache Hadoop Framework
Preview
02:29
1.3 Hadoop Distributed File System
05:00
1.4 Hadoop MapReduce
03:33
1.5 Introduction to Apache Pig
01:42
1.6 Importance of Apache PIG
01:11
1.7 Why PIG Over MapReduce
01:50
1.8 Where PIG is Best Suited
01:37
1.9 Where to Avoid PIG
00:28
Section 3: Module-2 PIG Latin Language, Architecture and Modes of Pig
2.1 PIG Latin Language
03:57
2.2 Running PIG in Different Modes
02:59
2.3 PIG Architecture
01:19
2.4 GRUNT Shell
07:18
2.5 PIG Latin Statements
07:36
Section 4: Module-3 Data Model, Operators and Streaming in PIG
3.1 PIG Data Model- Scalar Datatype
06:19
3.2 PIG Data Model - Complex DataType
11:02
3.3 Arithmetic Operators
06:37
3.4 Comparison Operators
10:44
3.5 Cast Operators
12:52
3.6 Type Construction Operator
07:03
3.7 Relational Operators
00:48
3.8 Loading and Storing Operators
05:38
3.9 Filtering Operators
10:45
3.10 Filtering Operators-Pig Streaming with Python
06:50
3.11 Grouping and Joining Operators- PART-1
16:41
3.12 Grouping and Joining Operators- PART-2
13:40
3.13 Sorting Operator
06:50
3.14 Combining and Splitting Operators
09:30
3.15 Diagnostic Operators
10:58
Section 5: Module-4 Different Kinds of Built-In Functions in PIG
4.1 Eval Functions PART-1
17:02
4.2 Eval Functions PART-2
04:13
4.3 Eval Functions PART-3
18:51
4.4 Load and Store Functions
16:52
4.5 Tuple and Bag Functions
06:19
4.6 String Functions
11:38
4.7 Math Function
08:48
Section 6: Module-5 Advanced Pig Latin with Pig Scripts,UDF's,Utility Commands
5.1 Running Pig Scripts
07:06
5.3 Utility Commands
11:41
Section 7: Module-6 File Compression
6.1 File Compression in pig
10:18
6.2 Intermediate Compression
07:28
6.3 Pig Unit Testing
06:21
Section 8: Module-7 Advanced Pig Latin
7.1 Embedded Pig in JAVA
07:11
7.2 Pig Macros
06:56
7.3 Import Macros
04:49
7.4 Parameter Substitution
09:15

Students Who Viewed This Course Also Viewed

  • Loading
  • Loading
  • Loading

Instructor Biography

Digitorious Technologies, Make Learning Smarter

Digitorious technologies is a leading publisher of development courses which provide in-depth knowledge and high quality training. Digitorious technologies is serving with a mission of providing right direction to people who are looking for a career in IT/software industry. Digitorious is the best place for learning new technologies and making things easy to understand virtually.

Ready to start learning?
Take This Course