Process Big Data using Apache PIG
4.3 (14 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
111 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Process Big Data using Apache PIG to your Wishlist.

Add to Wishlist

Process Big Data using Apache PIG

Learn analyzing and processing big data Using Apache Pig
4.3 (14 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
111 students enrolled
Last updated 3/2016
English
Current price: $10 Original price: $200 Discount: 95% off
1 day left at this price!
30-Day Money-Back Guarantee
Includes:
  • 5.5 hours on-demand video
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Overview of Big Data and Hadoop Framework
  • Anatomy of a MapReduce Framework
  • Basics of Apache Pig tool and Where we should use it or not
  • Run Pig in different Modes
  • Use Pig Latin Queries
  • Different types of PIG Operators for analysing the data
  • Understand the architecture of PIG tool
  • Work with PIG data model
  • Different kinds of built-in functions
  • Advanced PIG concepts such as PIG Streaming, PIG scripts and User Defined Functions(UDFs)
  • Compress the input files, final output files and intermediate output files
  • Pig Unit Testing, PIG Macros and Parameter Substitution
  • How to embed PIG in Java
View Curriculum
Requirements
  • Basic Understanding of Hadoop
  • Basic knowledge of Declarative Language such as SQL
  • Basic knowledge of Java Programming Language
Description

Pig is a high-level platform for creating MapReduce programs used with Hadoop. The language for this platform is called Pig Latin. In this course we will go through the PIG data flow platform and the language used by PIG tool. The concepts which are covered in this course are:

Writing complex MapReduce transformations using a simple scripting language.

Basics of Big Data, Hadoop and MapReduce Framework.

PIG Data Model and Different type of operators to operate on datasets.

Built-in Functions as well as User Defined Functions for performing a specific task.

Running PIG Script, Unit Testing and Compression.

Many more advance topics such as Embedding PIG in Java, PIG Macros etc.

All the books and PDFs are included, allowing you to follow along with the author throughout the modules in this course.

Who is the target audience?
  • Students having interest in Big Data and Hadoop Field
  • Database Developers and Administrator
  • Software developers want to build their career in Big Data field
  • Data Analysts
  • Data Scientists and Resesarcher
Students Who Viewed This Course Also Viewed
Curriculum For This Course
Expand All 49 Lectures Collapse All 49 Lectures 05:41:54
+
Module-1 Basics of Big Data, Hadoop and Pig
9 Lectures 21:17


1.3 Hadoop Distributed File System
05:00

1.4 Hadoop MapReduce
03:33

1.5 Introduction to Apache Pig
01:42

1.6 Importance of Apache PIG
01:11

1.7 Why PIG Over MapReduce
01:50

1.8 Where PIG is Best Suited
01:37

1.9 Where to Avoid PIG
00:28
+
Module-2 PIG Latin Language, Architecture and Modes of Pig
5 Lectures 23:09
2.1 PIG Latin Language
03:57

2.2 Running PIG in Different Modes
02:59

2.3 PIG Architecture
01:19

2.4 GRUNT Shell
07:18

2.5 PIG Latin Statements
07:36
+
Module-3 Data Model, Operators and Streaming in PIG
15 Lectures 02:16:17
3.1 PIG Data Model- Scalar Datatype
06:19

3.2 PIG Data Model - Complex DataType
11:02

3.3 Arithmetic Operators
06:37

3.4 Comparison Operators
10:44

3.5 Cast Operators
12:52

3.6 Type Construction Operator
07:03

3.7 Relational Operators
00:48

3.8 Loading and Storing Operators
05:38

3.9 Filtering Operators
10:45

3.10 Filtering Operators-Pig Streaming with Python
06:50

3.11 Grouping and Joining Operators- PART-1
16:41

3.12 Grouping and Joining Operators- PART-2
13:40

3.13 Sorting Operator
06:50

3.14 Combining and Splitting Operators
09:30

3.15 Diagnostic Operators
10:58
+
Module-4 Different Kinds of Built-In Functions in PIG
7 Lectures 01:23:43
4.1 Eval Functions PART-1
17:02

4.2 Eval Functions PART-2
04:13

4.3 Eval Functions PART-3
18:51

4.4 Load and Store Functions
16:52

4.5 Tuple and Bag Functions
06:19

4.6 String Functions
11:38

4.7 Math Function
08:48
+
Module-5 Advanced Pig Latin with Pig Scripts,UDF's,Utility Commands
2 Lectures 18:47
5.1 Running Pig Scripts
07:06

5.3 Utility Commands
11:41
+
Module-6 File Compression
3 Lectures 24:07
6.1 File Compression in pig
10:18

6.2 Intermediate Compression
07:28

6.3 Pig Unit Testing
06:21
+
Module-7 Advanced Pig Latin
4 Lectures 28:11
7.1 Embedded Pig in JAVA
07:11

7.2 Pig Macros
06:56

7.3 Import Macros
04:49

7.4 Parameter Substitution
09:15
About the Instructor
Digitorious Technologies
3.8 Average rating
169 Reviews
1,611 Students
10 Courses
Make Learning Smarter

Digitorious technologies is a leading publisher of development courses which provide in-depth knowledge and high quality training. Digitorious technologies is serving with a mission of providing right direction to people who are looking for a career in IT/software industry. Digitorious is the best place for learning new technologies and making things easy to understand virtually.