Step2:Creating and importing data in hive external tables
Hands on Lab DataMaskingProject-step2- Document
Step3:Creating UDFs in Java
Hands on Lab DataMaskingProject-step3(udf)- Document
Step4:Exporting data to MySql in masked database
Hands on Lab DataMaskingProjectStep4 - Document
Should know the basics of BigData concepts like-HDFS,MapReduce and some knowledge of RDBMS.
Should take our Part-1 free course (Big data Internship Program - Foundation) to understand these concepts better. (Not mandatory but desirable).
Should take our Part-1 free course (Big data Internship Program - ingestion ) to understand sqoop and flume. ( For our Book Recommendation Project only).
This course is part of “Big data Internship Program” which is aligned to a typical Big data project life cycle stage.
This course is focused on Data Processing in Big data.This course is suitable for developers, data analysts and business analysts. Experience with SQL and scripting languages is recommended, but is not required.
You will learn
Understanding of Hive core concept and architecture.
How to create and manipulate tables using Hive.
Advanced features of Hive.
Hive Best Practices
Performing real-time, complex queries on datasets
Reading and Writing Data with Pig
Pig Best Practices
Project work -
Provide Data in Hive and manipulate the data for Our Book Recommendation project.
One Ad-on project -- Data Masking with hive and sqoop
Who this course is for:
This course is for anyone who wants to learn about Hive and Pig in details with hand exprience.
Big Data Trunk is the leading Big Data focus consulting and training firm founded by industry veterans in data domain. It helps is customer gain competitive advantage from open source, big data, cloud and advanced analytics. It provides services like Strategy Consulting, Advisory Consulting and high quality classroom individual and corporate training.