Big Data Analyst: Using Sqoop and Advanced Hive (CCA159)
What you'll learn
- Students will learn advanced Hive and Sqoop for big data analytics and data ingestion.
- Prerequisite: basic knowledge of SQL
You will start by learning what Hadoop and the Hadoop Distributed File System (HDFS) are, along with the most common commands needed to work with the Hadoop file system.
Then you will be introduced to Sqoop import:
- Understand the lifecycle of a Sqoop command.
- Use the Sqoop import command to migrate data from MySQL to HDFS.
- Use the Sqoop import command to migrate data from MySQL to Hive.
- Use various file formats, compression codecs, field delimiters, WHERE clauses, and free-form queries while importing data.
- Understand split-by and boundary queries.
- Use incremental mode to migrate data from MySQL to HDFS.
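To give a feel for the import topics above, here is a minimal sketch in Python that only assembles a `sqoop import` command line (it does not run Sqoop). The database URL, user name, password file, table, and paths are hypothetical placeholders, not values from the course; the flags themselves are standard Sqoop 1 import options.

```python
import shlex

# Hypothetical connection details; replace with your own.
JDBC_URL = "jdbc:mysql://localhost:3306/retail_db"
DB_USER = "retail_user"
PASSWORD_FILE = "/user/retail_user/.mysql.password"


def sqoop_import_args(table, target_dir, split_by=None, where=None,
                      check_column=None, last_value=0, num_mappers=4):
    """Build the argument list for a `sqoop import` from MySQL to HDFS."""
    args = [
        "sqoop", "import",
        "--connect", JDBC_URL,
        "--username", DB_USER,
        "--password-file", PASSWORD_FILE,
        "--table", table,
        "--target-dir", target_dir,
        "--num-mappers", str(num_mappers),
    ]
    if split_by:
        # Column used to divide the rows among the parallel mappers.
        args += ["--split-by", split_by]
    if where:
        # Row filter applied on the database side.
        args += ["--where", where]
    if check_column:
        # Incremental append: only rows whose check column exceeds last_value.
        args += ["--incremental", "append",
                 "--check-column", check_column,
                 "--last-value", str(last_value)]
    return args


# Example: import only orders with order_id greater than 1000.
command = sqoop_import_args("orders", "/user/demo/orders",
                            split_by="order_id",
                            check_column="order_id", last_value=1000)
print(shlex.join(command))
```

The printed line can be pasted into a shell on a machine where Sqoop is installed. Other options covered in this section, such as `--as-parquetfile`, `--compress`, `--fields-terminated-by`, `--boundary-query`, or `--hive-import`, could be appended to the list in the same way.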
Next, you will learn to use Sqoop export to migrate data out of Hadoop:
- Understand what Sqoop export is.
- Use Sqoop export to migrate data from HDFS to MySQL.
- Use Sqoop export to migrate data from Hive to MySQL.
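The two export cases above differ mainly in where the files live and how their fields are delimited. The sketch below, again in Python and again only building the command line, assumes a hypothetical `daily_revenue` table, a hypothetical HDFS directory, and a Hive table stored as delimited text under the default warehouse path with Hive's default Ctrl-A field separator. Your cluster layout and file format may differ.

```python
import shlex

# Hypothetical connection details; replace with your own.
JDBC_URL = "jdbc:mysql://localhost:3306/retail_db"
DB_USER = "retail_user"
PASSWORD_FILE = "/user/retail_user/.mysql.password"

# Hive's default field separator for text tables is Ctrl-A,
# which Sqoop accepts in octal notation as \001.
HIVE_DELIMITER = "\\001"


def sqoop_export_args(table, export_dir, input_delimiter=","):
    """Build the argument list for a `sqoop export` into a MySQL table."""
    return [
        "sqoop", "export",
        "--connect", JDBC_URL,
        "--username", DB_USER,
        "--password-file", PASSWORD_FILE,
        "--table", table,            # target table; it must already exist in MySQL
        "--export-dir", export_dir,  # HDFS directory holding the data files
        "--input-fields-terminated-by", input_delimiter,
    ]


# Export comma-delimited files from a plain HDFS directory.
hdfs_export = sqoop_export_args("daily_revenue", "/user/demo/daily_revenue")

# Export the files behind a Hive text table (assumed default warehouse path).
hive_export = sqoop_export_args(
    "daily_revenue",
    "/user/hive/warehouse/retail.db/daily_revenue",
    HIVE_DELIMITER,
)

print(shlex.join(hdfs_export))
print(shlex.join(hive_export))
```

In both cases Sqoop reads files from HDFS; exporting "from Hive" here simply means pointing `--export-dir` at the directory that backs the Hive table.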
Finally, we will cover advanced Apache Hive:
- External and managed tables
- Insert and multi-insert
- Data types and complex data types
- Hive string functions
- Hive date functions
- Joins, multi-joins, and map joins
- Working with different file formats: Parquet and Avro
- Windowing functions: RANK, DENSE_RANK, LEAD, LAG, MIN, MAX
Who this course is for:
- Anyone preparing for the CCA159 Cloudera Big Data Analytics certification, or anyone who wants to learn advanced Hive and Sqoop
Navdeep is a renowned Premium Instructor on Udemy with 12 years of industry experience across different technologies and domains. With 9+ courses, 40,000+ students, and a 4.5-star rating, she is one of the leading instructors in the field of Big Data & Cloud.