This tutorial starts by explaining why Hive is needed, then covers the Hive architecture and the different configuration parameters in Hive. During this course you will learn different aspects of Hive and how it fits as a data warehousing platform on Hadoop. Please subscribe to my YouTube channel "Hadooparch" for more details.
This course covers Hive, the SQL of Hadoop (HQL). We will learn why and how Hive is installed and configured on Hadoop. We will cover the components and architecture of Hive to see how it stores data in table-like structures over HDFS data. You will understand the architecture, installation, and configuration of Hive. We will install and configure HiveServer2 and replace the PostgreSQL database with MySQL. We will also learn how to install MySQL and configure it as the Hive Metastore.
This course is full of Hive demonstrations. We'll cover how to create databases, understand data types, create external, internal, and partitioned Hive tables, use bucketing, load data from the local filesystem as well as the distributed filesystem (HDFS), set up dynamic partitioning, create views, manage indexes, and see how the different layers work together in Hive.
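As a rough preview of the kind of HiveQL these demonstrations involve, here is a minimal sketch. All database, table, and column names and all file paths below are hypothetical placeholders chosen for illustration; they are not taken from the course material, and the exact settings you need may differ by Hive version.

```sql
-- Hypothetical names and paths throughout (demo_db, sales_raw, /tmp/sales.csv, ...).

-- Create and select a database.
CREATE DATABASE IF NOT EXISTS demo_db;
USE demo_db;

-- Internal (managed) table: Hive owns the data files and removes them on DROP TABLE.
CREATE TABLE IF NOT EXISTS sales_raw (
  order_id INT,
  customer STRING,
  amount   DOUBLE,
  country  STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE;

-- External table: Hive only tracks metadata; DROP TABLE leaves the files in HDFS.
CREATE EXTERNAL TABLE IF NOT EXISTS sales_ext (
  order_id INT,
  customer STRING,
  amount   DOUBLE,
  country  STRING
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE
LOCATION '/data/demo/sales_ext';

-- Load from the local filesystem (the file is copied into the table directory).
LOAD DATA LOCAL INPATH '/tmp/sales.csv' INTO TABLE sales_raw;

-- Load from HDFS (the file is moved into the table directory).
LOAD DATA INPATH '/data/demo/incoming/sales.csv' INTO TABLE sales_raw;

-- Partitioned and bucketed table.
-- Assumption: on older Hive releases you may also need
-- SET hive.enforce.bucketing = true; before inserting.
CREATE TABLE IF NOT EXISTS sales_part (
  order_id INT,
  customer STRING,
  amount   DOUBLE
)
PARTITIONED BY (country STRING)
CLUSTERED BY (order_id) INTO 4 BUCKETS
STORED AS ORC;

-- Dynamic partitioning: Hive derives the partition value from the last SELECT column.
SET hive.exec.dynamic.partition = true;
SET hive.exec.dynamic.partition.mode = nonstrict;

INSERT INTO TABLE sales_part PARTITION (country)
SELECT order_id, customer, amount, country
FROM sales_raw;

-- A view over the partitioned table.
CREATE VIEW IF NOT EXISTS big_orders AS
SELECT order_id, customer, amount, country
FROM sales_part
WHERE amount > 1000;
```

These statements would typically be run from a Hive client such as Beeline connected to HiveServer2. The partition column appears last in the SELECT list because that is where Hive expects dynamic partition values.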
We will go through the different roles involved in implementing real-world projects, how projects and permissions are set up, and how auditing and troubleshooting are handled.
Finally, I will provide sample data and queries so you can practice and replicate what has been taught in the videos.
This course includes multiple questions to test your understanding. Please attempt all of them.
What is Hive?
Apache Hive is a popular SQL interface for batch processing on Hadoop. Hadoop was built to organize and store massive amounts of data, and Hive gives another way to access the data inside the cluster that is quick and easy.
I have over 8 years of experience. My experience ranges from working in an IT company to starting up a design factory. I have worked with CMMI Level 5 companies and provided services to clients in various areas, including big data implementations.
My passion has always been to explain concepts as simply as possible. I love learning new technologies, and integrating technologies is my passion. As the Internet is already full of courses on Hadoop, I want to add courses on the Hadoop ecosystem products that made Hadoop popular.