
Attached docs are giveaways to all who wants to learn Hadoop the power tool. The sole purpose of this giveaway is to help the learner to be more confident in the interview. The attached 100 solved query pdf can help every learner to understand the different types of error Hadoop admins face in their day to day professional life. This giveaway covers issues and solutions to most of the technology which is part of the Hadoop Ecosystem.
This module will help the learner to understand what is bigdata, why we must understand the importance of big data and how the traditional tools failed to store and process it. This video will also help the learner to understand the importance of Hadoop and its eco tools.
In this video, the learner will understand how to install VM-Ware, configuring Centos on VM-Ware and how to install Hadoop on a single machine. If the student doesn't have enough capacity on his/her Laptop / Desktop, the student should follow the guideline of Hadoop Multi-node cluster, the only exception Learner needs to remember is that he/she doesn't have to create an image of the machine. For more details please go through the attached Configure Hadoop on AWS Cloud Platform pdf else contact me for more guidance.
In this video, the learner should get a clear cut understanding of how to create Hadoop Cluster on AWS platform and how to submit Hadoop commands or HDFS commands on the Hadoop cluster. The learner is advised to practice all the commands given in the attached HDFS command pdf file.
Hadoop Administer must know how to submit a job or how to test the credibility of the configured cluster. Leaner must know how to test the cluster potential of data distribution and data analysis. This video does exactly the same.
It is very important for any Hadoop admin to know how to plan Hadoop Cluster. This video can give professional-level expertise to plan a Hadoop Cluster. The learner should be able to plan a Hadoop cluster with a proper understanding of each step. After completing this video learner can practice on attached file for planning his/ her cluster.
SafeMode is a maintenance state of the NameNode machine. This video helps the learner to understand the importance of SafeMode command, what this command does and when to use this command.
This video should help the learner to remove or connect servers professionally and take care of all pre and post requisites to complete the task.
Trash Functionality helps the learner to recover accidentally deleted data. This module helps the learner to configure this functionality to increase the possibility of data recovery. The learner should not lose any more data after implementing this configuration.
Rack Awareness concept helps the learner to increase the possibility of data recovery and data reliability on Hadoop Cluster. Using this concept learner should learn the data block policy which makes the magic of data recovery.
This module helps the learner to use DistCp utility most effectively. Using this Utility learner should be able to copy data from one cluster to another cluster. This module also guides on how to copy data within the cluster.
This module focus on configuring Capacity Scheduler on Hadoop Cluster. This module explains how to use default Capacity Scheduler as a user-defined scheduler. After completing this module leaner will be able to manage cluster resource between multiple teams.
This lecture can help students to configure Namenode High Availability. Using this module learner can convert any single point of failure namenode into highly available namenode. In this module, the learner will understand the importance of an odd number of machines for high availability.
This module helps the learner to create a Cloudera Hadoop Multinode Cluster. Using this module Learner can understand all the configuration changes and tips to create and administer a Cloudera Cluster.
This module helps the learner to create an active directory and configure Kerberos on single node Cloudera Hadoop Cluster. The learner can use LDAP configuration for connection. To complete this module user must create one Cloudera Hadoop Cluster for quick learning. Student can follow steps given in Module 7 to create a Cloudera Single Node Cluster. The guideline given in this module of Kerberos configuration is applicable for Cloudera Multinode Cluster as well but since AWS free Tier account does not allow us to create more than 5 machines we are focusing on Single Machine only.
Module 0: Giveaways
· Linux / UNIX Course
· 100 Solved Queries of Hadoop Administration Day to Day activities.
· Guidelines to create an AWS account.
Module 1: Introduction of Hadoop Administration
· Understanding Big Data
· Common big data domain scenarios
· Analyze Limitation of Traditional Solutions
· Roles and Responsibility
· Case Studies
Module 2: Hadoop Architecture And Mapreduce
· Introduction to Hadoop
· Hadoop Architecture
· Difference between Hadoop 1.x, Hadoop 2.x and Hadoop 3.x
· Hadoop 1.x Ecosystem tools and Core System
· Hadoop 2.x Ecosystem tools and Core System
· HDFS File System
o Introduction of NameNode, DataNode and Secondary NameNode
o Anatomy of Write and Read
o Replication Pipeline
· YARN Framework
o Role and function of YARN in Hadoop
o Mapreduce Theory
§ Cluster testing using MapReduce Code in YARN Environment
Module 3: Cluster Planning
· Types of Rack
· General Principal of selecting CPU Memory and hardware
· Understand Hardware Consideration
· Machines requirement as per the daemons
· Learn Best Practice for selecting hardware
Know the network Consideration
Module 4: Hadoop Cluster Administration, Backup, Recovery and Maintenance
· SafeMode
· Decommissioning, Commissioning and Re-Commissioning of Node
· Trash Functionality
· Distcp
· Rack Awareness
· HDFS / Hadoop Balancer
Module 5: Managing Resources and Scheduling
· Scheduler: Explanation and demo
o Capacity Scheduler
Module 6: HDFS Federation and High Availability
· Understand the YARN framework
· Understand the Federation
· Understand High Availability
· High Availability Implementation Using Quorum Journal Manager
Module 7: Cloudera Setup and Performance Tuning
· Cloudera Distribution Hadoop
· Cloudera Features
· Cloudera Manager Editions
· Cloudera Manager Web UI
· CDH Installation
Module 8: Security
· Basics of Hadoop Platform Security
· Securing the Platform
· Understand Kerberos
Configuring Kerberos on Cloudera Hadoop Cluster using LDAP authentication