Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Complete Hadoop Framework including kafka,spark and mongo db
Rating: 3.6 out of 5(207 ratings)
1,321 students
Last updated 1/2019
English

What you'll learn

  • Importance of hadoop framework in BigData analytics
  • Understanding Hadoop Framework in detail
  • Hands on experience on data ingestion techniques : Apache Sqoop and Apache Flume
  • Hands on experience on MapReduce Programming and its hidden concepts
  • Hands on experience on Apache Hive Programming, Performance tuning, UDF's
  • Understand and work with Pig
  • Realtime data streaming analysis with Apache Spark and its ecosystems
  • Understand and work with Apache Kafka
  • Process workflow automation using Oozie
  • Understand and work with MongoDb
  • Case Studies , practical explanations and Interview Questions

Course content

17 sections96 lectures18h 31m total length
  • Course Introduction2:27

    This video provides detailed overview of this course and the topics that will be covered as a part of this course. Also , a brief note about trainer's profile and experience is also mentioned.

Requirements

  • Be familiar with sql concepts, programming basics
  • Download Cloudera quickstart VM CDH 5.8 and install VMWare workstation player. Environment setup guidance will be covered in our lectures

Description

Data Analytics is the practice of using data to drive business strategy and performance. It includes a range of approaches and solutions, from looking backward to evaluate what happened in the past to looking forward to do scenario planning and predictive modelling.Data Analytics spans all of the functional businesses to address a continuum of opportunities in Information Management, Performance Optimisation and Analytic Insights. Organizations now realize the inherent value of transforming these big data into actionable insights. Data science is the highest form of big data analytics that produce the most accurate actionable insights, identifying what will happen next and what to do about it. 

Hadoop is an open-source software framework for storing data and running applications on clusters of commodity hardware. It provides massive storage for any kind of data, enormous processing power and the ability to handle virtually limitless concurrent tasks or jobs. Hadoop is not just an effective distributed storage system for large amounts of data, but also, importantly, a distributed computing environment that can execute analyses where the data is.

In this course, detailed explanation about hadoop framework  and its ecosystems has been provided. All the concepts are explained in detail with examples and business use cases as case studies.Also, latest technologies in big data area like apache spark, apache kafka, Mongo DB are explained. In addition, Interview questions  with respect to each ecosystem and resume preparation tips are included.

Who this course is for:

  • This course is addressed to the students who has some prior knowledge on programming, sql concepts.
  • Any one who is interested to pursue their career as a hadoop developer