Data Analytics using Hadoop eco system
3.4 (251 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
12,083 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Data Analytics using Hadoop eco system to your Wishlist.

Add to Wishlist

Data Analytics using Hadoop eco system

Convert NYSE data into useful insights. In this course we will perform top down analysis of stock data based on volume.
3.4 (251 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
12,083 students enrolled
Created by Durga Gadiraju
Last updated 1/2015
English
Price: Free
Includes:
  • 2 hours on-demand video
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Use Hadoop and Hue to get insights into raw data
  • Use Tableau Public to build the reports and dashboard
  • Familiarize with Cloudera Quickstart VM so that one can explore Hadoop and other eco system tools
View Curriculum
Requirements
  • Basic SQL skills and advanced computer skills
  • A computer with 8 GB of RAM
Description

Are you an IT professional and interested in exploring Hadoop? Just take this free course and you will understand how easy it is not only to explore but also to implement Proof of concepts. You need to have a PC or Mac with 8 GB of RAM to run the code examples. Also you need to be comfortable in writing basic SQL queries.

You will learn setting up Cloudera VM and use tools like Hadoop, Hive and Hue to convert raw data into useful insights. Also you will become familiar with basic Hive commands/queries to process the data. You will also be able to develop basic reports using Tableau Public and publish them to your network.

You need not get overwhelmed of number of tools and technologies that are being referred as part of Hadoop eco system.

No need to struggle command line while developing PoCs, validating data, testing the code etc.

You can be an Architect, Developer, Tester, Analyst, Project Manager or any other IT professional.

Who is the target audience?
  • Any one from IT such as Data Analysts, Business Analysts, Developers, Testers who want to explore Hadoop
  • Course is not for non IT professionals
Students Who Viewed This Course Also Viewed
Curriculum For This Course
11 Lectures
01:57:03
+
Understanding requirements and gathering data
2 Lectures 10:28

After this lecture students will understand the requirements for this session

Understanding requirements
03:52

Gather data - NYSE and Companylist
06:36
+
Process data
5 Lectures 01:05:54

This session will cover setting up environment on the laptop. We will be downloading vmware software, CDH5 VM, configuring VM and then we will see what is Cloudera Manager and Hue.

Set up Hadoop eco system using Cloudera VM
09:52

In this lecture we will learn how to upload data to Hadoop using Hue.

Upload data using Hue
07:06

Create Hive database and external tables
16:08

In this lecture we will develop query for the insights in raw data, execute it and validate the results.

Develop Hive queries and validate the results
20:53

Create another hive table to stage the processed data for download.

Stage output of Hive query into another Hive table
11:55
+
Visualize data and publish reports
3 Lectures 33:19
Setup Tableau Public
06:51

Download data from staged hive table
06:12

In this lecture you will see how to generate reports and dashboard using Tableau Public and publish

Use Tableau Public to generate reports, dashboard and then publish
20:16
+
Conclusion
1 Lecture 07:22
Recap - Raw data to Dashboard in 2 hours
07:22
About the Instructor
Durga Gadiraju
3.4 Average rating
250 Reviews
12,083 Students
1 Course
Big Data Evangelist

Technology geek and Data Evangelist with deep dive expertise in Big Data, Decision Support and operational based systems.

* Pursuing leadership positions in India (currently based in US with Green card)
* Big Data Myth Buster
* Professional trainer in Big Data eco system, Oracle and Goldengate
* Proficient in cloud platforms such as AWS, Softlayer, Azure etc
* Expertise in Data Science - Statistical Analysis and Machine Learning
* Solid data integration background (real time, near real time, micro batch and batch).
* Proven expertise in implementing high volume OLTP, DW, ODS, MDM, Hadoop based systems.
* Multiple certifications in all categories of Big Data eco system, Relational Databases, ETL and Reporting tools etc.
* Mentor for cross functional teams in technology at all levels.
* Expert performance tuning and ability to provide scalable solutions.
* Ability to provide short term tactical technology solutions and strategic technology road maps for given business problem.
* Liaison between technology, business and program management.
* Solid consulting, program management and leadership skills in technology domain.
* Deep understanding in Investment Banking and Energy domains
* Experienced in building and managing teams up to 25 IT professionals