Data Analytics using Hadoop eco system

Convert NYSE data into useful insights. In this course we will perform a top-down analysis of stock data based on trading volume.
4.1 (226 ratings)
11,728 students enrolled
Instructed by Durga Gadiraju
Category: IT & Software / Other
  • Lectures: 11
  • Length: 2 hours
  • Skill Level: Intermediate
  • Languages: English
  • Includes: lifetime access, 30-day money-back guarantee, availability on iOS and Android, certificate of completion

About This Course

Published 1/2015 English

Course Description

Are you an IT professional interested in exploring Hadoop? Take this free course and you will see how easy it is not only to explore Hadoop but also to implement proofs of concept. You need a PC or Mac with 8 GB of RAM to run the code examples, and you should be comfortable writing basic SQL queries.

You will learn to set up the Cloudera VM and use tools such as Hadoop, Hive and Hue to convert raw data into useful insights. You will also become familiar with basic Hive commands and queries for processing the data, and you will be able to develop basic reports using Tableau Public and publish them to your network.

You need not be overwhelmed by the number of tools and technologies that are referred to as part of the Hadoop ecosystem.

There is no need to struggle with the command line while developing PoCs, validating data, testing code, and so on.

You can be an Architect, Developer, Tester, Analyst, Project Manager or any other IT professional.

What are the requirements?

  • Basic SQL skills and advanced computer skills
  • A computer with 8 GB of RAM

What am I going to get from this course?

  • Use Hadoop and Hue to get insights into raw data
  • Use Tableau Public to build reports and dashboards
  • Become familiar with the Cloudera QuickStart VM so that you can explore Hadoop and other ecosystem tools

Who is the target audience?

  • Anyone in IT, such as Data Analysts, Business Analysts, Developers and Testers, who wants to explore Hadoop
  • This course is not intended for non-IT professionals


Section 1: Understanding requirements and gathering data

After this lecture, students will understand the requirements for this session.

Gather data - NYSE and Companylist

Section 2: Process data

This session covers setting up the environment on a laptop. We will download the VMware software and the CDH5 VM, configure the VM, and then look at what Cloudera Manager and Hue are.


In this lecture we will learn how to upload data to Hadoop using Hue.

Create Hive database and external tables
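
As a rough illustration of this step, the sketch below renders the kind of HiveQL statement that defines an external table over delimited files already uploaded to HDFS. The database name `nyse_demo`, the table name `nyse_daily`, the column list and the HDFS directory are all hypothetical; the course page does not state the actual schema or paths. Python is used here only to assemble and print the statement text, which would then be pasted into the Hive query editor in Hue.

```python
# Hypothetical schema: the course page does not list the NYSE file's columns.
NYSE_COLUMNS = [
    ("stock_ticker", "STRING"),
    ("trade_date", "STRING"),
    ("open_price", "DOUBLE"),
    ("high_price", "DOUBLE"),
    ("low_price", "DOUBLE"),
    ("close_price", "DOUBLE"),
    ("volume", "BIGINT"),
]


def external_table_ddl(database, table, columns, hdfs_dir, delimiter=","):
    """Render a HiveQL CREATE EXTERNAL TABLE statement for delimited text files."""
    column_lines = ",\n".join(f"  {name} {hive_type}" for name, hive_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {database}.{table} (\n"
        f"{column_lines}\n"
        f")\n"
        f"ROW FORMAT DELIMITED FIELDS TERMINATED BY '{delimiter}'\n"
        f"STORED AS TEXTFILE\n"
        f"LOCATION '{hdfs_dir}'"
    )


# Database name, table name and HDFS directory are assumptions for illustration.
ddl = external_table_ddl("nyse_demo", "nyse_daily", NYSE_COLUMNS, "/user/cloudera/nyse")
print("CREATE DATABASE IF NOT EXISTS nyse_demo;")
print(ddl + ";")
```

Because the table is external, dropping it in Hive removes only the table definition and leaves the uploaded files in place, which suits raw data that may be reused.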

In this lecture we will develop a query to extract insights from the raw data, execute it, and validate the results.
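
To give a feel for what a top-down volume query might look like, here is a minimal sketch that uses Python's built-in sqlite3 module as a stand-in for Hive, so it can run without a cluster. The table names, column names and sample rows are invented for illustration and are not the course's actual data. The same SELECT statements should carry over to Hive with little change, but that is an assumption rather than something the course page confirms.

```python
import sqlite3

# In-memory database standing in for the Hive tables (hypothetical schema).
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE nyse_daily (stock_ticker TEXT, trade_date TEXT, volume INTEGER);
CREATE TABLE company_list (stock_ticker TEXT, company_name TEXT, sector TEXT);
""")
conn.executemany("INSERT INTO nyse_daily VALUES (?, ?, ?)", [
    ("AAA", "2014-01-02", 100),
    ("AAA", "2014-01-03", 150),
    ("BBB", "2014-01-02", 400),
    ("CCC", "2014-01-02", 50),
])
conn.executemany("INSERT INTO company_list VALUES (?, ?, ?)", [
    ("AAA", "Alpha Corp", "Energy"),
    ("BBB", "Beta Inc", "Finance"),
    ("CCC", "Gamma Ltd", "Energy"),
])

# Top level: total traded volume per sector, largest first.
sector_volume = conn.execute("""
SELECT c.sector, SUM(d.volume) AS total_volume
FROM nyse_daily d JOIN company_list c ON d.stock_ticker = c.stock_ticker
GROUP BY c.sector
ORDER BY total_volume DESC
""").fetchall()

# Next level down: volume per ticker within each sector.
ticker_volume = conn.execute("""
SELECT c.sector, d.stock_ticker, SUM(d.volume) AS total_volume
FROM nyse_daily d JOIN company_list c ON d.stock_ticker = c.stock_ticker
GROUP BY c.sector, d.stock_ticker
ORDER BY c.sector, total_volume DESC
""").fetchall()

# Validation: the sector totals must add up to the raw total volume.
raw_total = conn.execute("SELECT SUM(volume) FROM nyse_daily").fetchone()[0]
assert sum(total for _, total in sector_volume) == raw_total

print(sector_volume)
print(ticker_volume)
```

Comparing the aggregated totals against the raw total is one simple way to validate results: an inner join silently drops tickers that are missing from the company list, and this check would expose that.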


Create another Hive table to stage the processed data for download.
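
The staging idea can be sketched as follows, again with sqlite3 standing in for Hive and with hypothetical table names, column names and sample rows. A `CREATE TABLE ... AS SELECT` statement materializes the aggregated result, and the staged rows are then written out as CSV, the kind of flat file a reporting tool such as Tableau Public can read. In the course itself the download is done from the staged Hive table; the export function below is only an illustrative substitute.

```python
import csv
import io
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE nyse_daily (stock_ticker TEXT, trade_date TEXT, volume INTEGER)")
conn.executemany("INSERT INTO nyse_daily VALUES (?, ?, ?)", [
    ("AAA", "2014-01-02", 100),
    ("AAA", "2014-01-03", 150),
    ("BBB", "2014-01-02", 400),
])

# Stage the processed (aggregated) data in its own table.
conn.execute("""
CREATE TABLE volume_by_ticker_stage AS
SELECT stock_ticker, SUM(volume) AS total_volume
FROM nyse_daily
GROUP BY stock_ticker
""")


def export_stage_csv(connection):
    """Return the staged rows as CSV text with a header row."""
    cursor = connection.execute(
        "SELECT stock_ticker, total_volume FROM volume_by_ticker_stage "
        "ORDER BY total_volume DESC"
    )
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow([column[0] for column in cursor.description])
    writer.writerows(cursor)
    return buffer.getvalue()


csv_text = export_stage_csv(conn)
print(csv_text)
```

Staging the result keeps the download small: only the summarized rows leave the cluster, not the raw daily records.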

Section 3: Visualize data and publish reports
Set up Tableau Public
Download data from the staged Hive table

In this lecture you will see how to generate reports and a dashboard using Tableau Public and publish them.

Section 4: Conclusion
Recap - Raw data to Dashboard in 2 hours

Instructor Biography

Durga Gadiraju, Big Data Evangelist

Technology geek and Data Evangelist with deep dive expertise in Big Data, Decision Support and operational based systems.

* Pursuing leadership positions in India (currently based in US with Green card)
* Big Data Myth Buster
* Professional trainer in Big Data eco system, Oracle and Goldengate
* Proficient in cloud platforms such as AWS, Softlayer, Azure etc
* Expertise in Data Science - Statistical Analysis and Machine Learning
* Solid data integration background (real time, near real time, micro batch and batch).
* Proven expertise in implementing high volume OLTP, DW, ODS, MDM, Hadoop based systems.
* Multiple certifications in all categories of Big Data eco system, Relational Databases, ETL and Reporting tools etc.
* Mentor for cross functional teams in technology at all levels.
* Expertise in performance tuning and ability to provide scalable solutions.
* Ability to provide short term tactical technology solutions and strategic technology road maps for a given business problem.
* Liaison between technology, business and program management.
* Solid consulting, program management and leadership skills in technology domain.
* Deep understanding in Investment Banking and Energy domains
* Experienced in building and managing teams up to 25 IT professionals
