Programming Hadoop with Python

Learn to use Python for developing Map Reduce applications
4.5 (30 ratings)
705 students enrolled
  • Lectures 20
  • Length 3 hours
  • Skill Level All Levels
  • Languages English
  • Includes Lifetime access
    30 day money back guarantee!
    Available on iOS and Android
    Certificate of Completion

About This Course

Published 4/2015 English

Course Description

Hadoop is a hot new technology whose adoption in the Big Data world is growing every day. However, Hadoop is built in Java, and application developers normally need to know or learn Java to develop MapReduce applications. Python is a very popular programming language that makes application development simple and easy. Hadoop provides a way to build MapReduce applications in Python through Hadoop Streaming. This way, developers can reuse their existing knowledge and code base to develop MapReduce applications quickly.

The intent of this course is to help Python developers learn the concepts and techniques for developing real-world applications in Hadoop. It walks the learner through five use cases of increasing difficulty to master MapReduce with Python.
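To give a feel for what a Python MapReduce job looks like, here is a minimal word-count sketch. It is not taken from the course materials. It assumes the usual Hadoop Streaming convention of tab-separated key/value text records, and the function names (`map_lines`, `reduce_lines`, `run_locally`) are hypothetical. The sort step stands in for Hadoop's shuffle, so the whole thing runs in a single process without a cluster.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' record for every word in the input."""
    for line in lines:
        for word in line.strip().lower().split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer: sum the counts per word. Input must already be sorted by key."""
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


def run_locally(lines):
    """Simulate map -> sort (shuffle) -> reduce in one process for testing."""
    return list(reduce_lines(sorted(map_lines(lines))))


for record in run_locally(["the cat", "The dog"]):
    print(record)
```

In an actual streaming job, each function would typically live in its own script that loops over `sys.stdin` and prints every record, and the two scripts would be handed to the Hadoop Streaming jar through its `-mapper` and `-reducer` options.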

What are the requirements?

  • Python programming knowledge or experience
  • Unix/Linux experience preferred

What am I going to get from this course?

  • Understand the components of Hadoop
  • Learn how Map Reduce works
  • Master the creation of Map Reduce jobs with Python

Who is the target audience?

  • Developers and Architects


Section 1: Introduction

  • Introduction to the course
  • About V2 Maestros
  • Set of files and datasets used in examples for this course

Section 2: Hadoop Basics

  • Hadoop Overview
  • Setting up the Cloudera VM
  • About HDFS
  • HDFS Usage Examples
  • Introduction to Map Reduce
  • A Map Reduce example in Java
  • The Hadoop Stack

Section 3: Python with Hadoop

  • Introduction to Hadoop Streaming
  • Using Python with Hadoop Streaming

Section 4: Hadoop - Python Use Cases

  • Use Case 1: Basic Data Cleansing
  • Use Case 2: Data Filtering
  • Use Case 3: Data Summarization
  • Use Case 4: Joining Data
  • Introduction to Text Processing / TF-IDF
  • Use Case 5: Computing TF-IDF

Section 5: Conclusion

  • Closing Remarks
  • BONUS Lecture: Other courses you should check out

Instructor Biography

V2 Maestros, LLC, Big Data Science / Analytics Experts | 10K+ students

V2 Maestros is dedicated to teaching big data and data science to the world at affordable cost. Our instructors have real-world experience practicing big data and data science and delivering business results. Big Data Science is a hot and happening field in the IT industry. Unfortunately, resources for learning this skill are hard to find and expensive. We hope to ease this problem by providing quality education at affordable rates, thereby building data science talent across the world.
