Programming Hadoop with Python

Learn to use Python for developing Map Reduce applications
4.7 (19 ratings) Instead of using a simple lifetime average, Udemy calculates a
course's star rating by considering a number of different factors
such as the number of ratings, the age of ratings, and the
likelihood of fraudulent ratings.
611 students enrolled
$20
Take This Course
  • Lectures 20
  • Contents Video: 3 hours
    Other: 0 mins
  • Skill Level All Levels
  • Languages English
  • Includes Lifetime access
    30 day money back guarantee!
    Available on iOS and Android
    Certificate of Completion
Wishlisted Wishlist

How taking a course works

Discover

Find online courses made by experts from around the world.

Learn

Take your courses with you and learn anywhere, anytime.

Master

Learn and practice real-world skills and achieve your goals.

About This Course

Published 4/2015 English

Course Description

Hadoop is a hot new technology that is growing everyday in its adaption in the Big Data world. However, Hadoop is built using Java and application developers need to know/learn Java for developing MapReduce applications. Python is a very popular programming language that makes the development of applications simple and easy. Hadoop provides a way by which MapReduce applications can be built using Python. This way, developers can use existing knowledge and code base for quickly developing MapReduce applications.

This intent of this course is to help Python developers learn the concepts and techniques for developing real world applications in Hadoop. It walks the learner through 5 examples of increasing difficulty for mastering MapReduce through Python.

What are the requirements?

  • Python programming knowledge/ experience
  • Unix / Linux experience preferred.

What am I going to get from this course?

  • Understand the components of Hadoop
  • Learn how Map Reduce works
  • Master the creation of Map Reduce jobs with Python

What is the target audience?

  • Developers and Architects

What you get with this course?

Not for you? No problem.
30 day money back guarantee.

Forever yours.
Lifetime access.

Learn on the go.
Desktop, iOS and Android.

Get rewarded.
Certificate of completion.

Curriculum

Section 1: Introduction
Introduction to the course
Preview
04:26
About V2 Maestros
Preview
01:39
Article

Set of files and datasets used in examples for this course

Section 2: Hadoop Basics
Hadoop Overview
Preview
10:06
Setting up the Cloudera VM
06:51
About HDFS
14:46
HDFS Usage Examples
06:01
Introduction to Map Reduce
17:24
A Map Reduce example in Java
16:46
The Hadoop Stack
06:27
Section 3: Python with Hadoop
Introduction to Hadoop Streaming
09:44
Using Python with Hadoop Streaming
04:27
Section 4: Hadoop - Python Use Cases
Use Case 1 : Basic Data Cleansing
11:37
Use Case 2 : Data Filtering
06:53
Use Case 3 : Data Summarization
07:19
Use Case 4 : Joining Data
15:56
Introduction to Text Processing / TF-IDF
14:53
Use Case 5 : Computing TF-IDF
17:13
Section 5: Conclusion
Closing Remarks
01:47
BONUS Lecture : Other courses you should check out
Article

Students Who Viewed This Course Also Viewed

  • Loading
  • Loading
  • Loading

Instructor Biography

V2 Maestros, Big Data Science / Analytics Experts | 10K+ students

V2 Maestros is dedicated to teaching big data / data science at affordable costs to the world. Our instructors have real world experience practicing big data and data science and delivering business results. Big Data Science is a hot and happening field in the IT industry. Unfortunately, the resources available for learning this skill are hard to find and expensive. We hope to ease this problem by providing quality education at affordable rates, there by building data science talent across the world.

Ready to start learning?
Take This Course