Programming Hadoop with Python
course's star rating by considering a number of different factors
such as the number of ratings, the age of ratings, and the
likelihood of fraudulent ratings.
Find online courses made by experts from around the world.
Take your courses with you and learn anywhere, anytime.
Learn and practice real-world skills and achieve your goals.
Hadoop is a hot new technology that is growing everyday in its adaption in the Big Data world. However, Hadoop is built using Java and application developers need to know/learn Java for developing MapReduce applications. Python is a very popular programming language that makes the development of applications simple and easy. Hadoop provides a way by which MapReduce applications can be built using Python. This way, developers can use existing knowledge and code base for quickly developing MapReduce applications.
This intent of this course is to help Python developers learn the concepts and techniques for developing real world applications in Hadoop. It walks the learner through 5 examples of increasing difficulty for mastering MapReduce through Python.
Not for you? No problem.
30 day money back guarantee.
Learn on the go.
Desktop, iOS and Android.
Certificate of completion.
|Section 1: Introduction|
Introduction to the coursePreview
About V2 MaestrosPreview
Set of files and datasets used in examples for this course
|Section 2: Hadoop Basics|
Setting up the Cloudera VM
HDFS Usage Examples
Introduction to Map Reduce
A Map Reduce example in Java
The Hadoop Stack
|Section 3: Python with Hadoop|
Introduction to Hadoop Streaming
Using Python with Hadoop Streaming
|Section 4: Hadoop - Python Use Cases|
Use Case 1 : Basic Data Cleansing
Use Case 2 : Data Filtering
Use Case 3 : Data Summarization
Use Case 4 : Joining Data
Introduction to Text Processing / TF-IDF
Use Case 5 : Computing TF-IDF
|Section 5: Conclusion|
BONUS Lecture : Other courses you should check out
V2 Maestros is dedicated to teaching big data / data science at affordable costs to the world. Our instructors have real world experience practicing big data and data science and delivering business results. Big Data Science is a hot and happening field in the IT industry. Unfortunately, the resources available for learning this skill are hard to find and expensive. We hope to ease this problem by providing quality education at affordable rates, there by building data science talent across the world.