Udemy

Big Data and Hadoop for Beginners - with Hands-on!

Everything you need to know about Big Data. Learn Hadoop, HDFS, MapReduce, Hive & Pig by designing a data pipeline.
Rating: 4.2 out of 5 (1,983 ratings)
27,714 students
Created by Andalib Ansari
Last updated 1/2021
English
English [Auto]
30-Day Money-Back Guarantee

What you'll learn

  • Understand different technology trends, salary trends, Big Data market and different job roles in Big Data
  • Understand what Hadoop is for, and how it works
  • Understand the complex architecture of Hadoop and its components
  • Hadoop installation on your machine
  • Understand how MapReduce, Hive and Pig can be used to analyze big data sets
  • High quality documents
  • Demos: Running HDFS commands, Hive queries, Pig queries
  • Sample data sets and scripts (HDFS commands, Hive sample queries, Pig sample queries, Data Pipeline sample queries)
  • Start writing your own code in Hive and Pig to process huge volumes of data
  • Design your own data pipeline using Pig and Hive
  • Understand modern data architecture: Data Lake
  • Practice with Big Data sets

Requirements

  • Basic knowledge of SQL and RDBMS would be a plus
  • A machine running Mac, Linux/Unix, or Windows

Description


The main objective of this course is to help you understand the complex architecture of Hadoop and its components, point you in the right direction, and get you working with Hadoop and its components quickly.

It covers everything that you need as a Big Data beginner. Learn about the Big Data market, different job roles, technology trends, history of Hadoop, HDFS, Hadoop Ecosystem, Hive, and Pig. We will see how a beginner should get started with Hadoop. This course comes with a lot of hands-on examples that will help you learn Hadoop quickly.

The course has 6 sections, and focuses on the following topics:

Big Data at a Glance: Learn about Big Data and different job roles required in the Big Data market. Know big data salary trends around the globe. Learn about the hottest technologies and their trends in the market.

Getting Started with Hadoop: Understand Hadoop and its complex architecture. Learn the Hadoop Ecosystem with simple examples. Know the different versions of Hadoop (Hadoop 1.x vs Hadoop 2.x), the different Hadoop vendors in the market, and Hadoop on Cloud. Understand how Hadoop uses the ELT approach. Learn to install Hadoop on your machine. We will run HDFS commands from the command line to manage HDFS.
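
As a rough illustration of the ELT idea mentioned above (load the raw data first, transform it later), here is a minimal sketch in plain Python. It does not use Hadoop; the sample records and the helper `parse_amount` are hypothetical and are not taken from the course material.

```python
import json

# Hypothetical raw events, as they might arrive from an application log.
raw_lines = [
    '{"user": "a", "amount": "10"}',
    '{"user": "b", "amount": "oops"}',
    '{"user": "a", "amount": "5"}',
]

def parse_amount(line):
    """Transform step: parse one raw line into (user, amount), or None if malformed."""
    record = json.loads(line)
    try:
        return record["user"], int(record["amount"])
    except ValueError:
        return None

# ETL: transform first, then load only the cleaned rows into the store.
etl_store = [row for row in map(parse_amount, raw_lines) if row is not None]

# ELT: load the raw lines untouched, then transform later when a question is asked.
elt_store = list(raw_lines)
elt_view = [row for row in map(parse_amount, elt_store) if row is not None]

print(len(etl_store), len(elt_store), elt_view == etl_store)
```

In the ELT variant the malformed record is still kept in the store, so a different transformation can be applied to it later without re-ingesting the source.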

Getting Started with Hive: Understand what kind of problem Hive solves in Big Data. Learn its architectural design and working mechanism. Know the data models in Hive, the different file formats supported by Hive, Hive queries, etc. We will run queries in Hive.

Getting Started with Pig: Understand how Pig solves problems in Big Data. Learn its architectural design and working mechanism. Understand how Pig Latin works in Pig. You will understand the differences between SQL and Pig Latin. Includes demos of running different queries in Pig.
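
To give a feel for the SQL vs Pig Latin difference mentioned above, here is a minimal sketch in plain Python (not Pig itself). It mimics the step-by-step dataflow style of a Pig Latin script, where each statement names an intermediate relation, in contrast to a single declarative SQL query. The sample records and function names are hypothetical and are not taken from the course.

```python
from collections import defaultdict

# Hypothetical sample records: (city, fare). Not from the course data sets.
trips = [("NYC", 12.5), ("SF", 30.0), ("NYC", 7.5), ("SF", 10.0), ("LA", 3.0)]

# SQL expresses the whole request at once, for example:
#   SELECT city, SUM(fare) FROM trips WHERE fare > 5 GROUP BY city;
# A Pig Latin script instead builds the result one named step at a time,
# roughly: FILTER, then GROUP, then FOREACH ... GENERATE.

def filter_step(rows, min_fare):
    """Keep rows whose fare exceeds min_fare (similar in spirit to FILTER)."""
    return [row for row in rows if row[1] > min_fare]

def group_step(rows):
    """Collect the fares for each city (similar in spirit to GROUP ... BY)."""
    groups = defaultdict(list)
    for city, fare in rows:
        groups[city].append(fare)
    return dict(groups)

def sum_step(groups):
    """Total the fares in each group (similar in spirit to FOREACH ... GENERATE SUM)."""
    return {city: sum(fares) for city, fares in groups.items()}

filtered = filter_step(trips, 5)
grouped = group_step(filtered)
totals = sum_step(grouped)
print(totals)
```

Each intermediate name (`filtered`, `grouped`, `totals`) can be inspected on its own, which is the main practical difference from writing one nested SQL statement.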

Use Cases: Real-life applications are important for understanding Hadoop and its components, so we will learn by designing a sample data pipeline in Hadoop to process big data. Also, understand how companies are adopting modern data architecture, i.e. the Data Lake, in their data infrastructure.

Practice: Practice with huge data sets. Learn design and optimization techniques by designing data models and data pipelines using data sets from real-life applications.

Check out some of our reviews from real students:


"A nice learning for beginners, the thing which differentiates this course from other similar courses is that it has very "effective and concise" content, so do even a layman can understand easily. The course shows only 3 hours of on-demand video lecture but one should always give time to each lecture ( by means of bookmarks and pause), then you would able to understand all the basics of Big data and Hadoop."


"I liked the hands-on approach. very helpful."

"Overall definitely worth the money for what you get, I learnt so much about Big Data."

"I absolutely recommend taking this course."


"Presenter explains in simple terms and any lay person or someone like me who has no background about databases and data can understand. Explaining the business use case application us very helpful in understanding how this can be useful for everyday business."

"Loved it. Saved lots of time searching information on the internet."

"Very informative, and the course gave me what I was looking for. Thanks!"


"Big Data introduction can be daunting with several new keywords and components that one needs to understand. But, this course very clearly explains to a beginner about the architecture and different tools that can be leveraged in a big data project. It also has indications on the scope of big data in the industry, different roles one can perform in the big data space and also cover various commercial distributions of big data. Overall, a great course for a beginner to get started on the fundamentals of big data. Use Case is a bonus !"



Who this course is for:

  • This course is for anyone (student, developer, manager) who is interested in learning big data. It treats everyone as a beginner and teaches the fundamentals of Big Data, Hadoop, and its complex architecture.

Course content

7 sections • 32 lectures • 3h 11m total length

  • Preview
    03:49

  • Introduction to Big Data
    09:23
  • Job Roles in Big Data
    06:30
  • Salary Analysis
    02:55
  • Technology Trends in the Market
    06:30
  • Advice for Big Data Beginners
    02:45

  • Introduction to Hadoop
    08:23
  • Hadoop Ecosystem
    05:01
  • Hadoop 1.x vs Hadoop 2.x
    14:13
  • ETL vs ELT
    03:19
  • Different Hadoop Vendors
    04:20
  • Hadoop Installation
    14 pages
  • Preview
    09:09
  • Hadoop on Cloud
    05:11

  • Introduction to Hive
    02:41
  • Hive Architecture
    02:28
  • Hive Data Model
    07:55
  • File Formats in Hive (Text, Parquet, RCFile, ORC)
    04:40
  • SQL vs HQL
    03:46
  • UDF & UDAF in Hive
    02:57
  • Hive Demo
    18:50

  • Introduction to Pig
    02:57
  • Pig Architecture
    01:39
  • Pig Data Model
    02:17
  • How Pig Latin Works
    02:57
  • SQL vs PIG
    05:32
  • UDF in Pig
    03:26
  • Pig Demo
    12:49

  • Designing Data Pipeline using Pig and Hive
    07:59
  • Data Lake
    05:24

  • Practice-1: Analyzing Taxi Trips Data
    04:08
  • Practice-2: Designing Hive UDF
    03:46

Instructor

Andalib Ansari
Big Data Architect
  • 4.2 Instructor Rating
  • 1,983 Reviews
  • 27,714 Students
  • 1 Course

My name is Andalib Ansari and I am a Big Data Architect with over 7 years of experience in the online gaming, ride-hailing, SaaS, and telecom industries. The tools and technologies I have worked with include Python, Scala, Hadoop, MapReduce, Hive, Pig, Spark, SQL, Data Warehouse design, Data Lake design, large-scale data pipelines, analytics infrastructure built with Tableau, Pentaho Data Integration (Kettle), AWS EMR, AWS Redshift, MySQL, BigQuery, React, and JavaScript/TypeScript. I have created various online courses on Big Data technologies, which have reached over 27k students in 145+ countries.

I am passionate about Data Engineering and the latest technologies, and I am looking forward to sharing my passion and knowledge with you.


© 2021 Udemy, Inc.