Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Hadoop Spark Hive Big Data Admin Class Bootcamp Course NYC
Rating: 3.9 out of 5(209 ratings)
12,294 students

Hadoop Spark Hive Big Data Admin Class Bootcamp Course NYC

Learn installations and architecture of Hadoop, Hive, Spark, and other tools. Handle structured & Unstructured Data
Created byShivgan Joshi
Last updated 2/2019
English

What you'll learn

  • Kick start with basics for career in Big Data Hadoop in NY Area 312 285 6886
  • Learn how to install different tools on Hadoop
  • Learn enough Hadoop to join our NYC Bootcamp on Hadoop Big Data

Course content

5 sections34 lectures1h 26m total length
  • Introduction to the Course0:11
  • Course Syllabus, Scope, Intro Video10:33
  • Intro to HDFS & Hadoop Architecture8:29
  • Intro to Mapreduce - Pulling data from HDFS3:33
  • Map reduce3:40

Requirements

  • Basic knowledge of Programming and SQL would help
  • Basic idea of Python SQL Data Analytics would help

Description

Introduction Hadoop Big Data Course

  • Introduction to the Course

Top Ubuntu commands

Understand NameNode, DataNode, YARN and Hadoop Infrastructure

 

Hadoop Install

  • Hadoop Installation & HDFS Commands

  • Java based Mapreduce

# Hadoop 2.7  / 2.8.4

Learn HDFS commands

Setting up Java for mapreduce

Intro to Cloudera Hadoop & studying Cloudera Certification

SQL and NoSQL

  • SQL, Hive and Pig Installation (RDBMS world and NoSQL world)

  • More Hive and SQOOP (Cloudera – Sqoop and Hive on Cloudera.

  • JDBC drivers.   

  • Pig

  • Intro to NoSQL, MongoDB, Hbase Installation


Understanding different databases 

 

Hive : 

  1. Hive Partitions and Bucketing

  2. Hive External and Internal Tables

Spark Scala Python

  • Spark Installations and Commands

  • Spark Scala Scala Sheets

  • Hadoop Streaming Python Map Reduce

  • PySpark – (Python – Basics). RDDs.

 

Running Spark-shell and importing data from csv files

PySpark – Running RDD


  Mid Term Projects

  1. Pull data from csv online and move to Hive using hive import

  2. Pull data from spark-shell and run map reduce for fox news first page

  3. Create Data in MySQL and using SQOOP move it to HDFS

  4. Using Jupyter Anaconda and Spark Context run count on file that has Fox news first page

  5. Save raw data using delimiter comma, space, tab and pipe and move that into spark-context and spark shell


  Broadcasting Data – stream of data 

Kafka Message Broadcasting


 



Who this course is for:

  • Carrier changes who would like to move to Big Data Hadoop
  • Learners who want to learn Hadoop installations
  • New York Students who want to move to wall street