Talend For Big Data Integration Course : Beginner to Expert
4.1 (14 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
68 students enrolled
Wishlisted Wishlist

Please confirm that you want to add Talend For Big Data Integration Course : Beginner to Expert to your Wishlist.

Add to Wishlist

Talend For Big Data Integration Course : Beginner to Expert

Master guide for using Talend Big Data
4.1 (14 ratings)
Instead of using a simple lifetime average, Udemy calculates a course's star rating by considering a number of different factors such as the number of ratings, the age of ratings, and the likelihood of fraudulent ratings.
68 students enrolled
Last updated 4/2017
English
Price: $100
30-Day Money-Back Guarantee
Includes:
  • 16.5 hours on-demand video
  • 5 Articles
  • 6 Supplemental Resources
  • Full lifetime access
  • Access on mobile and TV
  • Certificate of Completion
What Will I Learn?
  • Learn Basic concepts of Big Data (Hadoop)
  • Create cluster Metadata manually, from configuration files and automatically
  • Create HDFS and Hive metadata Connect to your cluster to use HDFS, HBase, Hive, Pig, Sqoop and Map Reduce
  • Read and Write data to/from HDFS (HDFS, HBase)
  • Read and Write tables to/from HDFS (Hive, Sqoop)
  • Processing Tables stored on HDFS with Hive
  • Processing data stored on HDFS with Pig
  • Use Talend Open Studio for Big Data for real work as quickly as possible.
  • Write Talend Big Data v6 Certified Developer Exam
  • Work on Cloudera Hadoop Distribution
  • Work on HortonWorks Hadoop Distribution
  • Over 100 Lectures and Hours of Content !
  • Over 50 Exercises and Quiz Questions!
  • Once you finish this Course I guarantee, you will Pass the Certification Exam. (Offcourse you have to practice what ever I teach in this course :-)).
  • You will get Source code and Data Used in all 50 + Exercises.
  • You will get Source code and Data Used in all 100 + Jobs Designed in the Course.
  • I will respond to all your questions within 24 hours.
  • 50 % Off on my other course (Talend Data Integration Course : Beginner to Expert)
View Curriculum
Requirements
  • Talend Data Integration Basics
Description

Course Description

Talend Open Studio for Data Integration is an open Source ETL Tool, which means small companies or businesses can use this tool to perform Extract Transform and Load their data into Databases or any File Format (Talend supports many file formats and Database vendors).

Talend Open Studio for Big Data is an open Source Tool used to interact with Big Data systems from Talend.

If you want to learn how to use Talend Open Studio for Big Data from SCRATCH or If you want to IMPROVE your skills in Big Data Concepts and designing Talend Jobs, then this course is right for you.

Its got EVERYTHING, covers almost all the topics in Talend Open Studio for Big Data.

Talks about Real Time USE CASES.

Prepares you for the Certification Exam.

By the end of the Course you will Master Working with Big Data by designing Talend Jobs.

And what more you ask, All the Videos are HD Quality.

What Are the System Requirements ?

  • PC or Mac.
  • Virtual Box Which is FREE.
  • Talend Software Which is FREE.
  • HDP VM Which is FREE.
  • CDH VM Which is FREE.
Who is the target audience?
  • Any person who wants to use Talend Studio to interact with Big Data systems.
Students Who Viewed This Course Also Viewed
Curriculum For This Course
Expand All 109 Lectures Collapse All 109 Lectures 16:33:25
+
What Does the Course Cover ?
2 Lectures 08:39

This Video will talk about the topics that are covered in this course.

Preview 04:25

This video will show you how to download data files and Talend jobs that are designed as part of this course.

How to Download The Data files and Job files ?
04:14
+
TALEND OVERVIEW
2 Lectures 11:02

This lecture gives you an overview of what Talend Open Studio for Big data is and it also talks about the additional features in the subscription version.

Preview 07:10

After watching this lecture, you should be able to download and open Talend Open Studio for Big Data on your Windows OS.

Installing Talend Open Studio for Big Data on Windows/Mac/Linux
03:52
+
BIG DATA OVERVIEW
12 Lectures 01:13:22

You should be able to describe the 3 V's of Big Data and what big data is ?

Preview 05:21

Introduction to Hadoop and its advantages over traditional systems.

About Hadoop
10:42

Walks you through different Hadoop ecosystem Tools.

The Hadoop Ecosystem
05:21

What is a HDFS Block, What is a Namenode and its functions, Waht is a DataNode and its functions.

HDFS - Understanding Block Storage, NameNode and DataNode
05:33

This lecture explains you, What happens when you Read/Write a file from/to to HDFS.

HDFS - Architecture
05:40

Gives an overview of MapReduce and its functions.

MapReduce - Overview of MapReduce
12:38

Explains the Map phase, the Shuffle Sort Phase and the Reduce Phase in a Map Reduce Job.

MapReduce - Understanding MapReduce
09:46

Explains different types of Key-Values pairs generated as part of a MapReduce Job.

MapReduce - The Key/Value Pairs of MapReduce
02:22

HDFS - HDFS Federation & NameNode High Availability Hadoop 2
04:48

Briefly explains YARN and its daemons.

YARN - The Components of YARN
04:03

Explains what happens when you run an application on YARN.

YARN - Lifecycle of a YARN Application
03:12

Answers will followed :-)

Big Data Overview - Quiz
03:56
+
Getting Started
5 Lectures 43:59

After watching this video you should be able to download CDH virtual box image and your cluster should be up and running.

Installing Cloudera CDH VM
08:13

After watching this video you should be able to download HDP virtual box image and your cluster should be up and running.

Installing HortonWorks Sandbox VM
11:57

You should be able to create and open a Talend project.

Opening Talend project
04:40

This lecture walks you through the process of creating a Hadoop cluster Metadata using HDP.

Creating Hadoop Cluster Metadata in Talend for HDP
15:20

Creating Hadoop Cluster Metadata in Talend for CDH
03:49
+
HDFS Components
15 Lectures 01:46:46

This video will explain what HDFS is and shows you how to run different hdfs commands on cluster.

For example, how to create a file/directory,how to copy file from local system to hdfs, etc..

HDFS - Basic Commands Using Unix Shell
17:15

It will show you how to create a HDFS connection in a way that you can use it in all of your HDFS jobs.

How to create a reusable connection to the HDFS ?
06:37

Explains the scenario where you have to copy a file or directory from your local system to HDFS.

How to copy a source file or folder into a target directory on HDFS ?
08:51

How to retrieve a list of files or folders based on a filemask pattern ?
06:36

How to copy files from HDFS to HDFS ?
05:11

How to get files from HDFS into local directory ?
07:40

How to rename the selected files or specified directory on HDFS ?
01:52

How to check whether a file exists in a specific directory in HDFS ?
08:34

How to delete a file located on a given HDFS ?
07:30

How to read a file located on a given HDFS and Assign schema to it?
11:02

How to count the number of rows in a file in HDFS ?
03:32

How to present the properties of a file processed in HDFS ?
03:36

How to transfer data flows into a given HDFS file system ?
04:56

How to transfer data in the form of a single column into a given HDFS ?
02:45

Explains the scenario where you have to compare two different files that are on HDFS.

How to compare two files on HDFS ?
10:49
+
HIVE Components
16 Lectures 02:22:38
What is Hive ?
05:06

Hive Architecture
04:45

HiveQL Vs SQL
05:21

How to Connect to Hive Shell
04:06

How to Create Hive Managed and External Tables Using Hive Shell
13:53

How to Load data from HDFS & Local File System to Hive table using Hive Shell
15:12


How to join two HIVE Tables using Hive Shell
10:08

How to READ data from a HIVE Table and filter data using Hive Shell
01:54

How to open a connection to a Hive database using Talend?
25:59

How to close connection to a Hive databases using Talend?
06:13

How to create a Hive table using Talend?
17:26

How to extract data from Hive using Talend?
05:56

How to write data of different formats into a given Hive table using Talend?
12:21

How to execute the HiveQL query using Talend?
06:16

Answers will followed :-)

Hive - Quiz
00:56
+
PIG Components
21 Lectures 03:34:10
What is Pig ?
11:59

What are the different Datatypes supported by Pig ?
06:44

How to Assign a schema to input file using Grunt Shell ?
20:51

What are aliases,relations and How to Load a file into Pig Alias ?
12:20

Pig - GROUP,GROUP ALL,DUMP,STORE,FILTER,LIMIT Operators
25:46

Pig - FOREACH, COUNT, MAX Operators
19:40

Pig - ORDER BY,DISTINCT,JOIN,COGROUP
21:06

How to load input data to an output stream in one single transaction ?
19:10

How to filter data from a relation based on conditions ?
03:26

How to select one or more columns from a relation ?
07:27

How to store the result of your Pig Job into a defined data storage space ?
03:25

How to remove duplicate tuples in a relation ?
09:10

How to perform the Pig COGROUP operation ?
13:54

How to perform aggregations on input data to create data to be used by Pig ?
05:17

How to perform join of two files based on join keys ?
10:15

How to sort a relation based on one or more defined sort keys ?
02:26

How to duplicate the incoming schema into identical output flows as needed ?
02:32


How to transform data from multiple sources to multiple targets ?
07:45

How to integrate personalized Pig Code into a Talend Job ?
05:19

Answers will followed :-)

Pig - Quiz
01:24
+
SQOOP Components
9 Lectures 02:07:47

Download the jobs designed in this section from here.

What is Sqoop ?
11:13


How to transfer data from a RDBMS into the HDFS ? - Part1
21:10

How to transfer data from a RDBMS into the HDFS ? - Part2
17:51

Coming Soon :-)

How to transfer all of the tables of a RDBMS into the HDFS ?
10:50


How to import incremental data ? - Part1
20:28

How to import incremental data ? - Part2
23:35



How to transfer data from the HDFS to a RDBMS ?
10:54

Answers will followed :-)

Sqoop - Quiz
00:38
+
HCATALOG Components
5 Lectures 24:02


What is HCatalog ?
04:52


How to perform Operations on HCatalog managed Hive database/table/partition
08:17


How to Load data into a Hive Table from a file on HDFS using Hcatalog?
04:29


How to Load data into a Hive Table from a file on Local System using Hcatalog?
03:58


How to Read/extract data from hive tables using hcatalog ?
02:26
+
HBASE Components
6 Lectures 24:03


What is HBase ?
05:27


How to open a connection to an HBase database ?
04:41


How to close an active connection to an HBase database ?
00:54


How to writes columns of data into a given HBase database ?
07:14


How to read data from a HBase database and extract columns of selection ?
05:18

Answers will followed :-)

Hbase - Quiz
00:29
3 More Sections
About the Instructor
Mr. Kapil Chaitanya Kasarapu
4.0 Average rating
124 Reviews
374 Students
2 Courses

8 years experience in the area of DataStage, Talend ETL Design and Architecture, data analysis, and/or reporting, data management, modeling.

5 Years of experience in the area of Big Data.

Functional work in and familiarity with enterprise architecture concepts.

Passionate about data, technology, and innovation.

Certified Talend Open Studio for Data Integration 6.0 Consultant

Certified Talend Big Data 6.0 Developer

Certified Talend Real Time Big Data 6.0 Developer

HDP Certified Developer