Find online courses made by experts from around the world.
Take your courses with you and learn anywhere, anytime.
Learn and practice real-world skills and achieve your goals.
Talend Open Studio for Data Integration is an open Source ETL Tool, which means small companies or businesses can use this tool to perform Extract Transform and Load their data into Databases or any File Format (Talend supports many file formats and Database vendors).
Talend Open Studio for Big Data is an open Source Tool used to interact with Big Data systems from Talend.
If you want to learn how to use Talend Open Studio for Big Data from SCRATCH or If you want to IMPROVE your skills in Big Data Concepts and designing Talend Jobs, then this course is right for you.
Its got EVERYTHING, covers almost all the topics in Talend Open Studio for Big Data.
Talks about Real Time USE CASES.
Prepares you for the Certification Exam.
By the end of the Course you will Master Working with Big Data by designing Talend Jobs.
And what more you ask, All the Videos are HD Quality.
What Are the System Requirements ?
Not for you? No problem.
30 day money back guarantee.
Learn on the go.
Desktop, iOS and Android.
Certificate of completion.
|Section 1: What Does the Course Cover ?|
This Video will talk about the topics that are covered in this course.
This video will show you how to download data files and Talend jobs that are designed as part of this course.
|Section 2: TALEND OVERVIEW|
This lecture gives you an overview of what Talend Open Studio for Big data is and it also talks about the additional features in the subscription version.
After watching this lecture, you should be able to download and open Talend Open Studio for Big Data on your Windows OS.
|Section 3: BIG DATA OVERVIEW|
You should be able to describe the 3 V's of Big Data and what big data is ?
Introduction to Hadoop and its advantages over traditional systems.
Walks you through different Hadoop ecosystem Tools.
What is a HDFS Block, What is a Namenode and its functions, Waht is a DataNode and its functions.
This lecture explains you, What happens when you Read/Write a file from/to to HDFS.
Gives an overview of MapReduce and its functions.
Explains the Map phase, the Shuffle Sort Phase and the Reduce Phase in a Map Reduce Job.
Explains different types of Key-Values pairs generated as part of a MapReduce Job.
HDFS - HDFS Federation & NameNode High Availability Hadoop 2
Briefly explains YARN and its daemons.
Explains what happens when you run an application on YARN.
|Section 4: Getting Started|
After watching this video you should be able to download CDH virtual box image and your cluster should be up and running.
After watching this video you should be able to download HDP virtual box image and your cluster should be up and running.
You should be able to create and open a Talend project.
This lecture walks you through the process of creating a Hadoop cluster Metadata using HDP.
Creating Hadoop Cluster Metadata in Talend for CDH
|Section 5: HDFS Components|
This video will explain what HDFS is and shows you how to run different hdfs commands on cluster.
For example, how to create a file/directory,how to copy file from local system to hdfs, etc..
It will show you how to create a HDFS connection in a way that you can use it in all of your HDFS jobs.
Explains the scenario where you have to copy a file or directory from your local system to HDFS.
How to retrieve a list of files or folders based on a filemask pattern ?
How to copy files from HDFS to HDFS ?
How to get files from HDFS into local directory ?
How to rename the selected files or specified directory on HDFS ?
How to check whether a file exists in a specific directory in HDFS ?
How to delete a file located on a given HDFS ?
How to read a file located on a given HDFS and Assign schema to it?
How to count the number of rows in a file in HDFS ?
How to present the properties of a file processed in HDFS ?
How to transfer data flows into a given HDFS file system ?
How to transfer data in the form of a single column into a given HDFS ?
Explains the scenario where you have to compare two different files that are on HDFS.
|Section 6: HIVE Components|
What is Hive ?
HiveQL Vs SQL
How to Connect to Hive Shell
How to Create Hive Managed and External Tables Using Hive Shell
How to Load data from HDFS & Local File System to Hive table using Hive Shell
How to Load data from one Hive table to another Hive table using Hive ShellPreview
How to join two HIVE Tables using Hive Shell
How to READ data from a HIVE Table and filter data using Hive Shell
How to open a connection to a Hive database using Talend?
How to close connection to a Hive databases using Talend?
How to create a Hive table using Talend?
How to extract data from Hive using Talend?
How to write data of different formats into a given Hive table using Talend?
How to execute the HiveQL query using Talend?
|Section 7: PIG Components|
What is Pig ?
What are the different Datatypes supported by Pig ?
How to Assign a schema to input file using Grunt Shell ?
What are aliases,relations and How to Load a file into Pig Alias ?
Pig - GROUP,GROUP ALL,DUMP,STORE,FILTER,LIMIT Operators
Pig - FOREACH, COUNT, MAX Operators
Pig - ORDER BY,DISTINCT,JOIN,COGROUP
How to load input data to an output stream in one single transaction ?
How to filter data from a relation based on conditions ?
How to select one or more columns from a relation ?
How to store the result of your Pig Job into a defined data storage space ?
How to remove duplicate tuples in a relation ?
How to perform the Pig COGROUP operation ?
How to perform aggregations on input data to create data to be used by Pig ?
How to perform join of two files based on join keys ?
How to sort a relation based on one or more defined sort keys ?
How to duplicate the incoming schema into identical output flows as needed ?
How to compute the cross data of two or more relations ?Preview
How to transform data from multiple sources to multiple targets ?
How to integrate personalized Pig Code into a Talend Job ?
|Section 8: SQOOP Components|
Coming Soon :-)
Coming Soon :-)
Coming Soon :-)
Coming Soon :-)
Coming Soon :-)
|Section 9: HCATALOG Components|
|Section 10: HBASE Components|
|Section 11: MongoDB Integration|
Introduction & Installation of MongoDB
How to open a connection to a MongoDB database ?
How to close an active connection to a MongoDB database ?
How to write columns of data into a given MongoDB Collection ?
8 years experience in the area of DataStage, Talend ETL Design and Architecture, data analysis, and/or reporting, data management, modeling.
5 Years of experience in the area of Big Data.
Functional work in and familiarity with enterprise architecture concepts.
Passionate about data, technology, and innovation.
Certified Talend Open Studio for Data Integration 6.0 Consultant
Certified Talend Big Data 6.0 Developer
Certified Talend Real Time Big Data 6.0 Developer
HDP Certified Developer