Getting Started with Apache Flink

Name: Getting Started with Apache Flink
Rating: 3.3 (171 reviews)

A Basic Guide to Apache Flink for Beginners

Created byInflame Tech

Last updated 2/2019

English

What you'll learn

Architecture of Apache Flink
Features of Apache Flink
Distributed Execution
Job Manager
Task Manager
Job Client
Set Path on Environment Variables
Installation on Ubuntu
Multiple Java Installation

Course content

1 section • 18 lectures • 54m total length

1.1 What is Apache Flink2:27
1.2 History of Apache Flink1:42
1.3 Architecture of Apache Flink2:12
Explore the layered architecture of Apache Flink, its components and libraries, and how job graphs, data streams, and datasets interact via the API.
1.4 Features of Apache Flink4:11
1.5 Distributed Execution1:36
1.6 Job Manager6:34
1.7 Task Manager1:04
1.8 Job Client1:35
1.9 Download JDK1:54
Download and install the Java Development Kit (JDK) on Windows, accept the license, choose an install folder, and complete the setup.
1.10 Set Path on Environment Variables1:28
Learn to locate and edit environment variables, set the Java home, and configure the path to prepare the Apache Flink development environment.
1.11 Downloading Flink1:35
1.12 Installation Of Flink2:16
Explore the installation of Flink on Windows, start a local cluster, and verify the environment to ensure Flink runs smoothly.
1.13 Download VMware workstation player & ubuntu ISO image2:09
1.14 Installation of VMware workstation player1:26
1.15 Ubuntu Installation2:50
1.16 Ubuntu Installation part-20:40
1.17 Multiple Java Installation9:59
Install java and manage jdk versions in a terminal environment using sudo and interactive prompts. Handle storage space, license acceptance, and environment configuration to complete the setup.
1.18 Installation of flink on Ubuntu9:02

Requirements

Basics of Computer Science
Basics of Big Data
Basics of Analytics

Description

Apache Flink is an open source platform for distributed stream and batch data processing. It can run on Windows, Mac OS and Linux OS. In this blog post, let’s discuss how to set up Flink cluster locally. It is similar to Spark in many ways – it has APIs for Graph and Machine learning processing like Apache Spark – but Apache Flink and Apache Spark are not exactly the same.

Flink is an alternative of MapReduce, it processes data more than 100 times faster than MapReduce. Flink is independent of Hadoop but it can use HDFS to read, write, store, process the data. Flink does not provide its own data storage system. It takes data from distributed storage. The development of Flink is started in 2009 at a technical university in Berlin under the stratosphere. It was incubated in Apache in April 2014 and became a top level project in December 2014. Flink is a German word meaning swift / Agile. The logo of Flink is a squirrel, in harmony with Hadoop ecosystem.

Who this course is for:

Big Data Developers who want to analyse and process their data using their Flink
Spark Developers who want to upgrade their skills using Apache Flink