Udemy
Getting Started with Apache Flink
Rating: 3.8 out of 5 (17 ratings)
1,458 students

An Overview of Apache Flink
Created by Anurag Kaushik
Last updated 11/2022
English

What you'll learn

  • Architecture of Apache Flink
  • Distributed Execution
  • Job Manager & Task Manager
  • How to download & install Flink on different machines

Course content

1 section • 18 lectures • 54m total length
  • What is Apache Flink (2:27)
  • History of Apache Flink (1:42)

    Trace the history of Apache Flink from the Stratosphere project to an Apache Incubator project, showing its evolution from a Java API to a platform for batch, stream, and graph processing and machine learning.

  • Architecture of Apache Flink (2:12)
  • Features of Apache Flink (4:11)

    Discover Apache Flink's high-performance, low-latency features: exactly-once stateful computation, flexible time and session windows, fault-tolerant distributed snapshots, and memory-efficient stream and batch data processing.

  • Distributed Execution (1:36)
  • Job Manager (6:34)

    Explore how Flink's job managers coordinate task execution, scheduling, and fault-tolerant checkpoints, with the Akka actor system enabling leader election and communication with task managers.

  • Task Manager (1:04)

    Explore how task managers serve as worker nodes that execute tasks in the JVM, allocate memory per task slot, balance parallelism across slots, and share TCP connections and heartbeat messages.

  • Job Client (1:35)
  • Download JDK (1:54)
  • Set Path in Environment Variables (1:28)
  • Downloading Flink (1:35)
  • Installation of Flink (2:16)
  • Download VMware Workstation Player & Ubuntu ISO image (2:09)
  • Installation of VMware Workstation Player (1:26)
  • Ubuntu Installation (2:50)
  • Ubuntu Installation Part-2 (0:40)
  • Multiple Java Installation (9:59)
  • Installation of Flink on Ubuntu (9:02)

Requirements

  • Basic knowledge of SQL

Description

Apache Flink is an open source, distributed framework for stateful stream and batch data processing. It is shipped by vendors such as Cloudera, MapR, Oracle, and Amazon. The examples provided in this course have been developed using Cloudera Apache Flink. This course is intended for those who want to learn Apache Flink.

Apache Flink is used to process huge volumes of data with low latency, and its SQL support lets you build on traditional SQL knowledge.

To make the most of this course, you should have a good understanding of the basics of Hadoop and HDFS commands. It is also recommended to have a basic knowledge of SQL before going through this course.

Apache Flink is a next-generation Big Data tool, sometimes described as the 4G of Big Data.

It is a true stream processing framework: it handles records as they arrive rather than cutting the stream into micro-batches.
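To make the difference concrete, here is a minimal sketch in plain Python of how micro-batching adds waiting time that event-at-a-time processing avoids. It is a toy model, not Flink code: the function names, the arrival times, and the 1-second batch interval are all hypothetical, and processing time itself is ignored.

```python
import math

def per_event_emit_times(arrivals):
    """Event-at-a-time model: each record is handled as soon as it arrives."""
    return list(arrivals)

def micro_batch_emit_times(arrivals, interval):
    """Micro-batch model: a record waits until the end of its batch interval."""
    return [(math.floor(t / interval) + 1) * interval for t in arrivals]

def added_latency(arrivals, emit_times):
    """Time each record spends waiting before it is processed."""
    return [emit - arrive for arrive, emit in zip(arrivals, emit_times)]

# Hypothetical arrival times in seconds (assumption, for illustration only).
arrivals = [0.25, 0.5, 1.75, 3.0]

streaming = added_latency(arrivals, per_event_emit_times(arrivals))
batched = added_latency(arrivals, micro_batch_emit_times(arrivals, interval=1.0))

print("per-event waiting time:  ", streaming)
print("micro-batch waiting time:", batched)
```

In this model the per-event waiting time is always zero, while under micro-batching a record can wait up to a full batch interval before it is touched.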

Flink’s kernel (core) is a streaming runtime which also provides distributed processing, fault tolerance, etc.

Flink processes events at a consistently high speed with low latency.

It is a large-scale data processing framework that can handle data generated at very high velocity.

Flink is an alternative to MapReduce and is claimed to process data more than 100 times faster. It is independent of Hadoop, but it can use HDFS to read and write data. Flink does not provide its own data storage system; it takes data from distributed storage.

Who this course is for:

  • Students, Programmers, Learners