Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Apache Kafka Crash Course for Java and Python Developers

Name: Apache Kafka Crash Course for Java and Python Developers
Rating: 4.4 (482 reviews)

Quickly gain valuable skills in Apache Kafka as a Python or Java dev taught by a 2X Certified Confluent Kafka Engineer

Created byAdam McQuistan

Last updated 11/2023

English

What you'll learn

Kafka Basics: Key Architecture Components and Data Flow
Kafka Admin Client API (Java in Spring for Kafka and Python kafka-python)
Kafka Producer Client API (Java in Spring for Kafka and Python kafka-python)
Kafka Consumer Client API (Java in Spring for Kafka and Python kafka-python)
Schema Registry (Java in Spring for Kafka and Python confluent-kafka)
Kafka Connect for Data Pipelining into and out of Kafka
Overview of Stream Processing with Kafka
Kafka Streams in Java and Spring for Kafka
Faust Stream Processing with Python

Course content

9 sections • 100 lectures • 12h 3m total length

Course Introduction0:19
Join this Apache Kafka crash course to rapidly upskill Python and Java developers by mastering the most important features and functionalities of Kafka.
Overview0:52
Format0:59
Outline2:02
About Course Instructor2:11

Introduction0:35
Explore the basics of Kafka architecture, including topics, partitions, and event anatomy, and trace the data flow from producers to consumers in Kafka.
Why Apache Kafka?8:42
50K Foot View of Kafka1:46
Explore how data flows from producers to Kafka and to consumers in a real-time publish-subscribe system, enabling loose coupling. Make messages immutable and store them in an append-only log.
10K Foot View of Kafka2:11
1K Foot View of Kafka9:36
Apache Kafka uses topics and partitions to distribute writes across brokers for higher throughput, while replication and partition leaders provide fault tolerance and scalable consumption via consumer groups and offsets.
Anatomy of a Message3:49
Kafka Default Topic Storage5:42
Kafka Compacted Topics4:21
Kafka APIs3:20
Explore the five open source Kafka APIs—admin, producer, consumer, connect, and streams—and learn how connectors simplify moving data between Kafka and systems like PostgreSQL, MongoDB, and S3.
Kafka API Implementation Languages3:17
Explore Apache Kafka API implementations across languages, including native Java admin, producer, consumer, and streams APIs, and librdkafka bindings for Python, C++, C#, Go, Swift, and Node.
Dockerized Dev Environment Setup8:18
Kafka Basis, Storage, and Data Flow Knowledge Check

Admin API Introduction0:33
CLI to Manage Topics8:41
Learn to manage Kafka topics with the Kafka topics CLI using the admin API, including listing, creating (with partitions and replication), describing, and deleting topics in a Docker Compose environment.
CLI to Manage Topics Continued (Advanced Configurations)9:49
Python to Create Topics16:54
Python Create and Alter Topics with Advanced Configurations9:00
Java (Spring for Kafka) to Create Topics7:26
Java (Spring for Kafka) to Create and Alter Topics with Advanced Configurations18:33
Admin API Knowledge Check

Producer API Introduction0:31
Producers at a High Level5:17
Producers and their Influence on Message Partition Assignment6:03
Discover how producers map messages to partitions to boost Kafka scalability, using key-based hash-partitioning and round-robin when keys are absent.
CLI Tools for Producing Messages to Kafka5:55
Basic Producer in Python15:02
Basic Producer in Java (Spring for Kafka)14:44
Basic Producer in Java Alt Configuration (Spring for Kafka)6:29
Detailed Overview of Kafka Producer18:23
Changing Partitions Change Ordering4:34
Advanced Producer in Python20:33
Advanced Producer in Java (Spring for Kafka)18:44
Producer Configuration Documentation0:01
Producer API Knowledge Check

Consumer API Introduction0:54
Consumer Group Offsets and Progress Tracking7:53
Consumer Group Rebalances6:15
Explore how consumer group rebalances enable scalable, fault-tolerant consumption in Kafka by redistributing partitions among more consumers and tracking progress via the consumer offsets topic.
Basic Consumer in Python16:51
Basic Consumer in Java (Spring for Kafka)17:40
Auto Offset Commits and At Least Once Processing11:44
Manual Offset Commits and At Least Once Processing6:18
Disable auto commit and perform manual offset commits for at least once processing, risking possible reprocessing if failures occur, with batch commits in Spring Kafka and explicit commits in Python.
Manual Offset Commits and At Most Once Processing5:28
Manual Offset Commits and Exactly Once Processing8:19
Advanced Consumer in Python10:53
Advanced Consumer in Java (Spring for Kafka)8:35
Learn to build an advanced Spring for Kafka Java consumer that commits after each record, achieving near exactly-once processing by enabling record mode and disabling auto commit.
Viewing Consumer Group Offsets5:07
Updating Consumer Group Offsets5:58
Consumer Configuration Documentation0:01
Consumer API Knowledge Check

Schema Registry Introduction0:30
Explore the Confluent Schema Registry, a Kafka metadata management tool for messages and their data structures, and learn how it governs schemas across Kafka topics.
What is Confluent Schema Registry1:01
Why use Confluent Schema Registry2:32
How Schema Registry Fits into Kafka Architecture8:35
Quick Overview of Apache Avro6:05
Schema Registry Compatibility Settings and Schema Evolution Checks10:52
Explore schema registry compatibility settings, including backwards, forwards, and transitive rules, and configure compatibility levels in Docker and via the rest API, with Avro integration for producers.
Java Demo Part 1: Avro Library Project Setup14:04
Java Demo Part 2: Integrating Avro and Schema Registry in a Producer Application23:55
Java Demo Part 3: Integrating Avro and Schema Registry on a Consumer Application17:32
build a spring boot kafka consumer integrated with avro and the schema registry, using a reusable avro domain events library and confluent deserializers to process messages.
Java Demo Part 4: Evolving the Schema16:18
Python Demo Part 1: Integrating Avro and Schema Registry in a Producer34:09
Python Demo Part 2: Integrating Avro and Schema Registry in a Consumer12:39
Python Demo Part 3: Evolving the Schema8:54
Overview of Schema Registry REST API10:21
Explore the Confluent Schema Registry REST API to manage subjects for topic key or value schemas, query versions and IDs, validate compatibility, and register schemas via post requests.
Schema Registry Knowledge Check

Kafka Connect Introduction0:37
What is Kafka Connect1:52
Why use Kafka Connect1:07
Discover why Kafka Connect provides a robust, scalable low-code data pipeline with Kafka as the intermediary, backed by a vast ecosystem of connectors and a flexible API for customization.
How Kafka Connect Fits into Systems and Data Architecture4:10
Explains how Kafka Connect fits into a Kafka-driven architecture, ingesting data from databases, flat files, and apps via connectors, and sinking to Elasticsearch, cloud storage, and analytics platforms.
Conceptual Overview of a Connect Cluster and Starting a Connector Plugin9:25
Setting Up Datagen Connector in Docker6:36
Set up a local dockerized Kafka environment, build a custom Docker image with the Data Gen connector, and run Docker Compose up to view logs and learn Kafka Connect basics.
Configuring and Starting a Datagen Connector via REST API10:39
Query the Kafka Connect REST API to discover plugins. Configure and start a data gen connector with an Avro schema and schema registry, producing technologists data to a Kafka topic.
Using REST API to Manage Datagen Connector3:11
Kafka Connect Sink Demo with MongoDB11:31
Demonstrates wiring a Kafka topic to MongoDB using the MongoDB Sync Connector in Kafka Connect, including Docker Compose setup, topic creation, and verifying data flow into MongoDB.
Kafka Connect Documentation0:01
Kafka Connect Knowledge Check

Stream Processing Introduction0:36
The content for this section is currently being developed. Please check back soon.
What is Kafka Streams2:15
Streams and Tables4:00
Stateless and Stateful Transformations6:40
Learn stateless and stateful transformations in stream processing, applying map and filter to compute revenue per event, then group by customer ID and perform aggregates like reduce and windowing.
Processing Topologies6:06
Explore stream processing topologies built from source, sink, and stateful or stateless transforms forming directed acyclic flow. See order validation and revenue topologies that join and enrich data for insights.
Input Partitions Still Drive Parallelism and Throughput5:16
Java Stream Processing Demo Setup10:35
Java Stream Demo: Order Validation Service26:50
Java Streams Demo: Customer Revenue Service35:13
Build a multi-processor Kafka Streams app that enriches orders with product data, computes per-customer revenue, and outputs enriched results to a revenue topic.
Stream Processing Knowledge Check
Tumbling Windows2:36
Sliding Windows3:45
Session Windows3:29

Section Introduction0:15
What is Faust1:07
Key Data Constructs of Faust Library1:49
Types of Streaming Computations6:39
Faust Channels, Topics, Streams and Agents2:50
Demo: Install and Setup Faust & Agents and Topics5:29
Faust Tasks and Timers1:14
Demo: Faust Tasks and Timers1:31
Demonstrate Faust tasks and timers by running a demo app that logs a startup task and a timer that fires every 10 seconds.
Producing to Topics and Processing Streams in Faust2:40
Demo: Simple Faust Producer Consumer2:28
Processing Complex Types in Faust2:40
Learn to process complex types in Faust by defining a Faust record schema for message keys and values, creating greeting objects, and producing and consuming typed messages.
Demo: Producing and Consuming Complex Types4:14
Produce and consume on the Kafka greetings topic with a complex greeting object, sending the greeting record as the value and deserializing back on consumption.
Working with Tables in Faust3:41
Demo: Calculating Aggregates using Faust Tables4:16
Demonstrates building an aggregate table with Faust tables by consuming greetings events, grouping by the greeter, and incrementing per-greeter counts to produce a running total.
Understanding that Kafka Topic Partitions Still Drive Parallelism in Faust2:01

Requirements

Basic understanding of Docker along with comfortability using the CLI and familiarity with either Java or Python programming languages.

Description

A fast track to gain the skills needed to work with Apache Kafka as a Java or Python Software Engineer by taking the Kafka Crash Course developed and presented by a 2X Confluent Kafka Certified Engineer!

In this course students, Java or Python Software Developers, will be taken on a fast track journey to attaining skills required to harness the amazing power of Apache Kafka. Students gain the practical knowledge to build loosely coupled distributed systems that scale to insane levels of throughput while maintaining unprecedented resiliency.

Topics covered include:

Kafka Basics of Key Architecture Components and Data Flow

Kafka Admin API (In Java with Spring for Kafka as well as in Python)

Kafka Producer API (In Java with Spring for Kafka as well as in Python)

Kafka Consumer API(In Java with Spring for Kafka as well as in Python)

Confluent Schema Registry (In Java with Spring for Kafka as well as in Python)

Kafka Connect to Import and Export Data to/from Kafka from Common Source/Sink Systems

Overview of Stream Processing Basics with Kafka (Kafka Streams in Java and Faust Streams Python Framework)

The Apache Kafka Crash Course for Java and Python Developers is specifically designed for quickly getting Developers up to speed using Apache Kafka to be prepared for upcoming interviews or make timely yet significant contributions implementing Apache Kafka pub/sub messaging or event streaming in their current roles. The course provides a balance of fundamental theory on the inner workings of Apache Kafka's storage mechanism along with the know how to tune producer and consumer applications for performance and resiliency. This course is packed full of practical examples with code samples for putting the theoretical content into practice in two of the most popular languages used in industry, Java and Python.

Who this course is for:

Software Developers and Software Architects with experience programming in Java or Python interested in using Kafka for building robust, scalable, decoupled systems.

Apache Kafka Crash Course for Java and Python Developers

What you'll learn

Explore related topics

Course content

Introduction5 lectures • 6min

Kafka Architecture Foundations and Data Flow11 lectures • 52min

Kafka Admin API7 lectures • 1hr 11min

Kafka Producer API12 lectures • 1hr 56min

Kafka Consumer API14 lectures • 1hr 52min

Schema Registry14 lectures • 2hr 47min

Kafka Connect10 lectures • 49min

Stream Processing with Kafka Streams in Java12 lectures • 1hr 47min

Stream Processing with Faust in Python15 lectures • 43min

Requirements

Description

Who this course is for: