
Explore how Apache NiFi enables building and automating data flows, from flow files and expression language to data provenance, parameter context, and custom processors, with demos and real-world use cases.
Explore ETL data flow and data pipelines by mapping source and destination systems and applying extract, transform, and load steps. Note csv and parquet formats and diverse targets.
Get introduced to Apache NiFi for data flow and ingestion, with hundreds of processors, a web-based UI, drag-and-drop workflows, and processors for sources and targets such as GetFile, HDFS, and Kafka.
Explore why Apache NiFi addresses the global enterprise data flow problem by enabling secure, efficient data movement across diverse sources, locations, and user roles, with back pressure and prioritization.
Explore enterprise data flow problems and how NiFi addresses deployment, reliability, security, data provenance, and data prioritization through visual design, back pressure, and scalable clustering.
Explore Apache NiFi’s features for high availability, load balancing, fault tolerance, and multi-node clustering, plus authentication via Active Directory and Kerberos, and integrations with Spark, Kafka, and Hadoop.
Learn how to install and configure Apache NiFi on Windows, with notes on Java versions and the move to Java 17 or newer for NiFi 2.x compatibility and performance.
Install Java 21 JDK to meet Apache NiFi system requirements. Configure JAVA_HOME and the PATH, verify the Java installation, and prepare for NiFi setup on Windows, Linux, Unix, or macOS.
Install Apache NiFi from official binaries across Windows and Linux, verify Java prerequisite, start NiFi, and retrieve auto-generated login credentials to access the NiFi canvas.
Replace the default NiFi username and password using a Linux/macOS script or a Windows batch file, set Java and NiFi paths, then restart NiFi to log in with new credentials.
Explore the NiFi user interface featuring a blank canvas, component toolbar with processors and ports, and status indicators for processors and remote groups.
Build a simple NiFi pipeline that fetches files from staging, scrubs phone numbers by removing special characters, and writes results to the processed location. Learn processors, flow files, and connections.
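The scrubbing step can be illustrated outside NiFi with a minimal Python sketch. It assumes "special characters" means anything that is not a digit; the pattern actually used in the course is not given here, and the function name `scrub_phone` is hypothetical. In NiFi the equivalent would be a regular expression configured on the ReplaceText processor.

```python
import re


def scrub_phone(raw: str) -> str:
    """Return the phone number with every non-digit character removed."""
    # Assumption: only digits should survive; adjust the character class
    # if a leading "+" or an extension marker must be kept.
    return re.sub(r"[^0-9]", "", raw)


if __name__ == "__main__":
    print(scrub_phone("(555) 123-4567"))
```

Note that the sketch cleans a single phone value. Applying the same pattern to a whole record would also strip delimiters, so a real flow would need a pattern scoped to the phone field.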
Install Apache NiFi 2.4 on macOS with JDK 21, guided by ChatGPT. Build a GetFile to PutFile pipeline, use ReplaceText with a regex, and verify the flow on the root canvas.
Discover how flow files power NiFi by carrying attributes and a data pointer through processors, while content, flow file, and provenance repositories store data, metadata, and history.
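As a rough mental model of the idea above, the following Python sketch separates a flow file's attributes from its content: the flow file holds only a pointer, while the bytes live in a stand-in content repository. All names (`FlowFile`, `ContentClaim`, `content_repo`, and the helper functions) are hypothetical simplifications, not NiFi's internal classes.

```python
import uuid
from dataclasses import dataclass


@dataclass
class ContentClaim:
    """Pointer to a slice of bytes held in the content repository."""
    claim_id: str
    offset: int
    length: int


@dataclass
class FlowFile:
    """Attributes (metadata) plus a pointer to the content, not the content itself."""
    attributes: dict
    claim: ContentClaim


# Stand-in for the content repository: claim id -> raw bytes.
content_repo: dict = {}


def create_flow_file(data: bytes, filename: str) -> FlowFile:
    """Store the bytes once and hand back a lightweight flow file."""
    claim_id = str(uuid.uuid4())
    content_repo[claim_id] = data
    attributes = {"uuid": str(uuid.uuid4()), "filename": filename}
    return FlowFile(attributes, ContentClaim(claim_id, 0, len(data)))


def read_content(flow_file: FlowFile) -> bytes:
    """Follow the pointer to fetch the actual content."""
    claim = flow_file.claim
    return content_repo[claim.claim_id][claim.offset:claim.offset + claim.length]


if __name__ == "__main__":
    ff = create_flow_file(b"1,(555) 123-4567", "contacts.csv")
    ff.attributes["region"] = "us"  # changing an attribute never touches the content
    print(ff.attributes["filename"], read_content(ff))
```

The design point is that a processor which only changes attributes can pass the flow file along without copying any content.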
Learn how NiFi's GenerateFlowFile processor creates flow files with custom data, such as IDs and phone numbers containing special characters, and how it integrates with ReplaceText and PutFile.
Explore the NiFi 2.6 interface and create flow files with the GenerateFlowFile processor, then view the output as we run the GetFile, ReplaceText, and PutFile processors.
Explore NiFi expression language to fetch and update flow file attributes, manipulate file names with functions, and route files by region using the UpdateAttribute and RouteOnAttribute processors.
Demonstrates a use case of expression language in Apache NiFi by building a pipeline that updates file names, compresses content to gzip, and routes outputs to region folders.
Learn how to use funnels in Apache NiFi to merge multiple connections into a single path and to debug flows by routing through a funnel before the next processor.
Learn to manage complex NiFi pipelines by creating and using process groups to logically group components, expose input and output ports, and enable reusability across projects.
Learn how a parameter context defines a set of globally accessible parameters in NiFi, and how assigning it to a process group enables access, policy control, and hiding sensitive values.
Apply a parameter context to the sample NiFi pipeline on macOS, with NiFi 2.4.5, replacing hard-coded input and output paths and associating the context recursively with the process group.
Explore how to implement a parameter context in Apache NiFi to replace legacy variables, centralize configuration, and apply expression language for input and output paths in a region-based pipeline.
Explore how NiFi 2.x parameter contexts are not inherited by child process groups, requiring each group to have an explicit parameter context to access parameters.
Explore controller services in NiFi as shared configuration for processors, reporting tasks, and credentials, and see their scope along with a practical use case of loading JSON into a database.
Build a NiFi controller service with a MySQL DBCP connection pool to insert generated JSON into an employee table, including JDBC setup, retries, and logging.
Learn how Apache NiFi records data provenance by tracking flow file events—create, modify, send, and drop—in a provenance repository to reveal data lineage.
Explore data provenance in NiFi by examining data lineage, replaying content modifications, and validating input and output data to debug pipelines.
Understand how NiFi back pressure thresholds limit a connection's queue by flow file count and data size, and how back pressure cascades upstream when a downstream processor slows, illustrated with a practical pipeline demo.
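The two thresholds can be pictured with a small Python sketch of a bounded queue. The `Connection` class and its method names are hypothetical; the default values are assumptions chosen to mirror a limit of 10,000 flow files and 1 GB, and should be checked against the settings of the connection in your own NiFi instance.

```python
from collections import deque


class Connection:
    """Toy model of a queue that applies back pressure by count or total size."""

    def __init__(self, max_count: int = 10_000, max_bytes: int = 1024 ** 3):
        self.max_count = max_count
        self.max_bytes = max_bytes
        self.queue = deque()   # sizes (in bytes) of the queued flow files
        self.queued_bytes = 0

    def is_full(self) -> bool:
        """Back pressure applies once either threshold has been reached."""
        return len(self.queue) >= self.max_count or self.queued_bytes >= self.max_bytes

    def offer(self, size: int) -> bool:
        """Upstream tries to enqueue; returns False while back pressure is applied."""
        if self.is_full():
            return False
        self.queue.append(size)
        self.queued_bytes += size
        return True

    def poll(self) -> int:
        """Downstream takes one flow file, which may relieve back pressure."""
        size = self.queue.popleft()
        self.queued_bytes -= size
        return size


if __name__ == "__main__":
    conn = Connection(max_count=2, max_bytes=100)
    print(conn.offer(10), conn.offer(10), conn.offer(10))
```

Chaining several such connections shows the cascade: when the last queue fills, the processor feeding it stops, its own input queue fills, and the stall moves upstream.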
Explore flow file prioritization in NiFi, including first in first out, newest file first, oldest file first, and priority attribute prioritization.
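The four strategies listed above can be compared with a short Python sketch that sorts the same queue in different ways. The `QueuedFlowFile` type and its fields are hypothetical. The sketch assumes that a lower `priority` value is processed first and that flow files without the attribute go last; verify both against the prioritizer documentation for your NiFi version.

```python
from dataclasses import dataclass, field


@dataclass
class QueuedFlowFile:
    name: str
    queued_at: float       # when the flow file entered this connection
    lineage_start: float   # when the flow file's lineage began in the flow
    attributes: dict = field(default_factory=dict)


def first_in_first_out(files):
    """Order by arrival time in the connection."""
    return sorted(files, key=lambda f: f.queued_at)


def oldest_first(files):
    """Order by age in the flow, oldest lineage first."""
    return sorted(files, key=lambda f: f.lineage_start)


def newest_first(files):
    """Order by age in the flow, newest lineage first."""
    return sorted(files, key=lambda f: f.lineage_start, reverse=True)


def by_priority_attribute(files):
    """Order by the 'priority' attribute; a missing attribute sorts last (assumption)."""
    return sorted(files, key=lambda f: float(f.attributes.get("priority", "inf")))


if __name__ == "__main__":
    queue = [
        QueuedFlowFile("a", queued_at=1.0, lineage_start=5.0, attributes={"priority": "2"}),
        QueuedFlowFile("b", queued_at=2.0, lineage_start=3.0, attributes={"priority": "1"}),
        QueuedFlowFile("c", queued_at=3.0, lineage_start=4.0),
    ]
    for strategy in (first_in_first_out, oldest_first, newest_first, by_priority_attribute):
        print(strategy.__name__, [f.name for f in strategy(queue)])
```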
Demonstrate prioritization in a NiFi pipeline by creating a flow with high and low priority sources, setting a priority attribute, and observing high priority files processed first.
Explore how flow file expiration in NiFi drops unprocessed files after a set time, with a five-second queue expiration demonstrated on the flow file priority pipeline.
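A minimal Python sketch of the expiration rule, using the five-second value from the demo. The function name `split_expired` and the tuple layout are hypothetical. The sketch assumes age is measured from a single `entered_at` timestamp and that an expiration of 0 disables the feature; check how your NiFi version measures age before relying on this.

```python
def split_expired(queue, now, expiration_seconds=5.0):
    """Split (name, entered_at) pairs into those kept and those expired.

    Assumption: an expiration of 0 means flow files never expire.
    """
    if expiration_seconds == 0:
        return list(queue), []
    kept, expired = [], []
    for name, entered_at in queue:
        # A flow file older than the limit is dropped instead of processed.
        if now - entered_at > expiration_seconds:
            expired.append((name, entered_at))
        else:
            kept.append((name, entered_at))
    return kept, expired


if __name__ == "__main__":
    queue = [("low-priority-1", 0.0), ("high-priority-1", 8.0)]
    kept, expired = split_expired(queue, now=10.0)
    print("kept:", kept)
    print("expired:", expired)
```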
Learn to log events in NiFi with the LogAttribute and LogMessage processors, view logs in the NiFi app log, and prefix entries for easy identification.
Discover how NiFi monitors pipelines and cluster resources, using processor stats, bulletins, and reporting tasks to track success and failure events and resource utilization, with Splunk integration.
Explore NiFi cluster and resource monitoring by configuring memory and disk usage tasks, reading the status bar metrics, and adjusting thresholds to manage alerts.
Explore the Apache NiFi Registry, a version control system for data flows with buckets, change metadata, and automatic synchronization directly from the canvas across environments.
Install and start the Apache NiFi Registry locally by downloading version 2.6.0 and running it from the bin folder; access the UI at localhost:18080 to create buckets for version control.
Version control data flows with NiFi Registry by grouping the flow in a process group, creating a bucket, and committing versions. Export and import JSON to move changes across environments.
Learn how to integrate Apache NiFi with Amazon S3 by writing data from NiFi to an S3 bucket, configure credentials, and troubleshoot failures.
Explore how custom processors extend Apache NiFi to fill unique needs, while built-in processors cover ingestion, routing, transformation, enrichment, and system integration with controller services.
Discover how to install Apache Maven and use NiFi archetypes—the processor bundle archetype and the service bundle archetype—to generate custom processors and controller services.
Generate a NiFi custom processor template with Maven archetype, configure group id, artifact id, and version, and review the sample processor with my property and my relationship.
Update and deploy a NiFi custom processor by editing and renaming, configuring properties and relationships, then build a jar and test the processor with sample data.
This course prepares you for a real-world Data Engineer role using NiFi.
This course covers Apache NiFi end to end with real-world use cases. You don't need any prior knowledge of NiFi to start. The topics are covered step by step and are designed to take you from beginner to expert.
You will learn the following:
The real-world enterprise data flow problem and its challenges.
Why we need a tool like Apache NiFi, and an introduction to Apache NiFi.
The user interface and internals of a NiFi flow file.
Installation of Java, Maven, and Apache NiFi.
Understanding the various login identity providers in NiFi.
Setting up NiFi on Windows and updating the username and password.
Introduction to the folder structure, covering the bin, repository, logs, and extension folders.
A simple demo covering the NiFi user interface.
Basic concepts like Expression Language, Funnels, and Variables.
In-depth concepts like Controller Services, Parameter Contexts, and Templates.
Hands-on NiFi implementation of all the use cases.
Building a pipeline that creates a controller service for a database connection and writes data to a MySQL database.
Logging, pipeline monitoring, and cluster monitoring in NiFi.
Bonus: Git link for all templates used in the course.
Prerequisites:
No NiFi knowledge required to start this course.
Basic programming skills
Basic database knowledge
Big Data entry level knowledge
This course uses high-quality AI-generated text-to-speech narration to complement the powerful visuals and enhance your learning experience.