
Learn how to create an Azure Data Factory by configuring subscription, resource group, and a unique name, selecting a region, then review, create, and access author, monitor, and manage.
Learn how a resource group acts as a container for related Azure resources—data factory, data lake, Synapse Analytics, and Key Vault—organized under a subscription, in the Azure portal.
Learn to configure linked services as connection strings to data stores, and create datasets that reference specific files or tables for etl operations in Azure Data Factory.
Azure blob storage is the main cloud data store for unstructured data, letting you create a storage account and containers to store and read text, images, and common file formats.
Explore variables in ADF by initializing and modifying values in a data factory pipeline with set variable, then use dynamic content to compute x plus y and assign to z.
Master json as a lightweight data structure for storing and transporting data via api endpoints and data factory pipelines, using objects, arrays, indexing, and dot notation to access nested values.
Explore copy activity in Azure Data Factory to move data between on-premises and cloud stores, configuring source and sink, supporting diverse data stores, file formats, and post-move transformation and analysis.
Learn how delete activity in Azure Data Factory removes files or folders in blob storage to clean up archives, using dataset options like file path, wildcards, prefixes, and recursion.
Invoke another pipeline from a master pipeline using the execute pipeline activity in Azure Data Factory, demonstrating master and child pipelines, lookup, and dataset integration.
Explore how the get metadata activity retrieves file, folder, or table metadata in Azure Data Factory or Synapse pipelines, and how to use output in conditional expressions and downstream activities.
Learn how the lookup activity fetches data from a file or table in Azure Data Factory, returning configuration data or query results for use in subsequent activities.
Explore set variable in Azure Data Factory to declare and initialize variables, then compute area from length and breadth with dynamic expressions, int conversion, mul, and string formatting.
Understand how the wait activity in Azure Data Factory holds a pipeline for a specified time, using seconds (60) or 3600 seconds to manage dependencies from external sources.
Explore how the filter activity in Azure Data Factory applies a condition to each array item, using contains to filter substrings and output an array of matching values.
Explain how the until activity loops until its condition becomes true, using a variable, incrementing it, and performing type conversions with set variable and add operations.
Demonstrate how the Azure Data Factory if condition evaluates a boolean to execute true or false branches, using variables and set variable to determine the interest rate.
Apply join transformations in Azure Data Factory data flows to merge two sources using inner, left, right, and full outer joins, handling matched and unmatched records with nulls.
Aggregate transformation in Azure Data Factory performs group-by style aggregations on columns to compute average salary, max salary, and counts per department.
Learn the select transformation in Azure Data Factory data flows, a SQL-like tool to rename, drop, and reorder columns without altering row data.
Learn how to derive a new column or modify existing fields in Azure Data Factory data flows, using the derived column transformation and storing results to blob storage.
Master the pivot transformation in Azure Data Factory data flows, turning unique values from a single column into multiple columns by grouping by department and counting by job.
Explore unpivot transformation in Azure Data Factory to convert a wide dataset into a vertical view by ungrouping a column and turning column names into a values column.
Explore the rank transformation in Azure Data Factory data flows, generating dense rankings from a sorted salary column in ascending or descending order and exporting results to a file.
Explore the flatten transformation in Azure Data Factory data flows to unroll array values in JSON into individual rows, demonstrating data normalization and outputting flattened results to a blob destination.
Learn how the filter transformation in Azure Data Factory data flows filters rows by a condition, like a SQL where clause, producing only matching records and/or supporting multiple conditions.
Copy files dynamically from a blob storage container to a destination using Azure Data Factory, designing linked services, datasets, and a get metadata plus loop to move all files.
Explore how the set variable activity assigns values to pipeline variables. Learn to increment with a temporary variable and int conversion to avoid self assignment.
Design an Azure Data Factory pipeline to migrate files whose names contain a substring pattern from source blob storage to a destination, using get metadata, filter, and copy activities.
Log pipeline details after execution using a stored procedure into an ADF_pipeline_log table, recording data factory name, pipeline name, run ID, and timestamp.
Split a semicolon-delimited CSV line into product, description, and area columns using Azure Data Factory Data Flows; derive columns, map to SQL Server destination, and validate pipeline execution.
Learn to copy missed files from source to destination in Azure Data Factory by identifying missing files and orchestrating a pipeline with get metadata, filter, and copy activities.
Implement end-to-end email notifications in Azure Data Factory using one stored procedure to manage start, success, and failure, with dynamic subject, body, and recipients.
Discover what a data warehouse is: a central, well-structured repository for cleansed data used for analysis and reporting, with data organized into fact and dimension tables.
Master data warehouse keys by exploring candidate keys, primary keys, foreign keys, and surrogate keys, and learn how each key functions in data design.
Identify a candidate key as a column that is both unique and not null, as shown by employee id or passport; names may repeat, so they are not candidate keys.
Discover how a foreign key links an employee to a department by referencing the dept table's primary key, using dept number as the reference, and see self-referential keys like mgr.
Define surrogate key as a system generated, auto-incremented unique identifier that serves as the primary key, especially in slowly changing dimensions for historical tracking.
Explore dimension and fact tables in a star schema, including primary and surrogate keys, descriptive attributes, and measurable facts; learn how incremental and full loads impact the data model.
Learn how to implement SCD type 2 in Azure Data Factory to preserve full history with surrogate keys, start and end dates, and incremental loads via staging and merge scripts.
Learn the scd type 4 approach with a current dimension table and a history table, using staging data and a surrogate key to track changes and auditing.
Understand SCD type 5, using a mini dimension to track city changes while keeping current values in the main dimension for fast queries.
Implement SCD type 6 by maintaining history and the current value in a single record with dates and active flag. Learn to merge staging data and insert new history records.
Create a logs table under the afm schema with an identity id and fields for adf name, pipeline name, run id, start time, status, and error messages, illustrating identity usage.
Create two stored procedures to log Azure Data Factory pipeline starts and ends, inserting data factory name, pipeline name, run id, and start/end times with dynamic values and testing.
Create an Azure Key Vault to securely store secrets like passwords and access keys, with current and older versions. Apply them in Data Factory pipelines and Databricks mounting.
Welcome to the Course "Azure Data Factory - Data Engineer : Real Time Projects".
Learn ETL and Big Data Processing on Azure with Azure Data Factory V2
Azure Data Factory (ADF) is one of the in-demand data engineering tool in the Cloud. This course has been taught with covering a data engineering solutions using ADF for real world scenarios.
Once you complete this course including assignments, quizzes, you will be in a position to take up or handle confidently any kind of projects one equal to 10+ years of experience.
Entire course built based on real time scenarios which are going to use very frequently and common to any type of projects on various domains.
This course will gives you necessary skills and help you greatly to pass for Certification exams like DP200 and DP203.
What this course will teach you?
You will learn how to build a real time data pipeline in Azure Data Factory (ADF).
You will learn how to transform data using Data Flows in Azure Data Factory (ADF) and load into ADLS2, Blob storage.
You will learn how to ingest JSON data from SQL Server to API end point.
You will learn how to build production ready pipelines and good practices and naming standards.
You will learn how to monitor pipelines using Azure Data Factory (ADF), Azure Monitor and Log Analytics with a real-world project.
You will learn about manual/Triggers in Azure Data Factory (ADF) and how to use them to schedule the data pipelines.
Create Data driven, fully integrated, dynamic and automated production grade pipeline creation and orchestration.
Data Warehousing Concepts like Fact & dimension tables, SCD type1, type2, Incremental loading and implement using ADF.
Connect & copy data from On Premise data stores, On premise SQL server via Self Hosted Integration Runtime to cloud.
How to ingest data from sources such as REST API, Azure Blob Storage, SQL DB into Azure Data Lake Gen2 using Azure Data Factory (ADF)