DP-203: Data Engineering on Microsoft Azure - 2022

Rating: 4.5 (7855 reviews)

Exam DP-203: Data Engineering on Microsoft Azure || 25+ hrs of videos || Practice Tests || 100% Syllabus || Demos

Created by Eshant Garg | LearnCloud.Info | 100,000+ Enrollments

Last updated 1/2024

English

English [Auto]

What you'll learn

Prepare to pass the Microsoft exam DP-203: Data Engineering on Microsoft Azure
Azure Storage (Blob Storage) (DP-203 Syllabus)
Azure Cosmos DB (DP-203 Syllabus)
Azure Data Lake (DP-203 Syllabus)
Azure SQL Database (DP-203 Syllabus)
Azure SQL Data Warehouse, Synapse (DP-203 Syllabus)
Azure Data Factory (DP-203 Syllabus)
Azure Databricks (DP-203 Syllabus)
Azure Analytics Services (DP-203 Syllabus)
Azure Monitoring Service (DP-203 Syllabus)

Course content

12 sections • 189 lectures • 24h 19m total length

Course Introduction and set expectation5:11
This introduction to the DP-203 Data Engineering on Microsoft Azure course covers the certification, highlights salary and demand, and outlines the exam objectives: data storage, processing, security, and monitoring.
IMP: Everything you need to know about exam and this course3:38
Understand the DP-203 exam structure: 180 minutes, 50–60 questions, and 700 points to pass, with no negative marking and with partial credit; questions include multiple choice, multiple select, arrange, and case study.
IMP: Instructions1:33
How to keep using Azure Portal for FREE after 12 months?4:49
Discover how to keep using Azure portal for free after 12 months with Microsoft Learn sandbox sessions. Switch directory to Microsoft Learn sandbox and work within four hours per session.
How to get FREE credits for Azure Portal1:39
PPT and Demo Resources0:15
Before you start...1:29
Share your reviews to support the course; you’ll be invited to review after a few minutes. Set 720p quality, enable captions, adjust caption position and font size, and use speed control.
Practice Questions and Quizzes0:03

Absolute new to Cloud Computing? Here is FREE course for you0:11
Create Azure Subscription6:13
Create a free Microsoft Azure subscription by signing in with a Microsoft account. Receive $200 credit for 30 days, with 25 always free services and 12 months of free services.
Azure Portal Overview14:45
Explore the Azure portal, learn to create, manage, and monitor resources via the web interface, customize dashboards, use global search, access marketplace and all services, and cloud shell options.
Delete Resources and Set Budget6:58
Learn how to delete resources and resource groups to avoid unwanted costs, and use cost management, cost analysis, and budgets with alerts to monitor and forecast spending.
How did Data Engineer Profile Evolve2:38
Data volumes grow exponentially, and data is the biggest superpower in the present times. The data engineer profile evolves to meet rising demand for data professionals in Microsoft Azure environments.
Data Engineer Role and Responsibility5:33
Trace the data flow from collection and ingestion to storage, data wrangling, and dimensional and structural modeling for analysis, using data factory and enabling business insights.
Data Engineer Technologies3:48
Explore how Azure Data Factory v2 ingests and transforms raw data using distributed file systems, PolyBase, and Spark, storing it in blob storage or a data lake for analytics.
Summary of Data Engineering Profile and Technologies
Further Study Material0:07

Learning objectives3:03
Dive into non-relational datastore concepts with blob storage, data lake, and Cosmos DB, and explore replication, security options, cost, throughput, partitioning, and global distribution.
Azure Storage Services Overview6:02
Explore how Azure storage services handle blob, file, table, and queue data with durability, high availability, encryption, and regional redundancy, while offering a REST API and client libraries for scalable access.
Demo Provision Azure Storage Account5:13
Provision an Azure storage account in the portal by selecting subscription, resource group, and a unique name. Choose region and standard general-purpose v2 with locally redundant storage for blobs.
Data Redundancy Options6:01
Explore data redundancy options to protect data against hardware failures and disasters. Compare locally, zone, and geo redundant storage, including read-access variants, and weigh cost against availability.
Azure Blob Storage6:14
Azure blob storage stores any file type as blobs within a storage account and containers, with block, page, and append blob types for streaming, logging, backup, and archiving.
Azure Storage Access Tiers4:51
Explore Azure storage access tiers: hot for frequent data, cool for infrequent data, and archive for long term backups, plus lifecycle rules that move data to save costs.
Azure Table Storage5:05
Explore NoSQL table storage as a key-value store with rows and fields, emphasizing semi-structured data, partitioning, and fast, scalable data insertion and retrieval with Cosmos DB.
Azure Queue Storage2:46
Azure queue storage helps decouple producer and consumer apps by buffering millions of messages in queues, enabling asynchronous backlog processing via APIs and the portal.
Azure File Share Storage5:35
Discover how Azure file storage provides a fully managed, cross-platform file share that scales from on-premises to the cloud, enabling centralized configurations and tools accessible via SMB or NFS.
Demo Azure File Share Storage3:50
Explore how to create an Azure file share, upload a file, and connect to the share from Windows, Linux, or Mac using the provided script, while troubleshooting firewall issues.
Azure Disk Storage and demo8:19
Explore how disk storage attaches to virtual machines, OS disks and data disks. Compare managed versus unmanaged disks and disk types like standard HDD, standard SSD, premium SSD, and ultra.
IMP Note: Cosmos DB is Not in exam0:13
Cosmos DB: Problem Statement - How Cosmos DB Evolved?2:38
Understand how Cosmos DB evolved from DocumentDB to solve global distribution and scalability, storing data as JSON documents with querying capabilities.
Cosmos DB: Features11:24
Discover Cosmos DB as a fully managed, serverless database as a service with global distribution, multi-model support, automatic indexing, and strong five-nines uptime for mission-critical apps.
Cosmos DB: Multi Model 5 APIs12:20
Master Cosmos DB's multi-model APIs, including SQL, Table, MongoDB, Cassandra, and Gremlin. Learn to migrate on-premises data and select the right API for graphs, documents, and key-value stores.
Cosmos DB: Provision Account10:34
Provision a Cosmos DB account in Azure by selecting subscription, resource group, and a unique name with Core SQL API, then configure networking, encryption, review, and create.
Cosmos DB: Database containers and items4:27
Explore Cosmos DB concepts by creating a database, containers, and items in the data explorer, and understand throughput, partition keys, and unique keys across multiple APIs.
Cosmos DB: Throughput and request units9:18
Configure Cosmos DB throughput to maximize performance, measure throughput in request units, and monitor latency, with alerts at 80 percent consumption and scalable options above 400 RU.
Cosmos DB: Horizontal Scaling2:49
Explore how Cosmos DB scales horizontally with unlimited storage and throughput by distributing data across multiple machines behind a container, through partitioning.
Cosmos DB: What is partitioning and partition key6:10
Explore how Cosmos DB uses a partition key to divide items into logical partitions that map to physical partitions, with multiple logical partitions able to share a single physical partition.
Cosmos DB: Dedicated vs Shared throughput5:52
Learn how Cosmos DB throughput can be configured at the database level or at the container level, and how that choice creates shared versus dedicated throughput.
Cosmos DB: Avoiding hot partition5:24
Learn how container throughput is evenly divided among logical partitions in Cosmos DB, recognize hot partitions, and apply partition keys and partition-on-throughput strategies to spread data and queries.
Cosmos DB: Single partition vs Cross partition3:32
Learn the difference between single partition and cross partition queries in Cosmos DB using a social network scenario, emphasizing efficiency and when fan out occurs.
Cosmos DB: Composite Key3:14
Learn Cosmos DB composite keys and partitioning, balancing 2 MB document limits and 20 GB partition limits by using many small partitions for unlimited scale.
Cosmos DB: Partition key best practice7:48
Learn to select a high-cardinality partition key, like user id or product id, to avoid hot partitions and evenly distribute data and queries across partitions, while understanding partition size limits.
Cosmos DB: Demo - Insert and query data12:41
Create a Cosmos DB database and container, insert and update JSON items, and run queries with the data explorer. Learn about partition keys and throughput to optimize data operations.
Cosmos DB: Time to Live5:48
Configure Cosmos DB time to live (TTL) to automatically delete documents after a set time by enabling TTL on the container and specifying a value in seconds, or -1 to never expire.
Cosmos DB: Globally Distribution13:23
Explore Cosmos DB global distribution by replicating data across multiple data centers, improving read and write latency, availability, and disaster recovery with multi-region replication and automatic failover.
Cosmos DB: Multi Master8:38
Cosmos DB enables multi-region writes, allowing read and write across centers, with conflict resolution options like last writer wins, merge procedures, or conflict feeds.
Cosmos DB: Manual vs Automatics Fail-over7:36
Explore manual and automatic failover in Cosmos DB across multi-region setups, configure priorities, and ensure seamless continuity while preserving consistency during regional outages.
Cosmos DB: 5 consistent level14:16
Explore Cosmos DB consistency levels—strong, bounded staleness, session, consistent prefix, and eventual—and how they trade latency, availability, and data freshness in design.
Cosmos DB: CLI7:02
Explore how to use the Cosmos DB CLI to create and configure a Cosmos DB account, including resource groups, databases, and options like default consistency level, automatic failover, and multi-region settings.
What is Data Lake?3:54
The data lake is a massive repository for structured, unstructured, and raw data in native format, handling any volume. It enables immediate loading and later transformations, unlike traditional warehouses.
How Data Lake Gen 2 evolved5:17
Trace how Data Lake Gen2 evolved from HDFS in the cloud, blending blob storage's cost efficiency and tiers with DFS fault tolerance for big data analysis.
Azure Blob Storage vs Azure Data Lake4:37
Compare blob storage with data lake. Built on blob storage, data lake storage gen2 enables big data analytics with Hadoop integration.
Azure Blob & Data Lake Security options20:52
Explore Azure blob and data lake security options, including storage account keys, shared access signatures, and Azure AD RBAC and ACL, with network and IP restrictions.
High Availability vs Disaster Recovery7:16
Learn how same-region high availability uses multiple instances and a load balancer to avoid downtime, and how recovery time objective and recovery point objective govern data loss during failover.
Azure Storage - HA and DR Options11:31
Examine high availability and disaster recovery options for storage, including locally and zone redundant storage, plus manual failover to a secondary region and blob data protection features.
Cosmos DB - HA and DR Options17:22
Explore Cosmos DB high availability and disaster recovery options, including regional replication, global distribution, automatic and manual failover, and automated backup and restore with blob storage.
Further Study Material0:33
My Notes on this section3:07
Practice Tests0:03
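
The Time to Live lecture in this section describes enabling TTL on a container and giving a value in seconds, or -1 to never expire. As a rough illustration only, the pure-Python sketch below models that expiry rule without calling any Azure SDK. The function name `is_expired` and its parameters are hypothetical. It assumes that an item-level value overrides the container default and that nothing expires while TTL is off on the container.

```python
from typing import Optional


def is_expired(last_modified: int, now: int,
               container_ttl: Optional[int],
               item_ttl: Optional[int] = None) -> bool:
    """Return True if an item would be eligible for TTL deletion.

    last_modified and now are epoch seconds. container_ttl and item_ttl are
    in seconds, with -1 meaning "never expire" and None meaning "not set".
    """
    if container_ttl is None:
        # Assumption: TTL is off on the container, so nothing expires.
        return False
    # Assumption: an item-level value takes precedence over the default.
    effective = item_ttl if item_ttl is not None else container_ttl
    if effective == -1:
        return False
    return now - last_modified >= effective
```

For example, `is_expired(0, 3600, 3600)` is true, while `is_expired(0, 100, 60, item_ttl=-1)` is false because the item opts out of expiry.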

Learning objectives2:32
Learn about relational data stores, Azure database offerings (single, elastic, managed), the Azure data warehouse, PolyBase loading, MPP architecture, storage and data distribution, partitioning, and loading methods.
Azure SQL: Why?2:58
Discover how Azure SQL, a fully managed relational database as a service, leverages elastic pools and offers 99.99% uptime, geo-replication, and security features.
Azure Iaas vs Pass Database Offerings9:22
Examine IaaS vs PaaS for SQL Server workloads: manage SQL Server in a virtual machine versus a fully managed database service with automated backups, scaling, and high availability.
Limitations of PaaS and IaaS Database offerings.1:01
Azure SQL: PaaS Deployment Options3:50
Explore Azure SQL deployment options in a PaaS framework, including single database, elastic pool, and managed instance, with guaranteed resources and shared pool benefits.
Azure SQL Server Demo: Provision Single Database15:37
Provision a single database in Azure SQL Database and explore deployment options—single database, elastic pool, and managed instance—along with firewall and connectivity basics.
Azure SQL Server: Purchasing models and service tiers13:23
Explore Azure SQL Database purchasing models and service tiers, including DTU-based and vCore-based options, with general purpose, standard, business critical, and hyperscale deployments.
Azure Elastic Database4:16
Use elastic pools to share a set of resources across multiple databases, reducing costs and handling variable workloads without over or under provisioning.
Azure Elastic Database Demo: Provision14:01
Provision three Azure SQL databases and group two into an elastic pool on a shared server, migrate the third, then demonstrate scaling and cleanup of resources.
Azure SQL Database: Security layers8:11
Explore Azure SQL Database security layers, including firewall-based network access, authentication and authorization, auditing and threat protection, encryption in transit and at rest, dynamic data masking, and vulnerability assessment.
Scaling Azure Database11:20
Explore vertical and horizontal scaling for Azure databases, including scale up and scale down, scale out with read-only replicas, and global scale out through sharding.
Azure SQL Database High Availability and Disaster Recovery options28:32
Explore Azure SQL Database high availability and disaster recovery options, including standard vs premium availability models, read scale out, cross-region geo-replication, and failover groups with stable listener endpoints.
Azure SQL Database Backup and Restore19:07
Learn how to use full, differential, and transaction log backups to restore an Azure SQL Database to any point in time, and configure long-term retention up to 10 years.
Traditional vs Modern Warehouse architecture14:24
Compare traditional on-premises data warehouses with modern architectures that separate compute and storage, and ingest, clean, and model data for a single source of truth.
What is Synapse Analytics Service8:45
Learn how Synapse Analytics unifies data integration, data warehousing, and big data analytics into a single service for end-to-end analysis. Explore limitless scale, built-in data factory, storage, and visualization.
Demo: Create Dedicated SQL Pool11:43
Create a dedicated SQL pool by provisioning a server, configuring firewall rules, and pausing compute to save costs, with deployment options inside the workspace or as a separate service.
Demo: Connect Dedicated SQL Pool with SSMS3:40
Connect a dedicated SQL pool with SSMS, configure firewall rules for your IP via the portal, access the AdventureWorks sample database, and pause compute to save costs while storage continues.
Demo: Create Azure Synapse Analytics Studio Workspace5:57
Create a new Azure Synapse Analytics workspace, provision a data storage account and file system, set access roles, configure security, and open Synapse Studio to manage jobs.
Demo: Explore Synapse Studio11:53
Explore Synapse Studio's data, develop, integrate, monitor, and manage tabs to connect storage, link datasets, run SQL scripts, notebooks, data flows, and pipelines, and publish reports to Power BI.
Demo: Create Dedicated SQL Pool and Spark Pool8:12
Demonstrates how to create a dedicated SQL pool and an Apache Spark pool in a workspace, compare serverless versus dedicated pools, and discuss auto pause and cost considerations.
Demo: Analyse Data using Dedicated SQL Pool9:14
Resume a dedicated SQL pool, create and populate a table with millions of rows, and analyze taxi trip data. Publish scripts, explore round-robin and hash distributions, and export results.
Demo: Analyse Data using Apache Spark Notebook11:52
Demonstrates analyzing data from multiple sources with an Apache Spark notebook, loading datasets into a data frame, and ingesting results into Spark databases for New York taxi data.
Demo: Analyse Data using Serverless SQL Pool7:52
Use the serverless SQL pool for ad hoc queries across blob storage and external data sources, create a serverless database, and link external CSV and Parquet files.
Demo: Data Factory from Synapse Analytics Studio9:03
Demonstrates running a data factory pipeline inside Synapse Analytics Studio to copy data from SQL Server to a data warehouse, with connections, mapping, and monitoring.
Demo: Monitor Synapse Studio7:36
Explore monitoring of pipelines, triggers, and integration runtimes in Synapse Studio, and review various activity types (Spark, serverless, SQL pool, and data flow) with logs, inputs, outputs, and cost insights.
Azure Synapse: MPP Architecture7:10
Explore the Azure Synapse MPP architecture with a control node, compute nodes, and a data movement service enabling parallel queries across 60 distributions and scalable data warehousing units.
Azure Synapse: Storage and Sharding Patterns5:29
Explore Azure Synapse storage and sharding patterns, including replicated, round-robin, and hash distributions, and learn how distributions drive parallel queries and table performance.
Azure Synapse: Data Distribution and Distributing Keys6:05
Learn to select data distributions in Azure Synapse to avoid data skew, using hash keys, replicated tables, and round-robin distributions across 60 buckets, and optimize joins, grouping, and performance.
Azure Synapse: Data Types and Table Types5:20
Choose the smallest data types and default lengths for integers and characters to save space and move data efficiently between compute nodes; compare clustered columnstore, heap, and clustered index types.
Azure Synapse: Partitioning4:48
Explore table partitioning in Azure Synapse, splitting large tables into partitions by date to speed queries, ease data load and maintenance, and avoid performance pitfalls from excessive partitions.
Azure Synapse: Best Practices for Fact and Dimension tables2:49
Apply dimensional modeling in Azure Synapse with hash-distributed fact tables for efficient joins. For dimensions, use hash or round-robin for small tables and avoid partitioning them.
Demo: Analyze Data distribution before migration to Azure12:19
This demo analyzes an on-premises Adventure Works data warehouse before migrating to Azure, using round-robin and hash distribution to prepare large fact and dimension tables and assess data types.
Azure Synapse: Different loading methods2:38
Explore single-client loading methods such as SSIS, Data Factory, or BCP, and parallel loading with PolyBase that bypasses the control node to feed compute nodes in a data warehouse.
Azure Synapse: Loading with SSIS vs PolyBase3:35
Compare loading data into Azure Synapse with SSIS and PolyBase; note control node bottlenecks versus parallel loads from blob storage, and set up an external data source, file format, and external table.
Azure Synapse Demo: Loading with Polybase27:10
Export the on-premises table to a flat file, upload it to blob storage, and run the PolyBase six-step load to the Azure data warehouse, monitoring and validating 60 distributions.
Scaling Azure Datawarehouse3:17
Scale Azure data warehouse on demand by adjusting the data warehouse unit (DWU), pause to save costs, and automate start or pause with PowerShell, Data Factory, or CLI.
Azure SQL Datawarehouse Backup and Restore10:30
Learn how to back up and restore an Azure SQL data warehouse with snapshots and restore points, seven-day retention, up to 42 points, user-defined options, final snapshots, and regional replication.
Azure Database vs Azure Datawarehouse (Synapse Data Pool)2:33
Differentiate online transaction processing databases from data warehouses: online transaction processing handles create, read, update, delete operations, while data warehouses optimize queries and reports with massively parallel processing across horizontally partitioned compute nodes.
Implement data masking21:20
Implement dynamic data masking in SQL Server to shield sensitive data such as social security numbers, credit cards, and emails using default, random number, email, and credit card masking.
Encrypt data at rest and in motion25:43
Learn to encrypt data at rest, in motion, and in use across Azure services using symmetric and asymmetric encryption, key vaults, and Always Encrypted with deterministic or randomized options.
Further Study Material0:38
My Notes on this section4:10
Practice Tests0:03
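
The Synapse lectures in this section describe round-robin and hash distribution across 60 distributions, and the data skew a poor hash key can cause. The sketch below is a simplified simulation in plain Python, not Synapse itself. The function names are hypothetical, and `zlib.crc32` stands in for whatever internal hash function the service actually uses.

```python
from collections import Counter
from itertools import cycle
import zlib

DISTRIBUTIONS = 60  # number of distributions named in the lectures


def round_robin(rows):
    """Assign rows to distributions in turn, ignoring their content."""
    counts = Counter()
    for _, dist in zip(rows, cycle(range(DISTRIBUTIONS))):
        counts[dist] += 1
    return counts


def hash_distribute(rows, key):
    """Assign each row by a hash of its distribution column."""
    counts = Counter()
    for row in rows:
        # crc32 is a stand-in hash chosen for this illustration only.
        counts[zlib.crc32(str(row[key]).encode()) % DISTRIBUTIONS] += 1
    return counts


def skew(counts):
    """Largest distribution size divided by the ideal even share."""
    total = sum(counts.values())
    return max(counts.values()) / (total / DISTRIBUTIONS)


# Hypothetical sample data: 6000 trips with only three payment types.
rows = [{"trip_id": i, "payment_type": i % 3} for i in range(6000)]
even = skew(round_robin(rows))                       # 1.0, perfectly even
skewed = skew(hash_distribute(rows, "payment_type"))  # at least 20.0
```

With only three distinct key values, at most three of the 60 distributions receive rows, which is the kind of skew a higher-cardinality column such as `trip_id` would avoid.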

Learning objectives2:45
Explore the basics of Data Factory and Databricks, including pipelines, activities, datasets, integration runtime, triggers, notebooks, and clusters.
What is Data Factory4:39
Discover how Data Factory, a cloud version of SSIS, lets you copy and transform data across 80 connectors, from on-premises to cloud sources, with built-in data flow for transformations.
Data Factory within Azure Eco system9:41
Learn when to use Azure Data Factory versus specialized migration and streaming services, leveraging version 2, connectors, and event-based triggers for cloud data workflows, with SSIS integration.
Provision Data Factory8:02
Create a new Azure Data Factory v2, assign a unique name, choose subscription, resource group, and location, then explore the home, author, monitor, and management hubs to build pipelines.
Data Factory - Components7:11
Explore how Azure Data Factory components—integration runtime, activities, datasets, linked services, sources, sinks, and pipelines—work together to move and transform data, using copy and data flow tasks.
Data Factory - Pipeline and Activities13:21
Develop and organize Azure Data Factory pipelines with activities, folders, and templates. Configure data movement, transformation, and control activities; validate, publish, and view JSON-backed code.
Data Factory - Linked service and Datasets14:30
Connect and organize your data flows by defining linked services and datasets in Azure Data Factory, enabling copy activities to reference blob storage data with proper schema.
Data Factory - Integration Runtime19:22
Learn how Data Factory's integration runtime executes activities, data flows, and data movement by bridging linked services and datasets with serverless, managed compute.
Data Factory - Triggers19:30
Discover how Azure Data Factory triggers automate pipelines with schedule, tumbling window, and event triggers, including setup, dependencies, and backfill options.
Demo: Copy Data Activity through wizard35:02
Use Azure Data Factory to copy data from blob storage to a SQL database with the copy data activity. Create linked services and datasets, map fields, and monitor the pipeline.
Demo: Copy Data Activity using Author page9:21
Demonstrate building a copy data activity pipeline in Azure Data Factory's Author page, sourcing from blob storage and sinking to SQL Server, with schema mapping, publishing, monitoring, and troubleshooting duplicates.
Data Factory - User Properties3:42
Explore the Data Factory user property across activities, view source and destination in the monitor, auto generate properties, and add custom user properties like Eshant.
Data Factory - Parameters16:42
Learn how parameters in Data Factory pipelines enable dynamic inputs, such as file names, container names, and destination details, to run the same workflow for multiple sources.
Data Factory - Data Flow Concept15:19
Explore data flow in data factory to transform data graphically with drag-and-drop, generating code behind the scenes for scalable mapping and wrangling data flows.
Data Factory - Mapping Data Flow28:04
Demonstrates mapping data flow in Azure Data Factory by joining product and product category files from blob storage, producing a final output with corrected column names and data preview.
Data Factory - Wrangling Data Flow10:58
Explore wrangling data flow in Azure Data Factory to clean and transform data with Power Query, including column removal, renaming, and value replacement, and output to blob storage.
What is Azure Databricks8:58
Explore how Azure Databricks harnesses Apache Spark on the Azure cloud to deliver a fully managed data lake, data factory integration, and machine learning workflows.
How to save Databricks demo Cost1:07
Demo overview4:35
Connect Databricks with the data lake via a service principal, mount the data lake, process with Scala, Python, or SQL, and save results back to the data lake.
Demo: Provision Databricks, Clusters and workbook11:22
Provision a Databricks service, create a workspace, and build an interactive cluster with a notebook workbook in Azure; enable premium tier, RBAC, auto termination, and auto scaling.
Demo: Mount Data Lake to Databricks DBFS12:24
Mount the data lake to Databricks DBFS using a service principal and app registration. Configure client id, directory id, and secret, grant read, write, execute, and verify access with dbutils.
Demo: Explore, Analyze, Clean, Transform and Load Data in Databricks18:04
Explore, analyze, clean, transform, and load taxi data in Databricks using notebooks and Spark dataframes, reading from a data lake and writing results back to the data lake.
Spark Basics27:25
Explore Spark basics, an in-memory analytics engine, its evolution from MapReduce and Hadoop, and the RDD, DataFrame, and Dataset abstractions with lazy evaluation and actions.
Azure Databricks Clusters12:56
Explore interactive and automated clusters in Azure Databricks for notebook analysis and scheduled jobs. Learn standard and high concurrency modes, auto scaling, and idle termination to save cost.
Azure Databricks Other Important Components9:34
Explore Azure Databricks workspace fundamentals, including databases and tables, notebooks with multi-language support, and jobs with scheduling and cluster configurations.
Further Study Material0:35
My Notes on this section1:25
Practice Tests0:03

Learning objectives1:22
Explore the streaming analytics service for real-time data, learn to configure inputs and outputs, write processing logic, and apply tumbling, hopping, sliding, and session windowing with end-to-end demos.
What is Live Event Processing9:19
Describe live data processing with event producers, processors, and consumers and real-time responses to anomalies in banking and markets, using Azure Stream Analytics and related services.
Introducing Azure Stream Analytics (configure input and output)6:18
Discover how Azure Stream Analytics delivers fully managed real-time analytics for fast-moving data, ingesting from Event Hubs, IoT Hub, and Azure Blob Storage to produce outputs.
Introduction to Windowing Functions2:22
Explore streaming analytics by grouping timestamped events into windows, computing metrics like average or count, and learning four types: tumbling windows, hopping windows, sliding windows, and session windows.
Tumbling Window2:33
Understand tumbling windows by dividing time into non-overlapping buckets, using group by with a time unit and bucket value, e.g., ten-second intervals counting events.
Hopping Window2:13
Understand hopping windows, where ten-second intervals overlap every five seconds, counting events like tweets across overlapping windows to illustrate dynamic time-based aggregations.
Sliding Window2:22
Explore sliding window analysis with a fixed ten-second window. Each new event starts a window, creating overlap and producing window results of 1, 2, 4, and 1.
Session Window6:06
Define the session window as a non-fixed, non-overlapping window that starts on an event, ends after five minutes of silence, or after ten minutes (max), using minute units.
Demo: Processing Blob Storage Input18:43
Learn to set up blob storage input and output, configure a stream analytics job with a processing query, and start processing uploaded JSON files that flow to the output.
Demo: Processing IOT Hub Input18:49
Demonstrates processing data from an IoT hub using a streaming analytics job, with a simulated device feeding sensor data and outputs saved to blob storage.
Spark Structured Streaming1:16
Further Study Material0:18
My Notes on this Section2:43
Practice Tests0:03
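
The tumbling and hopping window lectures in this section use ten-second windows, with hopping windows starting every five seconds. The plain-Python sketch below counts events per window to show how the two differ. It is not Stream Analytics query syntax, the function names are hypothetical, and the boundary handling is simplified to half-open windows `[start, start + size)` on a timeline that begins at 0, which may differ from the service's own rules.

```python
from collections import Counter


def tumbling_counts(timestamps, size):
    """Count events per non-overlapping window [start, start + size)."""
    counts = Counter((t // size) * size for t in timestamps)
    return dict(sorted(counts.items()))


def hopping_counts(timestamps, size, hop):
    """Count events per window [start, start + size), one window per hop."""
    counts = Counter()
    for t in timestamps:
        # Begin at the latest window start at or before t, then step back
        # one hop at a time while the window still covers t.
        start = (t // hop) * hop
        while start > t - size and start >= 0:
            counts[start] += 1
            start -= hop
    return dict(sorted(counts.items()))


# Hypothetical event times in seconds.
events = [1, 3, 8, 12, 14, 21]
tumbling = tumbling_counts(events, size=10)
hopping = hopping_counts(events, size=10, hop=5)
```

Here `tumbling` is `{0: 3, 10: 2, 20: 1}`, so each event is counted once, while `hopping` is `{0: 3, 5: 3, 10: 2, 15: 1, 20: 1}`, so events such as the one at second 8 fall into two overlapping windows.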

Learning objectives2:21
Learn to monitor NoSQL storage, blob storage, data lake, and analytics services, including Cosmos DB; explore a centralized monitor service with alerts, logs, and metrics for relational databases.
Intro to Azure Monitor Service16:22
Azure Monitor centralizes monitoring by collecting and analyzing data from metrics and logs across resources, enabling alerts, insights, log analytics, diagnostic settings, workbooks, dashboards, and custom views.
Demo: Azure Monitor Service22:59
Explore the Azure Monitor service and its core tools—metrics, logs, alerts, activity logs, diagnostic settings, and workbooks—for monitoring, diagnosing, and optimizing resources.
Implementing Blob and Data Lake Storage monitoring17:24
Learn to monitor blob and data lake storage with insights and workbooks, analyzing metrics, alerts, and diagnostic settings to troubleshoot latency, availability, and capacity.
Implement Azure Synapse Analytics monitoring12:15
Explore Azure Synapse Analytics monitoring by reviewing query activity, alerts, metrics, and diagnostic settings; learn to create alert rules, configure actions, view query plans, and analyze the performance dashboard.
Implement Cosmos DB monitoring12:43
Monitor Cosmos DB in Azure with metrics, alerts, and diagnostic settings to track throughput, storage, latency, availability, consistency, and SLA performance.
Further Study Material0:21
My Notes on this Section2:02
Practice Tests0:03

Learning objectives1:41
Learn to monitor Azure data services including data factory, databases, and streaming analytics using metrics, log analytics, alerts, and service-specific tools for reliable operation.
Monitor Data Factory Pipelines15:18
Monitor data factory pipelines with dashboards and alerts, tracking completion status, run duration, errors, and resource usage. Explore pipelines, triggers, and integration runtimes to understand performance and set proactive alerts.
Monitor Data Factory - Metrics, Alerts, Diagnostic Settings13:39
Learn to configure alerts and metrics for a data factory, create new alert rules and action groups, and set diagnostic settings to collect logs and metrics.
Monitor Azure Databricks6:39
Explore monitoring options for Azure Databricks, including the Ganglia monitoring system embedded with Databricks, the Azure Monitor workflow, and Grafana with a Log Analytics workspace.
Monitor Stream Analytics14:12
Monitor Azure stream analytics via portal, SDK, or Visual Studio, and configure alerts for utilization, runtime errors, watermark delay, and deserialization errors.
Further Study Material0:13
My Notes on this Section1:36
Practice Tests0:03

Learning objectives2:21
Optimize data partitioning and troubleshoot bottlenecks to boost performance. Structure the data lake, optimize ingestion and stream analytics, and apply hash, round-robin, and replicated tables in PolyBase ingestion for Synapse Analytics.
Troubleshoot Data Partitioning Bottlenecks6:56
Identify and troubleshoot data partitioning bottlenecks by applying horizontal, vertical, and functional partitioning strategies, and follow best practices for balanced workload distribution and cross-partition optimization.
Optimize Data Lake Storage5:39
Optimize data lake storage by maximizing throughput with parallel reads/writes and fast cloud links, and size data between 256 megabytes and 100 gigabytes with month/date naming in the same region.
Optimize Stream Analytics10:22
Optimize Stream Analytics by tuning input, output, and query processing; monitor CPU utilization and memory, and apply partitioning with PARTITION BY for scalable, parallel processing.
Optimize Azure Synapse Analytics9:54
Implement best practices for Azure Synapse Analytics by maintaining up-to-date statistics on key columns, using PolyBase for large data loads, and distributing large tables by join-optimized columns.
Manage the Data Lifecycle13:54
Learn to manage data lifecycle in Azure blob storage by configuring policies that move blobs from hot to cool to archive, enabling rehydration and cost governance.
Further Study Material0:21
My Notes on this Section2:08
Practice Tests0:03

Data Types11:36
Explore the four data categories—structured, semi-structured, unstructured, and streaming—highlighting schema, flexibility, and real-time analysis with examples like XML, JSON, GPS, and IoT.
Data Storage Types18:16
Explore relational and non-relational storage types, including key-value, document, graph, column-family, object, and file storage, and learn how to match data structure and latency needs for analytics and apps.
Select Azure Store for application4:08
Choose Azure storage by matching your on-prem data to Cassandra API, MongoDB API, or SQL API, then use blob storage or data lake with time series and graph options.
Azure Data Platform Architecture5:41
Discover how the Azure data platform architecture layers load, storage, process, serve, and visualize data from sources, including streaming and relational data, using Event Hubs, Data Factory, Synapse, and PolyBase.
RTO and RPO4:11
Explore RTO and RPO for disaster recovery, and see how lower RTO and near real-time data replication to an alternate region reduce downtime and data loss.
Scenarios - Designing a solution for CosmosDB vs Data Lake vs Blob Storage10:25
Apply scenario-based design to choose Cosmos DB for global, real-time pricing; use Data Lake for business intelligence reporting from raw data; and Blob Storage for cost-effective video storage.
Scenarios - Designing for SQL Database vs Data warehouse6:10
Evaluate scenarios to decide between SQL database elastic pool and data warehouse, illustrating cost-sensitive transactional workloads versus complex queries on massive data.
Design Batch Processing Solutions using Data Factory and DataBricks17:03
Design batch processing architectures on Azure using Blob Storage, Cosmos DB, Databricks, and Spark, with Data Factory or SSIS orchestration for end-to-end processing and reporting.
Data Ingestion Methods12:56
Learn data ingestion methods in Azure, compare batch and real-time approaches, and explore tools like UCLA, AzCopy, datalink, Hadoop, Sqoop, PolyBase, and Data Factory for efficient ingestion and orchestration.
Real Time Processing11:31
Analyze real-time processing architecture across message ingestion, streaming, analytical data store, and reporting to deliver near real-time insights with Event Hubs, Kafka, IoT Hub, and Spark Streaming.
Design and Provision Compute Resources8:51
Compare Azure Stream Analytics architectures for real-time dashboards, storage-backed reporting, and real-time alerting, using Event Hubs, Stream Analytics, and reference data.
Lambda Architecture9:44
Master lambda architecture for real-time hot path and batch cold path processing. Use data factory, data lake, data warehouse, and machine learning studio for archive, analysis, and AI-driven insights.
Plan for Secure Endpoints (Public/Private)7:01
Enable secure connectivity with virtual network service endpoints and private endpoints to keep traffic within the network backbone and grant access to specific storage resources.
Practice Tests0:03

Requirements

Basic Database Concepts
Enthusiasm and determination to clear DP-203 Exam

Description

This course is designed to help you and your team develop the skills necessary to pass the Microsoft Azure DP-203 certification exam.

DP-203 is intended for Azure data engineers. This exam is all about implementation and configuration, so you need to know how to create, manage, use, and configure data services in the Azure portal.

Why take the DP-203 certification?

According to the 2019 Dice.com report, job postings for data engineers grew 88 percent year over year, the highest growth rate among all technology jobs.
According to a recent study by Pearson VUE, 65% of people say they feel more confident in their current job after taking the certification, and 35% say that their salary has increased.

Highlights of course

The course is completely up to date with the latest syllabus released by Microsoft for DP-203
The course covers 100% of the exam syllabus
The course includes:
- 25+ hrs of content
- 2 practice tests
- Quizzes specially designed to clarify the concepts behind each objective
- Further study material
- PPT and demo resources

Course includes:

Full lifetime access with all future updates
30-Day Money-Back Guarantee
Certificate of course completion

Intended Audience

Anyone who wants to prepare for the DP-203 exam
Anyone who wants to become an Azure Data Engineer
Microsoft Azure Data Engineers
Microsoft Azure Data Scientists
Database and BI developers
Database Administrators
Data Analyst or similar profiles
On-Premises Database related profiles who want to learn how to implement these technologies in Azure Cloud.

Prerequisites

Basic Database concepts

Language

English
Please make sure you are comfortable with English; the captions alone are not good enough to understand the course.

Technologies covered in DP-203 certification

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Implement non-relational data stores

Non-relational data stores (Blob Storage)
Cosmos DB

Implement relational data stores

Azure SQL Server
Azure Synapse Analytics Service

Manage data security

Data masking
Data Encryption

Develop batch processing solutions

Azure Data Factory
Azure Databricks

Develop streaming solutions

Azure Streaming Service

Monitor data storage

Monitoring for Azure Blob Storage
Monitoring for Azure Data Lake
Monitoring for Azure SQL Database
Monitoring for Azure Synapse Analytics
Monitoring for Azure Cosmos DB
Azure Monitoring Service

Monitor data processing

Monitoring for Azure Data Factory
Monitoring for Azure Databricks
Monitoring for Azure Stream Analytics

Optimize Azure data solutions

Optimize Azure Data Lake
Optimize Azure Stream Analytics
Optimize Azure Synapse Analytics
Optimize Azure SQL Database

Some student feedback

One of the most amazing courses i have ever taken on Udemy. Please don't hesitate to take this course. The instructor is really professional and has a great experience about the subject of the course. - Khadija Badary
Very nicely explained most of the concepts. a must have course for beginners - Manoranjan Swain
I appreciate this course explaining everything in great detail for a beginner. This will assist me in overcoming challenges at my work - Benjamin Curtis
Good course for Beginners. Labs are really helpful to grasp the concept. Thank you - Sapna

Who this course is for:

Anyone who wants to clear DP-203 certification
Anyone who wants to become Azure Data Engineer
Database Developer
Database Administrators (DBA)
Business Intelligence (BI) Developers
Data Engineers
Data Scientist
Data Analyst or similar profiles
Other on-premises Database related profiles who want to learn how to implement these technologies in Azure Cloud.

Course content

Course Introduction8 lectures • 19min

Introduction to Azure Cloud and Data Engineer Profile9 lectures • 40min

Data Storage: Non-Relational Data Stores42 lectures • 4hr 47min

Data Storage: Relational Data Stores43 lectures • 6hr 30min

Batch Processing [Design and develop data processing]28 lectures • 5hr 27min

Streaming Analytics [Design and develop data processing]14 lectures • 1hr 14min

Monitor Data Storage9 lectures • 1hr 27min

Monitor Data Processing8 lectures • 53min

Optimize Azure Data Solutions9 lectures • 52min

Designing an Azure Data Solution14 lectures • 2hr 8min
