Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Data Engineering Fundamentals with Prefect Workflow

Name: Data Engineering Fundamentals with Prefect Workflow
Rating: 3.5 (6 reviews)

Data Engineering Fundamentals with Prefect Data pipeline using Oracle Cloud Infrastructure - VM and Autonomous DB

Created byGuha Rajan M., B.Engg, MBA

Last updated 3/2024

English

What you'll learn

What is Data Engineering and its difference with Data Analysis and Data Science
Provisioning of Virtual Machine and Oracle Cloud Autonomous Database in Oracle Cloud Infrastructure
Introduction to Data Pipeline workflow tool - Prefect.
Demonstration fo Prefect client with prefect Dash Board & its integration
Building up and executing tasks using Python prefect libraries, task dependencies, views in Perfect dashboard
Demonstation of Webhooks with Prefect.

Course content

11 sections • 33 lectures • 3h 8m total length

Course Coverage6:42

What is Virtualization?7:15
Steps involved in creation of Linux Virtual Machine on OCI9:25
Creationing Public private & public Key using Putty Gen5:34
Provisioning Compartment and Virtual cloud Network (VCN) in OCI3:20
Creating Linux 9 - Virtual Machine on OCI3:43
Connecting through putty to Virtual Machine2:26
Executing scripts in VM for Linux GUI - Part 13:30
Executing scripts in VM for Linux GUI - Part 29:51
Quiz 3

Overview : Prefect Cloud Environment5:01
Prefect Client installation on Linux 9 - VM3:12
Connecting to Prefect cloud Dashboad Data pipeline from Client VM3:15
Executing the first Flow based datapipeline program using Prefect orchestration6:04
Experiment with a simple Python hello world flow in Prefect IO, deploy it on a client machine, and monitor its minute-by-minute execution and dashboard status via Prefect UI.

What is Autonomous Cloud Database?3:58
Significance of Compartment & creation-deletion of Compartment.3:28
Learn how compartments define a boundary in Oracle Cloud to isolate resources such as databases, VMs, and VCNs, and how to create or delete compartments with the removal rule.
Provisioning the Autonomous Database on OCI6:30
Different Ways to Connect to Oracle Autonomous Database2:30
Connecting through Cloud Web SQL Developer5:12
Connect to the Oracle Autonomous Database via the web edition of SQL Developer, explore the always free tier options, and run sample queries like selecting all from the customers table.
Python Connect to Oracle Autonomous Database through python library - Part 19:10
Python Connect to Oracle Autonomous Database through python library - Part 22:49
Learn how to retrieve data from an Oracle Autonomous Database using a Python script, including connection setup, cursor usage, and executing a select query to fetch table data.
OLTP Vs OLAP3:20
Prefect datapipe with two task and building dependency between tasks9:02
Quiz 5

Understanding the difference between Web Hooks, MQTT, Web Sockets7:52
Compare webhooks, mqtt, and web sockets by communication style differences. Webhooks push events to a URL; mqtt uses a broker for publish-subscribe in IoT; web sockets enable bidirectional communication.
Hands-on Demonstration Web Hooks with Prefect Workflow and Githhub - Part 19:58
Automated Deployment of webhook for event based workflows - Prefect & Githhub11:33
Automate GitHub issue events by deploying a webhook-driven workflow with Prefect, using a programmatic deployment, a work pool, and JSON parameters to trigger automated events.
Quiz 6

Requirements

Access to Oracle Cloud Infratructure free tier
Basic Linux and Python programming skills.

Description

Data engineering is the process of designing and building systems that let people collect and analyze raw data from multiple sources and formats. These systems empower people to find practical applications of the data, which businesses can use to thrive.

Companies of all sizes have huge amounts of disparate data to comb through to answer critical business questions. Data engineering is designed to support the process, making it possible for consumers of data, such as analysts, data scientists and executives, to reliably, quickly and securely inspect all of the data available.

About a decade back, the data analysis was merely on the structured data available on the a Relational data base or in ERP system and any decision was made based on analysis of the historic data and tools like ETL (extract, Tranform & load) was used for datawarehousing system. However in this dynamic ever changing world, non relational data base information need to used for quick analysis.

So apart from transactions in database, the other source of web information from CSV, webhooks, http & MQTT need to taken care as appropriate.

Further more, the process of ETL as evolved into Data pipelines. A data pipeline is a method in which raw data is ingested from various data sources and then ported to data store, like a data lake or data warehouse, for analysis. In data pipe line task dependency can be build with different task. These task can be also based on some events happening like Order booked or Issues raise which can trigger a task. For this concepts of Webhooks are used.

Prefect is one such newly evolved data pipeline or workflow tool, in which one can build not only static task dependency, but these task dependency can be built based on some event happeningas well.

This course uses the cloud version Prefect worflow tool which can be invoked from a cloud based virtual machine. Knowledge of Python & shell scripting is essential.

This course covers following topic:

•Difference between Data Engineering Vs Data Analysis Vs Data Science

•An Overview about Data Science, Machine Learning & Data Science.

•Extract, Transform, Load vs Data pipeline.

•Provisioning Oracle Linux Virtual machine On Oracle Cloud Infrastructure.

•Prefect Cloud Data pipeline and Client VM Set up.

•Documentation reference - Prefect Workflow / Data pipelines.

•Hands-on Demonstration of Perfect Flow with Tasks dependency.

•Building Prefect dataflow pipeline for Oracle Database extract using Python.

•Introduction to Webhooks and Hands-on Demonstration with Prefect & Github.

•Career Path for Data Engineers

Happy Learning!

Who this course is for:

Computer science students
IT consultants

Data Engineering Fundamentals with Prefect Workflow

What you'll learn

Explore related topics

Course content

Introduction1 lecture • 7min

Difference between Data Engineering Vs Data Analysis Vs Data Science2 lectures • 17min

Extract, Transform, Load vs Data pipeline2 lectures • 9min

Provisioning Oracle Linux Virtual machine On Oracle Cloud Infrastructure.8 lectures • 45min

Prefect Cloud Datapipeline and Client VM4 lectures • 18min

Documentation reference - Prefect Workflow / Datapipelines1 lecture • 5min

Hands-on Demonstration of Perfect Flow with Tasks1 lecture • 5min

Building Prefect dataflow pipeline for Oracle Database extract using Python9 lectures • 46min

Introduction to Webhooks and Hands-on Demonstration with Prefect & Github3 lectures • 29min

Career Path for Data Engineers1 lecture • 6min

Requirements

Description

Who this course is for: