Name: Data Engineering with Informatica, Snowflake and Streamlit
Rating: 4.6 (23 reviews)

Udemy Business

Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Created byMarcos Vinicius Oliveira

Last updated 3/2024

English

What you'll learn

Setup Informatica Cloud, Snowflake and AWS Accounts
Design IICS Taskflows and assets
Ingest data from S3 to Snowflake with CDI
Build Snowflake Dashboards
Build Streamlit apps inside of Snowflake

Course content

11 sections • 57 lectures • 6h 3m total length

Introduction to the course2:11
Learn to load data with Snowflake and Informatica, build dashboards to validate mappings, and develop Streamlit apps for updating data and dimensions using Python.

Creating a Snowflake account2:36
Sign up for a 30-day Snowflake trial, choose AWS and Sao Paulo, and complete account details. Use the attached data, mapping, calendar, and Informatica transformations to create tables.
Loging into Snowflake1:47
Register and activate your Snowflake account using the activation link within 72 hours, set a strong username and password, and begin creating the tables for the course.
Creating Databases and Schemas3:20
Create and organize Snowflake objects by using a sysadmin role to set up the IX course database, establish stage, PSA, and data warehouse schemas, then analyze data and create tables.
Creating course tables - 15:33
Create the product mapping table as a transient stage table, replace text keys with a numeric product code, and set up brand and line extension lookups with a primary key.
Creating course tables - 25:32
Create a touchpoint mapping table with a touchpoint key, a touchpoint code key, and media, sub media, and platform lookups, then design a date-rich calendar dimension on the psa layer.
Creating course tables - 36:20
Create the media spend fact table from date, brand, line extension, and touchpoint mappings. Enforce not null constraints and use lookups for stage and PSA tables to preserve history.
Creating course tables - 44:55
Create the final course table with start date, product code, touch point code, value, metric and modified timestamp, include media spend, and apply mapping logic with lookups.

Creating an IICS Data Integration trial account3:48
Create an Informatica IICS data integration trial account by entering your details, verify the email, and choose a cloud region aligned with your Snowflake on AWS to reduce data transfer.
Installing the local secure agent3:56
Download and install a secure agent, link it to your Informatica instance with a token, and verify services run to enable connections to your data sources.
Creating a flat file connection2:29
Create and test a flat file connection by assigning a project tag, specifying a runtime path, and adjusting date format and UTF-8, while addressing administrator access for the secure agent.
Creating a Snowflake connection5:58
Enable the Snowflake data cloud connector in Informatica, configure standard authentication, set the account identifier, select the compute warehouse and sysadmin role, test and save the connection.
Setting up a parameter file3:32
Set up a parameter file to manage database connections across dev, uat, and production in Informatica, linking a global db name to the environment and preparing for environment migrations.

Source files overview2:36
STG Product Mapping12:49
Define and execute a stg product mapping from a flat file to Snowflake, generating a CRC32 product code and a parameterized task flow for scalable data migration.
Issue on using mapping task instead of mass ingestion for loading files1:51
Prefer messenger ingestion over mapping tasks for loading files, and use archiving and error handling to skip the flow when no new files arrive.
STG Touchpoint Mapping6:51
PSA Calendar9:24
Learn to map the PSA calendar from source to target in a new Informatica workflow, adjust delimiters, handle date time fields, and troubleshoot date-lookup errors.
STG Media Spend13:24
Implement and validate the final media spend upload using Informatica to load a flat file into Snowflake, handling date and decimal formats and mapping fields.
Data Review3:18
Review and fix mapping tables by removing primary keys from product and touchpoint keys, applying not null constraints, then recreate stage and psa mappings to preserve business keys.

DWH mapping - 15:23
Explore data warehouse mapping with Informatica by building multi-step data flows, including lookups for product and touchpoint, filters, and returning key fields for the PSA media spend table in Snowflake.
DWH mapping - 25:26
DWH mapping - 34:49
Aggregate daily source data weekly using a calendar lookup and a grouped metrics approach, summarizing grp, impressions, and investment while reducing data size for faster mapping.
DWH mapping - 43:58
Apply the normalizer to unpivot data, creating a single value column and a metric name while mapping week start touch point code and product code in the unpivot matrix.
DWH mapping - 58:44
DWH mapping - 64:54
Execute final data warehouse mapping by applying pre sql, handling unpivoted week start as a string, normalizing dates, and enforcing delete-then-insert on the snowflake media spend target.
DWH mapping - 713:25
Design and validate the pre-SQL for the data warehouse layer by building joins between stage, product mapping, and calendar, and ensure accurate week start and product code matching.

Snowflake Dashboard Design - 14:37
Leverage medispan data and mapping tables to build runtime product and touchpoint masters for Snowflake dashboards. Capture brand, product key, and subbrand logic with case expressions to enable reliable aggregation.
Snowflake Dashboard Design - 25:15
Snowflake Dashboard Design - 32:38
Build and replace touch point tables in Snowflake, map media, touch point codes and keys, and design dashboards driven by source and target dimension mappings.
Snowflake Dashboard Design - 47:05
Design a Snowflake dashboard by creating worksheets and dashboards, adding tiles to visualize product and touchpoint mappings, and identify unmapped spend with a SQL query.
Snowflake Dashboard Design - 53:45
Create a touchpoint mapping tile from a sql worksheet, linking media and platform to spend touchpoints, and validate mappings to reveal missing data for charts and dashboards.
Snowflake Dashboard Design - 65:19
Snowflake Dashboard Design - 710:00
Create a Snowflake dashboard by building views that combine product and touchpoint mappings with spend data. Compare mapped and unmapped values using charts and a calendar join to surface gaps.

Product Master app design - 11:33
Discover how Streamlit lets you build data apps with Python without HTML, share quickly, and support data mappings and validation, with Snowflake integration and local or cloud runs.
Product Master app design - 25:14
Design a product master app to ingest data from the PSA table via Streamlit, update the PSA table using Informatica, and prevent manual inserts, ensuring access to the latest data.
Product Master app design - 35:35
Design a product master app using Streamlit and Snowpark, establishing an active Snowflake session to run queries, configure a wide page layout, and manage an environment-agnostic database name.
Product Master app design - 46:06
Persist data across a Streamlit app with session state, query the product master from Snowflake, and display it as a pandas DataFrame; design selectors to add brands and sub-brands.
Product Master app design - 58:37
Design and ingest a psa product table in a data warehouse using Informatica, Snowflake, and Streamlit, handling brand and subbrand levels with dynamic ui and ddl creation.
Product Master app design - 67:49
Define product master columns and a reset data frame, then implement a streamlit brand input that enforces uppercase, informs users with info messages, and checks duplicates via session SQL.
Product Master app design - 710:48
Build and test a product master workflow in Streamlit by initializing and validating a pandas data frame in session state, previewing new rows, and handling duplicates with reset logic.
Product Master app design - 811:13
Designs a product master app that ingests data with brand input, validates duplicates by product key, previews before ingest, and writes to a Snowflake table via pandas, with Streamlit feedback.
Product Master app design - 98:55
Design a dynamic product master app that adds brand and sub brand via input-driven controls, filters duplicates, ensures unique keys, and prepares data for PSA and data warehouse ingestion.
Product Master app design - 1010:09
Create an object mapping to populate the product master from PSA media product, connect to Snowflake, create a CRC32 product code, and drop duplicates with a pre-step cleanup.

Product mapping app design - 110:14
Design and implement a Streamlit app for product mapping, displaying products, brands, subbrands, and missing mappings in a PSA media layer with a compose key.
Product mapping app design - 24:34
Recreate mapping table without product code, keep the product key, replace code with a compose key; build select boxes for mapping and product keys from the data warehouse media product.
Product mapping app design - 312:45
Designs a product mapping app workflow that ingests data, uses a session state data frame, and submits mappings with brand, line extension, product key, and compose key.
Product mapping app design - 49:45
Touchpoint mapping app design - 112:09
Touchpoint mapping app design - 25:54
Refine the touchpoint mapping app by updating fields like media lookup, platform lookup, and touchpoint key, reusing components, and validating dynamic mapping inputs in Streamlit.
Touchpoint mapping app design - 35:42
Design, map, and validate touchpoint data in the touchpoint mapping app using the latest Streamlit app, syncing fields, creating touch point keys, and updating mappings for consistent dashboards.

Requirements

Basic knowledge on Informatica Cloud Data Integration (CDI)
Proficient knowledge on SQL and basic knowledge on Snowflake database
Basic knowledge on data modeling and engineering
Basic Python knowledge

Description

Unlock the power of data integration and transformation with this comprehensive course designed to elevate your skills in handling data and leveraging Snowflake tables through Informatica Intelligent Cloud Services (IICS).

Throughout this course, you'll embark on a transformative journey, starting from the basics of extracting, transforming, and loading data (ETL) sourced from CSV files into Snowflake tables using IICS Data Integration. Delve deep into understanding the intricacies of data movement and manipulation, gaining proficiency in streamlining processes for seamless integration.

As the course progresses, you'll transition to exploring Snowflake's Dashboard capability, unlocking new avenues for data validation and analysis. Dive into creating visually appealing and insightful dashboards that provide a comprehensive overview of your data landscape, empowering you to make informed decisions with confidence.

But the learning doesn't stop there. We'll take it a step further by introducing you to Streamlit, a powerful tool for building interactive data applications with Python. Discover how to harness the capabilities of Streamlit to develop custom data apps that not only enhance the ETL process but also offer dynamic functionalities that can potentially replace certain steps altogether.

By the end of this course, you'll emerge equipped with a robust skill set in data integration, validation, and application development, ready to tackle real-world data challenges head-on. Whether you're a seasoned data professional or just starting your journey in the world of data engineering, this course provides invaluable insights and practical knowledge to propel your career forward.

Who this course is for:

Data Engineers looking to get proficient on IICS using cloud platforms for data load/ingestion
Data Engineers looking to get into building Streamlit apps in Snowflake

What you'll learn

Explore related topics

Course content

Introduction1 lecture • 2min

Snowflake Setup7 lectures • 30min

IICS Setup5 lectures • 20min

SOS - Changing objects permission/ownership on Snowflake1 lecture • 4min

IICS Data Ingestion7 lectures • 50min

IICS Delete + Insert3 lectures • 28min

DWH Logic7 lectures • 47min

Snowflake Dashboard7 lectures • 39min

Streamlit - Master Product App10 lectures • 1hr 16min

Streamlit - Mapping Apps7 lectures • 1hr 1min

Requirements

Description

Who this course is for: