Udemy
    •  
    •  
    •  
    •  
    •  
    •  
    •  
    •  
Turn what you know into an opportunity and reach millions around the world.
Learn More
Your cart is empty.
Keep shopping
Snowflake - Automate JSON Ingestion With Snowpark (Save 95%)
Rating: 4.4 out of 5(8 ratings)
111 students

Snowflake - Automate JSON Ingestion With Snowpark (Save 95%)

Automate DDL & DML SQL Generation For JSON Data Files & Save 95% Manual Effort
Last updated 8/2024
English

What you'll learn

  • How to automate JSON Data ingestion from named stages to snowflake table using Snowpark Python API
  • How to use infer-schema to automate JSON data ingestion activities.
  • How to build automation utilities using Snowpark Python API
  • How to build end to end data onboarding tool using Snowpark Python API

Course content

10 sections27 lectures1h 35m total length
  • Challenges With JSON Data Onboarding & Processing Using DDL/DML5:03
  • JSON Data To DDL/DML Automation Scope In Snowflake1:28

    Automate json ingestion into Snowflake using the Snowpark Python API. Transform json entities from a stage into tabular data and load bronze and silver layer, excluding the gold layer.

Requirements

  • Basic working knowledge with Python Programming Language
  • Working knowledge with Snowflake Cloud Data Warehouse
  • Basic working knowledge with Snowpark Python API

Description

In many situations, Snowflake gets lots of JSON data files and JSON entities, and data development teams write complex stored procedures to make DDL and DML SQL scripts to process and flatten the JSON data entities. Creating & building DDL & DML statement manually is a time consuming and error prone process. This also hampers overall development process.

This tutorial helps you create a simple and sophisticated utility using Snowflake's Snowpark Python API. This python utility uses Infer-schema table function along with Python JSON library to figure out JSON structures and helps to create all your landing/bronze/silver layer snowflake object requirements. With Python-Jinja2 templates, it doesn't just create DDL commands; it also makes copy commands, streams, tasks, and stored procedures. This makes it easy to automate moving data from your external storage to your bronze/silver layers.


This tutorial will explain the current challenges and how to solve this problem

  1. We'll look at the Infer Schema Table Function and its limits in detecting JSON structures.

  2. We'll discuss the usual patterns and repetition when making DDL & DML statements for JSON entities.

  3. You'll learn how to create a simple data ingestion solution using Snowpark and Python without writing any SQL code

  4. We'll show you how to put this design into practice with the Snowpark Python API.

  5. Finally, we'll demonstrate the whole process using multiple JSON data examples, and you'll see how it can quickly load a large number of records in under 2 minutes.

Who this course is for:

  • Snowflake Cloud Data Developer
  • Snowflake Cloud Data Engineer
  • Snowflake Cloud Data Architect
  • Python Data Developer
  • Snowflake Data Engineer