Udemy
Stitch ETL - A Simple, extensible ETL built for data teams
Rating: 4.4 out of 5 (9 ratings)
238 students

Learn Stitch ETL by migrating data between Snowflake, AWS S3, and AWS RDS PostgreSQL
Created by Jim Macaulay
Last updated 1/2022
English

What you'll learn

  • Stitch ETL from scratch
  • Data Migration
  • Data Replication
  • Streaming Data Pipeline

Course content

5 sections, 10 lectures, 59m total length
  • About the Course (0:42)
  • Introduction (3:00)

Requirements

  • No programming experience required. A basic understanding of ETL/ELT and cloud architecture is an advantage, but not mandatory.

Description

The course is about Stitch, a product owned by Talend.


What is Stitch?

Stitch is a cloud-first, open source platform for rapidly moving data. A simple, powerful ETL service, Stitch connects to various data sources and replicates that data to a destination.


• Stitch helps you replicate data into cloud data warehouses

• Stitch rapidly moves data from 130+ sources into a cloud data warehouse with no coding

• Stitch is a simple, extensible ETL service built for data teams


This course covers:

• An introduction to Stitch

• Signing up with Stitch

• Creating sources for AWS S3 and AWS RDS PostgreSQL

• Creating targets for Snowflake, AWS S3, and AWS RDS PostgreSQL

• Replicating data from source to target


Stitch enables you to:

• Extract data from various sources

• Load into the leading cloud data platforms

• Analyze the data with the leading BI tools


Replication

Stitch’s replication process consists of three distinct phases:

  1. Extract: Stitch pulls data from your data sources and persists it to Stitch’s data pipeline through the Import API.

  2. Prepare: Data is lightly transformed to ensure compatibility with the destination.

  3. Load: Stitch loads the data into your destination.

A single occurrence of these three phases is called a replication job. You can keep an eye on a replication job’s progress on any integration’s Summary page.
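
The three phases can be pictured with a small in-memory sketch. This is only an illustration of the extract, prepare, and load sequence described above, not Stitch's actual implementation or API: the `ReplicationJob` class, its method names, and the choice of lowercasing column names as the "light transformation" are all assumptions made for the example.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class ReplicationJob:
    """Hypothetical model of one pass through extract, prepare, and load."""

    source: list[dict[str, Any]]
    destination: list[dict[str, Any]] = field(default_factory=list)

    def extract(self) -> list[dict[str, Any]]:
        # Extract: copy every record out of the source without modifying it.
        return [dict(row) for row in self.source]

    def prepare(self, rows: list[dict[str, Any]]) -> list[dict[str, Any]]:
        # Prepare: a light transformation for destination compatibility.
        # Lowercasing column names is an assumed example of such a step.
        return [{key.lower(): value for key, value in row.items()} for row in rows]

    def load(self, rows: list[dict[str, Any]]) -> int:
        # Load: append the prepared rows to the destination and
        # report how many rows were written.
        self.destination.extend(rows)
        return len(rows)

    def run(self) -> int:
        # A single run of all three phases is one replication job.
        return self.load(self.prepare(self.extract()))


job = ReplicationJob(source=[{"ID": 1, "Name": "Ada"}, {"ID": 2, "Name": "Lin"}])
loaded = job.run()
print(loaded, job.destination)
```

In Stitch itself these phases run inside the managed service, so no code is required; the sketch only shows the order of operations that a replication job follows.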


Stitch integrates with target systems such as:

• Amazon Redshift

• AWS S3

• Delta Lake on Databricks

• Google BigQuery

• Microsoft Azure Synapse Analytics

• Microsoft SQL Server

• Panoply

• PostgreSQL

• Snowflake


This course is for:

• ETL Developers

• Data Engineers

• Data Architects

• Data Migration Specialists

• Data Integration Specialists

Who this course is for:

  • ETL Developers
  • Data Migration specialists
  • Data Engineers
  • Data Architects
  • Database Administrators