Data Warehouse, Data Lake & Data Lakehouse - Fundamentals

Name: Data Warehouse, Data Lake & Data Lakehouse - Fundamentals
Rating: 4.0 (6 reviews)

Master Modern Data Platforms: From Warehouses to Lakehouses for Scalable Analytics

Created byCoding Gears | Train Your Brain

Last updated 11/2025

English

What you'll learn

Understand Modern Data Architectures
Master Data Integration Techniques
Explore Key Technologies and Tools
Understand Lakehouse Architecture

Course content

6 sections • 22 lectures • 1h 31m total length

Course Overview1:24
Explore how databases form the foundation of data systems and how oltp, olap, etl, elt, data warehouses, data marts, data lakes, and the data lakehouse enable analytics and machine learning.

Databases6:14
Explore databases as the backbone of digital apps, covering structured data, SQL querying, concurrency, security, and reliability, and contrast OLTP with OLAP workloads.
OLTP4:24
Explore OLTP, online transaction processing, designed for large numbers of short transactions with real-time updates, high concurrency, and maintained data integrity.
OLAP4:03
Explore olap, or online analytical processing, which enables multi-dimensional analysis of large historical data through data cubes with time, geography, and product dimensions, supporting reporting, decision making, and batch updates.
OLTP vs OLAP2:41
OLTP handles fast transactions on current data with normalized schemas for speed and reliability. OLAP analyzes large historical data with denormalized schemas for complex queries and insights.
Database - Advantages3:43
Explore how databases organize structured data, scale for growing workloads, ensure high availability and security, automate tasks, and enable integration with APIs and data pipelines for data-driven decisions.
Database - Challenges4:04
Explore database challenges in large-scale distributed environments, including design complexity, performance trade-offs between normalized and denormalized schemas, and data partitioning across services. Tackle costs, data duplication, governance, and observability.

ETL & ELT Concepts6:36
Learn ETL and ELT data integration patterns, moving data from sources to a data warehouse, with pre-load data cleaning in ETL and transformations in ELT for cloud platforms and analytics.
Data Warehouse7:35
Explore how a data warehouse supports OLAP analytics by collecting and transforming data from OLTP systems through an ODS, ETL processes, and data marts.
Data Warehouse - Advantages2:55
Consolidate data from CRM, ERP, and transactional systems into a single source of truth. Enable trend analysis, forecasting, OLAP, and machine learning with clean, integrated data and scalable performance.
Data Warehouse - Challenges3:14
Explore the challenges of data warehouses, including high initial cost and maintenance. Evaluate ETL complexity, scalability with semi-structured data, batch latency, data governance and quality, integration, and vendor lock risks.
Data Warehouse - Techs3:04
Explore the technologies powering data warehouses and compare leading platforms like Amazon Redshift, Google BigQuery, Snowflake, and Azure Synapse Analytics to guide your data strategy.

Data Lake Architecture8:15
Understand data lakes as scalable storage for structured, semi-structured, and unstructured data, guided by medallion architecture—bronze, silver, gold—with etl, spark, and access for bi, data science, and machine learning.
Data Lake - Advantages2:28
Explore the advantages of data lakes: scalable, cost-efficient storage for structured, semi-structured, and unstructured data with schema-on-read flexibility and support for real-time analytics and machine learning.
Data Lake - Challenges2:35
Explore the challenges of data lakes, including data swamp risk without governance, complex integration with legacy systems, and limited acid support affecting reliability, auditability, and performance in large-scale queries.
Data Lake - Techs2:59
Explore data lake technologies across Amazon S3 with Lake Formation governance, ADLS, Google Cloud Storage and Big Lake, and Hadoop ecosystems with Hive and Spark, plus Iceberg and Hudi.

Data Lakehouse Architecture5:22
Explore how data lake house merges data lakes and warehouses into unified storage enabling performance analytics with governance and acid-compliant tables, using open formats like Delta Lake, Iceberg, and Hudi.
Data Lakehouse - Advantages7:16
Explore how data lakehouse unifies lakes and warehouses to deliver cost-effective, scalable analytics, with real-time and batch processing, governance, security, interoperability, and support for Parquet, Avro, JSON.
Data Lakehouse - Challenges4:17
Examine data lakehouse challenges, benefits, and the integration of Spark, Delta Lake, Iceberg, and Trino. Understand governance, metadata, performance, costs, and organizational readiness shaping implementation.
Data Lakehouse - Techs3:04
Explore how data lakehouses use storage formats and table formats with transaction layers and compute engines to enable time travel, metadata governance, and scalable analytics.
Warehouse vs Data Lake vs Data Lakehouse2:45
Compare data lakes, data warehouses, and data lakehouses, outlining schema on read versus schema on write, all data types support, real-time analytics, governance, and BI and machine learning use cases.

Requirements

No Experience is required!

Description

This course provides a comprehensive exploration of modern data platform architectures, focusing on Data Warehouses, Data Lakes, and Data Lakehouses. Learners will begin by understanding the fundamentals of databases, including OLTP and OLAP systems, and their roles in operational and analytical workloads. The curriculum covers the evolution from traditional data warehouses—centralized, structured repositories for business intelligence—to scalable, flexible data lakes that store structured, semi-structured, and unstructured data for advanced analytics and machine learning.

Participants will master the differences between ETL (Extract, Transform, Load) and ELT (Extract, Load, Transform) processes, and learn how to select the right approach for cloud-native platforms. The course highlights the advantages and challenges of each architecture, such as governance, scalability, cost efficiency, and integration complexity.

A major focus is on the emerging Data Lakehouse paradigm, which unifies the best features of lakes and warehouses. Learners will explore key technologies like Delta Lake, Apache Iceberg, and Hudi, and understand how lakehouses enable ACID transactions, schema enforcement, and interoperability with BI and AI tools.

By the end of the course, students will be equipped to design, implement, and optimize data platforms that support real-time analytics, advanced data science, and robust governance for enterprise environments.

This is an ideal course for:

Data Engineers
Data Managers
Data Architects

Who this course is for:

Data Engineers
Data Managers
Data Architects

Data Warehouse, Data Lake & Data Lakehouse - Fundamentals

What you'll learn

Explore related topics

Course content

Introduction1 lecture • 1min

Databases6 lectures • 25min

Data Warehouse, ETL & ELT5 lectures • 23min

Data Lake4 lectures • 16min

Data Lakehouse5 lectures • 23min

Conclusion1 lecture • 3min

Requirements

Description

Who this course is for: