
Explore the fundamentals of data warehousing, including why businesses need a data warehouse, oltp vs olap, etl, dimensional modeling with facts and dimensions, star and snowflake schemas, and architecture.
Identify what a data warehouse is, its role in consolidating data from multiple sources through ETL, and its use for reports, dashboards, and decision making.
Discover how a data warehouse consolidates and summarizes data from multiple systems to enable consistent reporting across departments. Use sliced, fact-based insights to support executive decisions.
Demonstrate how dashboards and data warehouses consolidate time-based sales, costs, margins, and customer data to reveal what’s happening in a business.
Identify the ideal data warehouse as easily accessible, fast, consistent in language across teams, flexible to change, secure, and foundational for decision making, with data populated overnight or within day.
Compare online transactional processing (OLTP) and online analytical processing (OLAP), showing OLTP handles current, high-volume transactions in transactional databases, while OLAP powers historical data analysis in data warehouses.
Learn how extract, transform and load consolidate data from diverse sources—databases, legacy systems, cloud, on premises servers, and mobile devices—into a centralized data warehouse, ensuring consistency and accuracy.
Dimensional modeling optimizes data warehouse structures by organizing data into facts, dimensions, and attributes in a star schema, enabling fast, understandable analysis of products, locations, time, sales amounts, and profits.
Fact tables store additive measurements such as sales, profit, and quantity, with one row per measurement event identified by a composite key and linked to dimensions via foreign keys.
Provide the context for business processes with dimension tables linked to a fact table, featuring product key details and primary/foreign key relationships.
The star schema centers a sales fact with dimensions such as store, employee, customer, product, and date. The snowflake schema normalizes these dimensions into additional tables, depending on business needs.
Install and integrate data warehousing tools using SQL Server, SSMS, Visual Studio, and SSIS to extract, transform, and load data for dashboards and business intelligence.
Install the SQL Server 2019 developer edition and set up SQL Server Management Studio (SSMS), including downloading, accepting prompts, and pinning SSMS to the start menu.
Learn to download and restore the AdventureWorks data warehouse backups in SQL Server Management Studio, including copying the .bak to the C drive and restoring to your instance.
Download and install Visual Studio community 2019, selecting the data storage and processing workload to work with SQL Server. Learn about the free license, sign-in options, and restart prompts.
Install the SQL Server Integration Services extension for Visual Studio from the online marketplace, download the SSIS project package, restart when prompted, and verify by creating a new SSIS project.
Navigate Visual Studio 2019 by exploring solution explorer, server explorer, and the SARS toolbox, adjust layouts, and open or create projects to work with data flow tasks.
Create etl package that extracts from a flat file, uses lookups for currency and date keys, and loads into a fact currency rate plus table in Adventure Works DW 2012.
Start by creating a new integration services project in Visual Studio, naming it, and choosing its storage location, with a solution containing one project and one package.
Add and configure a flat file connection manager, name it descriptively, browse the source file, define columns and data types, and prepare keys for currency and date.
Add and configure an OLE DB connection manager to connect to local host DW 2012 database using Windows authentication, test the connection, and verify access to the Adventure Works database.
Drag a data flow task from the SSIS toolbox onto the control flow pane, rename it to extract, and double-click to enter the data flow tab.
In the data flow tab, add the flat file source using the flat file connection manager, name it extract sample currency data, and verify columns before clicking OK.
Configure lookup transformations by dragging a look up object, naming it currency key, and connecting it to the prior task to pull currency keys via a full cache. Return the new currency and date keys by adding a second look up for the date key and enabling partial cache for the dim date.
Add and configure the OLE DB destination to load data into the database, map input columns, and create the fact currency rate table using the currency key for dimension lookups.
Format and center all objects, then annotate data flow that extracts data from a file and writes to fact currency rate plus table using currency key and date key columns.
Test the lesson 1 package by debugging, confirming successful package execution, data load, and date and lookup matches, and verify the new fact currency rate plus table with five columns.
This course is an overview of basic data warehousing concepts, a guide for installing software and a step-by-step tutorial on using the software. This course is divided into three main sections:
Data Warehousing Basics
We begin with a conversation about data warehousing concepts, including the following:
What is a data warehouse?
Why do we need a data warehouse?
What is Extract, Transform and Load (ETL)?
The difference between OLAP and OLTP databases
The Star and Snowflake schemas
Fact and Dimension tables
These concepts can be difficult to understand, so I explain them using real conversation and examples. We have this conversation first so that you are familiar with the ideas as they are discussed in the later sections of the course.
Data Warehousing Software Installation
If you want to become good at data warehousing, you need to use the software. In this section I start by talking with you about the software and explain how the different pieces work together. Next is a step-by-step walkthrough of installing SQL Server Developer, SQL Server Management Studio (SSMS) and Visual Studio Community with the SQL Server Integration Services (SSIS) package. The versions of software we download are all free for you to use.
SSIS Tutorial: Create a Project and Basic Package with SSIS
The last section of the course is a step-by-step tutorial on using the ETL tool SSIS. The tutorial is broken down into nine steps:
Step 1: Create a new integration services project
Step 2: Add and configure a flat file connection manager
Step 3: Add and configure an OLE DB connection manager
Step 4: Add a data flow task to the package
Step 5: Add and configure the flat file source
Step 6: Add and configure the lookup transformations
Step 7: Add and configure the OLE DB destination
Step 8: Make the Lesson 1 package easier to understand
Step 9: Test the Lesson 1 package
Before we being the tutorial, I discuss the steps in detail. I explain the source data, how the data will be transformed and how we will get the data to it's destination database.