
Explore Apache Superset architecture, Docker fundamentals, install and run Superset, upload data, build dashboards, and prepare production deployments on AWS with security, CSS templates, and Snowflake integration.
Docker packages an application with code, libraries, and settings into a container that runs identically anywhere. This eliminates the works on my machine problem by ensuring consistent behavior.
Bundle code, libraries, binaries, and configuration into immutable Docker images built in layers, a portable blueprint that can be shared and deployed anywhere.
Explore docker networking with two BusyBox containers to see IP-based communication. Create a user defined network to enable name based resolution and container discovery.
Follow the chapter roadmap for installing Apache Superset with Docker, explore the UI and dashboards, and learn how datasets connect to data.
Discover essential resources for Apache Superset, including official docs, preset documentation, GitHub releases, and Docker Hub tags, to install, configure, and troubleshoot with confidence.
Install Apache Superset with Docker Compose by building a custom image, configuring settings, creating an admin user, and loading sample dashboards to explore the interface.
Create a simple yet realistic Covid dashboard in Apache Superset by exploring the dataset, spinning up PostgreSQL in Docker, configuring Superset, and building charts into a dashboard.
Outline the project structure for a three-container Apache Superset setup, with a Covid dataset database, a separate metadata database, and a main app, using env files for portability.
Create a COVID dashboard in Apache Superset by dragging visualizations into a layout with headers and markdown, maps, pies, and bars with country and deaths filters on a horizontal bar.
Explore the roadmap from version 4 to 6, including ui changes, sql lab updates, toml, custom head tag injections, ant design theming, and jinja capabilities.
Learn how uv, a fast, all-in-one Python package manager written in Rust, replaces pip, and handles virtual environments, dependencies, and Python versions, while understanding TOML-based pyproject configurations.
Set up a two-container docker environment for the superset project, including metadata database and custom logos and favicon, then iterate by updating the superset config to observe UI changes.
Customize the Apache Superset logo in a docker setup by adjusting the logo target path, image, width, and tooltip, then add right-side text and restart the container to apply changes.
Modify the browser tab layout in Superset by updating the favicon and tab text, using Visual Studio Code to edit the config and restart the container.
Snowflake, a fully managed cloud data platform, separates storage and compute for elastic scalability and cost efficiency, with a three-layer architecture and easy superset integration for analytics.
Prepare a Snowflake dataset for Apache Superset by creating warehouses, setting role permissions, building a superset database and denormalized views, and loading sample data for dashboards.
Register a dataset in Superset by adding a Snowflake connection, selecting the public schema and view, and creating the dataset from the widget menu.
Create a world map chart to visualize revenue by country, fix the country field to full name, save as country level revenue, and scale Snowflake warehouse to medium.
Explore building a calendar heatmap in apache superset to visualize daily revenue intensity, configure time ranges, adjust visuals, and optimize for dashboard clarity.
Build a tree map to visualize net revenue by market segment with rectangles showing parts of a whole and hierarchies; note Superset lacks interactive drill down and labels/colors affect readability.
Configure filters in superset, including value and time range types, with region and nation pre-filters to shape a dashboard across all visuals.
Explore how to customize dashboards in Apache Superset, from coloring and CSS customization to dynamic SQL with Jinja, and set up static and dynamic row-level security with roles and permissions.
Embed a branding image in a Superset dashboard with CSS, loading an AWS S3 image, resolving content security policy issues, and positioning via an after pseudo-element.
Explore static row level security in Superset, where hardcoded policies use regular or base filters to modify where clauses, and configure rules in settings attached to roles.
Demonstrate creating a static dataset and treemap, then configure two regular row level security policies with group keys to show how or and and combinations constrain regions and market segments.
Configure static row level security with a base type in Apache Superset by creating roles, assigning users, and applying a not in region filter.
Learn how to retrieve and filter user roles in Superset 6 using the built-in current user roles macro, with a live SQL Lab example and where-in filter.
Explore alerts and reports in Superset, automated notifications triggered by data conditions and scheduled dashboard snapshots delivered via email or Slack, with Gmail examples.
Understand caching basics, including cache hit and miss, and how Redis provides fast in-memory caching and as a message broker with producers, consumers, and queues.
Explore Celery, an open source task system for asynchronous execution and scheduling with workers and Celery Beat. Use Redis as broker and result backend to queue, distribute, and track tasks.
Explore the project setup for a dockerized Apache Superset deployment, including a dedicated config, sql directory for alerts, centralized envs, and dual docker compose files for one-time init.
Use shell scripts to initialize and upgrade the superset metadata database, create an admin user and credentials, and run superset with gunicorn and event workers for scalable docker deployment.
Set up an alert in Apache Superset by supplying a numeric query and threshold, trigger when discounts exceed 50%, and receive hourly email notifications with a chart image.
Clean up the dockerized superset setup by bringing down services with docker compose down, removing containers and images, and preserving volumes to keep the metadata database intact for future use.
Welcome to "Apache Superset: From Zero to Hero with Docker and AWS"!
This hands-on course is designed to take you from your very first dashboard to a fully deployed production environment in the cloud.
Apache Superset is one of the fastest-growing open-source BI platforms used by data teams worldwide. Whether you're a Data Analyst, BI Developer, or Data Engineer, this course will guide you step by step through installation, configuration, dashboard building, customization, and enterprise-grade deployment.
By the end of the course, you’ll be able to:
Install and run Superset locally using Docker
Build interactive dashboards and visualizations
Connect to databases like PostgreSQL and Snowflake
Customize Superset to match your brand
Implement advanced features such as Jinja templating, Row Level Security, and custom styling
Set up alerts, reports, and background task processing
Deploy your Superset environment on AWS with production-ready components
This course focuses on real-world examples, clear explanations, and practical demonstrations - no unnecessary theory, just the knowledge you need to become confident with Apache Superset.
Section 1: Introduction
Get to know the course, its audience, and what tools we’ll use.
Meet your instructor and understand course goals
Learn who this course is for and what to expect
Review the setup and tools required
Section 2: Overview
Understand Superset’s architecture and how it compares with other BI tools.
Explore Superset’s core components and data flow
Compare Superset with Tableau, Power BI, and Looker
Learn about different installation options
Section 3: Docker Introduction
Build the foundation to run Superset locally using Docker.
Learn what containers, images, and volumes are
Understand Docker Compose and how services interact
Section 4: Getting Started with Superset
Dive into the Superset UI and learn its core capabilities.
Explore dashboards, datasets, and visualizations
Understand the navigation and main features
Discover how Superset organizes data sources
Section 5: Building the COVID-19 Dashboard
Create your first interactive dashboard using real data.
Connect Superset to a database
Build and configure visualizations step by step
Assemble and publish your first dashboard
Section 6: Superset 5.0 & 6.0 Updates
A newly added chapter that introduces key updates from newer versions of Superset.
Configure newer superset versions
Dive into AntD theming for in-depth customisation
Learn new Jinja capabilities and visualisations
Section 7: Customization
Make Superset reflect your brand and style.
Change logos, colors, and themes
Section 8: Snowflake Dashboard
Work with a cloud data source and advanced filters.
Connect Superset to Snowflake
Explore datasets and build a new dashboard
Learn Superset filtering in detail
Section 9: Advanced Features
Unlock Superset’s most powerful capabilities.
Use Jinja templating for dynamic queries
Implement Row Level Security (RLS)
Manage roles and permissions
Add custom CSS styling
Section 10: Alerts & Reports
Enable automated reports and notifications in Superset.
Set up Celery and Redis for async tasks
Configure email alerts and report scheduling
Understand production-level configuration needs
Section 11: AWS Deployment
Deploy Superset in the cloud like a pro using IaC approach.
Set up Superset on AWS ECS Fargate
Integrate with RDS PostgreSQL and Redis
Ensure scalability, reliability, and secure access