Apache Druid : A Hands-on Course

Name: Apache Druid : A Hands-on Course
Rating: 4.0 (195 reviews)

A complete, in-depth, hands-on practical course on one of the newest databases for real-time analytics

Created by Shruti Mantri

Last updated 4/2021

English

What you'll learn

Learn a cutting-edge, high-performance real-time analytics database: Apache Druid
Understand how each component of Apache Druid works
Hands-on practicals that help you understand Apache Druid clearly
Learn even those concepts that are not clearly explained in Druid's official documentation

Course content

7 sections • 21 lectures • 2h 23m total length

An Introduction to Apache Druid • 3:07
Explore Apache Druid, a high-performance real-time analytics database that enables fast ad-hoc analytics, instant data visibility, high concurrency, streaming and batch data ingestion with sub-second latency.
Course Objectives • 1:05
Install Apache Druid, explore the Druid console, load data from multiple sources, run queries to explore Druid features, and get an introduction to Druid internals and industry use.
Where to use & where not to use Druid • 1:38

Single Server Installation • 5:38
Install Apache Druid on a local machine via the single-server quickstart, check the requirements (Linux or macOS, and Java 8 update 92 or later), then start the micro-quickstart configuration.
Overview of the Druid Console • 10:58
Explore the Druid console through its tiles, data sources, segments, and services, and learn how to load data, manage ingestion tasks, and run queries with real-time insights.

Load data from file using console • 7:24
Load data from a local disk file into a Druid data source via the console, create and submit an ingestion spec, then query with SQL syntax and apply a countryName filter.
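As a rough illustration of the query step, the sketch below builds the JSON body accepted by Druid's SQL HTTP endpoint (POST /druid/v2/sql) with a filter on countryName. The data source name wikipedia and the helper build_sql_request are assumptions made for illustration and do not come from the course; the code only builds the request body and sends nothing.

```python
import json


def build_sql_request(datasource: str, country: str) -> dict:
    """Build a Druid SQL request body that counts rows for one countryName."""
    query = (
        'SELECT "countryName", COUNT(*) AS "row_count" '
        f'FROM "{datasource}" '
        'WHERE "countryName" = ? '
        'GROUP BY "countryName"'
    )
    # The filter value travels as a typed parameter, not as part of the SQL text.
    return {"query": query, "parameters": [{"type": "VARCHAR", "value": country}]}


# "wikipedia" is a hypothetical data source name used only for this sketch.
body = build_sql_request("wikipedia", "Australia")
print(json.dumps(body, indent=2))
```

In a local quickstart the body would typically be posted to the Router, often at http://localhost:8888, which is also an assumption about the setup.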
Other ways to load data from file • 13:09
Explore multiple methods to load local disk files into Druid, including the console's Load data flow, JSON tasks submitted from the Ingestion tab, the bin/post-index-task script, and curl calls.
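The curl route amounts to an HTTP POST of the ingestion spec to Druid's task endpoint (/druid/indexer/v1/task). A minimal sketch of the same request in Python follows; the base URL http://localhost:8081, the helper build_task_request, and the placeholder spec are assumptions, not material from the course. The request object is only constructed here, not sent.

```python
import json
import urllib.request


def build_task_request(spec: dict, base_url: str = "http://localhost:8081") -> urllib.request.Request:
    """Wrap an ingestion spec in a POST request for the task endpoint."""
    return urllib.request.Request(
        url=f"{base_url}/druid/indexer/v1/task",
        data=json.dumps(spec).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Placeholder spec: a real one also needs dataSchema, ioConfig and tuningConfig.
spec = {"type": "index_parallel", "spec": {}}
request = build_task_request(spec)
print(request.get_method(), request.full_url)
```

Passing the request to urllib.request.urlopen would submit it, assuming a Druid instance is listening at that address.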
Transformations over data • 7:15
Transform data during load in Apache Druid by adding transformed columns, such as uppercasing country names and computing comment lengths, using transform expressions like upper and strlen.
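For orientation, the two transforms described above could be expressed in the transformSpec part of an ingestion spec roughly as follows. The output column names countryUpper and commentLength, the input column comment, and the helper expression_transform are hypothetical; the course's own spec may differ.

```python
import json


def expression_transform(name: str, expression: str) -> dict:
    """Describe one expression transform that adds a derived column."""
    return {"type": "expression", "name": name, "expression": expression}


transform_spec = {
    "transforms": [
        # Uppercase the country name into a new column (hypothetical name).
        expression_transform("countryUpper", 'upper("countryName")'),
        # Store the length of the comment text (hypothetical column names).
        expression_transform("commentLength", 'strlen("comment")'),
    ]
}

print(json.dumps(transform_spec, indent=2))
```

Each transform adds a new column and leaves the input column in place, which keeps the original value available for comparison in queries.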
Applying filters over data • 4:54
Apply a row filter at load time with a like condition on countryName so that only Australia passes; the sample confirms that Germany yields no records.
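A like filter of this kind can be written as a small JSON object inside the ingestion spec's transformSpec. The sketch below shows one plausible shape and, purely to illustrate the effect, emulates LIKE matching locally in Python. The emulation (like_to_regex, passes) and the sample rows are assumptions for illustration, not Druid's implementation, and escape characters are ignored.

```python
import re

# Filter as it might appear under transformSpec in an ingestion spec.
like_filter = {"type": "like", "dimension": "countryName", "pattern": "Australia"}


def like_to_regex(pattern: str) -> re.Pattern:
    """Translate a LIKE pattern: % matches any run of characters, _ matches one."""
    parts = []
    for ch in pattern:
        if ch == "%":
            parts.append(".*")
        elif ch == "_":
            parts.append(".")
        else:
            parts.append(re.escape(ch))
    return re.compile("".join(parts), re.DOTALL)


def passes(row: dict, flt: dict) -> bool:
    """Return True when the row's dimension value matches the filter pattern."""
    value = row.get(flt["dimension"]) or ""
    return like_to_regex(flt["pattern"]).fullmatch(value) is not None


# Hypothetical sample rows.
rows = [{"countryName": "Australia"}, {"countryName": "Germany"}, {"countryName": None}]
kept = [row for row in rows if passes(row, like_filter)]
print(kept)
```

With the pattern Australia and no wildcards, the filter behaves like an exact match, so rows for any other country are dropped before they are stored.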
Parsing nested data • 6:00
Load a nested JSON file, flatten the address fields into city, state, and pincode, then configure time parsing, disable rollup, set day granularity, and query to confirm five rows.
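The flattening and granularity settings described above map onto the flattenSpec and granularitySpec parts of an ingestion spec. The sketch below shows one plausible form. The nested key names under address, the reading of "day granularity" as segment granularity, the helper names, and the sample record are all assumptions; the local resolve_path function only mimics simple dotted paths and is not Druid's JSONPath engine.

```python
def path_field(name: str, expr: str) -> dict:
    """Describe one flattened column extracted with a JSONPath expression."""
    return {"type": "path", "name": name, "expr": expr}


input_format = {
    "type": "json",
    "flattenSpec": {
        "useFieldDiscovery": True,
        "fields": [
            path_field("city", "$.address.city"),
            path_field("state", "$.address.state"),
            path_field("pincode", "$.address.pincode"),
        ],
    },
}

# Rollup disabled and day granularity (assumed to mean segment granularity).
granularity_spec = {
    "type": "uniform",
    "segmentGranularity": "day",
    "queryGranularity": "none",
    "rollup": False,
}


def resolve_path(record: dict, expr: str):
    """Follow a simple '$.a.b' path through nested dicts (local illustration only)."""
    value = record
    for key in expr.split(".")[1:]:  # skip the leading "$"
        value = value[key]
    return value


# Hypothetical nested record.
record = {"name": "A", "address": {"city": "Pune", "state": "MH", "pincode": "411001"}}
flat = {f["name"]: resolve_path(record, f["expr"]) for f in input_format["flattenSpec"]["fields"]}
print(flat)
```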
Rollup Data • 7:44
Data Deletion • 11:22

Integrating Superset with Druid • 9:51
Learn to integrate Apache Druid with Superset, configure security and host settings, start Superset via Docker, connect to Druid, and create datasets and time series visualizations.
Industry Use-case: Druid at Netflix • 6:53
See how Netflix uses Apache Druid for real-time insights at scale, ingesting two million events per second and delivering sub-second to few-second queries with rollup and compaction.

Requirements

Basic knowledge of SQL is good to have, but optional.

Description

Apache Druid is a high-performance real-time analytics database. It comes with some excellent features and has proved to be very efficient at managing large amounts of data, including real-time data, and serving results with sub-second latency. It is fast, resilient, scalable, secure, easy to query, and easy to ingest data into.

Apache Druid is one of the newest databases in the Big Data ecosystem and is rapidly gaining momentum in the market. It plays a crucial role in real-time analytics pipelines.

Demand for Druid in the market is already growing. Big companies like Netflix, Airbnb, Google, and Walmart have already started using Apache Druid to process their real-time big data, and thousands of others are adopting it.

Apache Druid combines the best features of search platforms, time-series databases, and OLAP systems. So, if you have data that is organized around time, if you are slicing and dicing that data for user-facing analytics, or if you are doing full-text search, these are all markers of a good use case for Druid.

What's included in the course?

A complete course on Apache Druid concepts and capabilities, explained from scratch through to production use cases.
Every Apache Druid concept is explained with a hands-on exercise.
Includes even those concepts whose explanation is not very clear in Druid's official documentation.
The commands and datasets used in the lectures are attached to the course for your convenience.

Who this course is for:

Students who want to learn Apache Druid from scratch through to its implementation in a live project.
Developers who want to upgrade their skills to Apache's latest Big Data real-time analytics database.
Developers who are exploring a high-performing database in the Big Data world.
Developers who are exploring a database that can handle real-time data with analytical capabilities.

Course content

Introduction • 3 lectures • 6min
Installation and Overview • 2 lectures • 17min
Understanding Data Load • 7 lectures • 58min
Other Data Loads • 3 lectures • 22min
Querying • 3 lectures • 16min
Druid Internals • 1 lecture • 9min
Use cases • 2 lectures • 17min
