SQL, NoSQL, Big Data and Hadoop

Name: SQL, NoSQL, Big Data and Hadoop
Rating: 4.3 (521 reviews)

A comprehensive journey through the world of database and data engineering concepts - from SQL, NoSQL to Hadoop

Created byMichael Enudi

Last updated 1/2020

English

What you'll learn

Build an intuition from RDBMS system through NoSQL to the Big Data on the Cloud and Hadoop platform
Understand various distributed database classifications
Understand when and how to use Redis or Key-Value Stores
Understand when and how to use MongoDB or Document-oriented databases
Understand and use HBase as a Wide-Columnar Store
Understand and use Time series database (InfluxDB)
Understand and use Elasticsearch as a search engine
Understand and use Neo4J as a Graph Database Management System
Understand large scale distributed data storage and processing in Hadoop
Understand when and how to use and build Streaming architecture with Apache Kafka
Use Apache Hive and Understand where to use it in respect to big data platforms
Understand a number of SQL-on-Hadoop Engines and how they work
Understand how to use data engineering capabilities to enable a data-driven organization

Course content

13 sections • 129 lectures • 22h 6m total length

Introduction7:07
Building a Data-driven Organization - Introduction3:48
Data Engineering6:09
Learning Environment & Course Material3:30
Movielens Dataset3:18

Introduction to Relational Databases8:36
SQL4:47
Movielens Relational Model14:52
Movielens Relational Model: Normalization vs Denormalization15:33
MySQL5:06
Movielens in MySQL: Database import5:42
OLTP in RDBMS: CRUD Applications17:05
Indexes15:34
Data Warehousing15:27
Analytical Processing16:36
Transaction Logs6:19
Relational Databases - Wrap Up3:04
Relational databases dominate the market with a flexible ad hoc query model using select and where clauses; learn sql topics from oltp to olap, normalization, and stored procedures.

Introduction to KV Stores2:21
Explore key value stores, the simplest, high-performance databases with a key-value model, no SQL, and no joins, ideal for distributed in-memory caching and fast lookups using Redis.
Redis3:57
Install Redis7:07
Time Complexity of Algorithm4:39
Data Structures in Redis : Key & String20:16
Data Structures in Redis II : Hash & List18:14
Data structures in Redis III : Set & Sorted Set20:34
Data structures in Redis IV : Geo & HyperLogLog10:33
Data structures in Redis V : Pubsub & Transaction7:50
Modelling Movielens in Redis11:23
Redis Example in Application28:54
KV Stores: Wrap Up2:03

Introduction to Document-Oriented Databases4:33
MongoDB3:47
MongoDB installation1:30
Movielens in MongoDB13:12
Movielens in MongoDB: Normalization vs Denormalization11:19
Movielens in MongoDB: Implementation10:00
CRUD Operations in MongoDB12:46
Indexes15:30
MongoDB Aggregation Query - MapReduce function9:19
MongoDB Aggregation Query - Aggregation Framework15:39
Demo: MySQL vs MongoDB. Modeling with Spark1:49
Document Stores: Wrap Up3:07

Introduction to Big Data With Apache Hadoop6:22
Big Data Storage in Hadoop (HDFS)15:46
Big Data Processing : YARN11:01
Installation12:40
Data Processing in Hadoop (MapReduce)14:01
Examples in MapReduce24:51
Data Processing in Hadoop (Pig)11:45
Examples in Pig21:18
Data Processing in Hadoop (Spark)8:48
Examples in Spark22:36
Data Analytics with Apache Spark8:59
Data Compression5:41
Data serialization and storage formats19:50
Hadoop: Wrap Up7:06

Requirements

No strict requirement but knowledge of relational database will be helpful.
A Windows, Linux or Mac Machine to set up a lab
Any Hadoop Vendor Sandbox like Cloudera Quickstart or HDP VM (Hadoop)

Description

A comprehensive look at the wide landscape of database systems and how to make a good choice in your next project

The first time we ask or answer any question regarding databases is when building an application. The next is either when our choice of database becomes a bottleneck or when we need to do large-scale data analytics.

This course covers almost all classes of databases or data storage platform there are and when to consider using them. It is a great journey through databases that will be great for software developers, big data engineers, data analysts as well as decision makers. It is not an in-depth look into each of the databases but promises to get you up and running with your first project for each class.

In this course, we are going to cover

Relational Database Systems, their features, use cases and limitations
Why NoSQL?
CAP Theorem
Key-Value store and their use cases
Document-oriented databases and their use cases
Wide-columnar store and their use cases
Time-series databases and their use cases
Search Engines and their use cases
Graph databases and their use cases
Distributed Logs and real time streaming systems
Hadoop and its use cases
SQL-on-Hadoop tools and their use cases
How to make informed decisions in building a good data storage platform

What is the target audience?

Chief data officers
Application developer
Data analyst
Data architects
Data engineers
Students
Anyone who wants to understand Hadoop from a database perspective.

What this course does not cover?

This course does not access any of the databases from the administrative perspective. So we don't cover administrative tasks like security, backup, recovery, migration and the likes.
Very in-depth features in the specific databases in discussion. An example is that we will not go into the different database engines for MySQL or how to write a stored procedures.

What are the requirements?
The lab for this course can be carried out in any machine (Microsoft Windows, Linux, Mac OX).
However, the training on HBase or Hadoop will require you to have a hadoop environment. The suggestion for this will be to to use a pre-installed sandbox, a cloud offering or install your own custom sandbox.

What do I need to know to get the best out of this course?
This course does not assume any knowledge of NoSQL or data engineering.
However a little knowledge of RDBMS (even Microsoft Access) is enough to get you into the best position for this course.

Who this course is for:

Chief Data Officers
IT Decision Makers
Database Architects
Software Developers
Big data Engineers
Anyone who wants to understand the where each NoSQL class of database best fits.
Anyone who is curious about NoSQL or Big Data Systems

SQL, NoSQL, Big Data and Hadoop

What you'll learn

Explore related topics

Course content

Introduction5 lectures • 24min

Relational Database Systems12 lectures • 2hr 9min

Database Classification4 lectures • 32min

Key-Value Store12 lectures • 2hr 18min

Document-Oriented Databases12 lectures • 1hr 43min

Search Engine10 lectures • 2hr 15min

Wide Column Store11 lectures • 1hr 50min

Time Series Databases8 lectures • 1hr 21min

Graph Databases12 lectures • 2hr 11min

Hadoop Platform14 lectures • 3hr 11min

Requirements

Description

Who this course is for: