Learn Big Data Testing (Hadoop, Hive, Cassandra, HBase etc.)

Name: Learn Big Data Testing (Hadoop, Hive, Cassandra, HBase etc.)
Rating: 3.5 (165 reviews)

Learn Big Data Testing (Hadoop, Hive, Cassandra, HBase, Unix, Shell, Pig etc.)

Created byBig Data Engineer

Last updated 1/2026

English

What you'll learn

At the end of this course, students would understand the different Big Data Testing Technologies which are used in Big Data and can start working in testing
At the end of this tutorial, students would be able to start their career in Big Data Testing at different levels.
The course is designed to introduce the core concepts, tools, and methodologies associated with Big Data Testing.
This knowledge will allow participants to pursue Big Data Testing roles at various professional levels.

Course content

7 sections • 41 lectures • 6h 21m total length

Introduction1:57
Explore big data testing with Hadoop, Hive, Cassandra, and HBase, and learn how to start, stop, and harness the file system to manage data workflows.
What is Big Data and why it is important?4:45
Discover what big data is and why it matters, and explore how testing big data systems with Hadoop, Hive, Cassandra, and HBase supports reliable analysis.
What is Big Data and Why We Need Big Data Technologies?3:16
Explore what big data is and why we need big data technologies, focusing on storage and analysis and the challenges with databases.

Cassandra Overview and Cassandra Background2:42
Explore Cassandra overview and Cassandra background within the big data testing framework, covering the role of Cassandra alongside Hadoop, Hive, and HBase.
Cassandra Architecture and SPOF11:19
Explore Cassandra architecture and its fault tolerance, highlighting no single point of failure, multi-node backups, master-slave roles, and load distribution.
Cassandra Query15:45
How to Load Data from TXT or CSV files to Cassandra Table?9:09
Load data from text or csv files into a Cassandra table by creating the table with columns and a primary key, then copy data from the file.
Cassandra Collections - SET Type6:58
Explore the Cassandra set type by creating and manipulating sets of values, such as names and phone numbers, and inserting, updating, and querying them within a table.
Cassandra Collections - MAP Type6:32
Explore Cassandra map type collections, modeling a table with id, name, and a map column, and learn inserting, updating, and querying map values such as home and office.
Cassandra Collections - LIST Type6:03
Cassandra collections overview: use the list type to store multiple values in one column, with emails as examples, and insert list values alongside a primary key.
Drop and Truncate in Cassandra6:35
Learn how to drop and truncate in Cassandra by walking through dropping tables, inserting records, and querying data to understand schema and data state.

HBase Queries5:24
Explore starting the hbase service, using the hbase shell, and verifying status to run queries and view values with timestamps.
Table Creation in HBase12:06
Learn how to create an HBase table, define column families, and insert employee data for big data testing.
How to work with Enable, Disable and Describe in HBase Table?6:51
Learn how to enable and disable an HBase table, and how to use describe to view table metadata, with practical steps from creating a table to handling common errors.
HBase Filter6:10
Investigate how HBase filters control which data to display from a table, using column families, prefixes, and value-based criteria to scan and reveal employee records.
ACID and CAP Theorem in HBase6:59
Explore how acid properties apply to HBase and examine the cap theorem in distributed systems. Understand how isolation, consistency, and availability shape data guarantees in practice.

Different Hadoop Running Modes4:44
Understand Hadoop's running modes, starting with standalone mode that uses the local file system for input and output, and note that this mode is now obsolete.
HDFS Comamnds -3 (Cloudera)12:06
Demo shows using hdfs commands in a Cloudera environment to list directories and files, view data, and create directories and files within the Hadoop cluster.
HDFS Comamnds -4 (Cloudera)7:20
Practice hands-on with HDFS commands on Cloudera to read files, inspect directories, and verify file sizes and availability, gaining practical insight into HDFS data management.
HDFS Comamnds -5 (Cloudera)7:13
Learn to copy files from your local system to a Hadoop cluster using HDFS commands, verify transfer with directory listings, and manage files and directories across local and cluster storage.
HDFS Comamnds -6 (Cloudera)7:16
Master HDFS commands to copy files between local and HDFS using put and get, verify with ls, and manage data across a Cloudera cluster.
HDFS Commands -7(Ubuntu)6:35
Learn to run HDFS commands on Ubuntu and start the Hadoop cluster by launching five daemons, including the NameNode, DataNodes, ResourceManager, and NodeManager.
HDFS Commands - 8(Ubuntu)12:40
What all are differrent V's in Big Data?4:31
Different Big Data Supported Data Types5:50
Explore different big data supported data types and how relational databases interact with these formats in Hadoop ecosystems like Hive, Cassandra, and HBase.
Diffrent between Hadoop 1.x and Hadoop 2.x6:39

What is Hive and Different Hive Features3:59
Explore Hive and its features, noting that it is designed for batch processing and not designed for online processing, in the context of big data testing.
How to Install Hive on Ubuntu Machine?6:29
Discover how to install Hive on an Ubuntu machine by downloading the Hive package, extracting it with tar, creating an installation folder, using sudo, and starting Hive after setup.
Hive Queries13:56
Learn to create databases and tables in Hive, insert data, and run Hive queries to explore schemas and content, essential for big data testing with the Hadoop ecosystem.
Differerent Hive Tables - Managed Table13:45
Learn how to create a managed Hive table and load data from the local file system, mapping columns such as name, country, and company, then query the table.
Different Hive Tables - External Table9:03
Create an external table in Hive, specify its location, and load data from external sources. Verify data at the specified location through the external table.
Hive Static Partition17:09
Explore Hive static and dynamic partitions to organize data by column values. Create static partitions in advance for a table like employee; use dynamic partitions when values are unknown.
Hive Dynamic Partition22:24
Hive Bucketing12:07
Explore how Hive bucketing works by creating buckets, configuring enforced bucketing, and analyzing regions and bucket counts to manage dynamic data efficiently.
Hive Index Implementation16:46
Explore Hive index implementation by creating a named index on a table and column, and learn how it affects query performance.
Hive View Implementation11:05
How to Compare Two Hive Tables?17:44
Discover techniques to compare two Hive tables by extracting distinct values, handling nulls, and applying unions and joins to identify differences and harmonize data.

Requirements

As such there is no prerequisite required for this course and it is useful for any level of tester. Any Tester profile candidate can start this tutorial.

Description

This course is for Testing profile candidate who wanted to build there career into Big Data Testing. So I have designed this course so they can start learning big data technologies. All the users who are working or looking for Job in QA profile or wanted to move into big data testing domain should take this course and go through the complete tutorials.

I have included the material which is needed for big data testing profile and it has all the necessary contents which includes practical examples as well depends on questions and there practicality.

It will give the detailed information for different topics like big data hadoop, hive, Hbase, Cassandra, Unix, Shell, Pig along with Agile which is needed by the tester to move into bigger umbrella i.e. Big Data Testing.

This course is well structured with all elements of different technologies in practical manner separated by different topics. Students should take this course who wanted to move into big data testing to advance their career.

Who this course is for:

Any Level of Testing Profile Candidate is the target audience for this course.

Learn Big Data Testing (Hadoop, Hive, Cassandra, HBase etc.)

What you'll learn

Explore related topics

Course content

Big Data Testing - Introduction3 lectures • 10min

Cloudera Environment Setup Process1 lecture • 20min

Complete Section for Cassandra8 lectures • 1hr 5min

Complete Section for HBase5 lectures • 38min

Complete Section for Hadoop10 lectures • 1hr 15min

Complete Section for Hive11 lectures • 2hr 24min

PIG Section3 lectures • 30min

Requirements

Description

Who this course is for: