Mastering Apache Cassandra Developer and Admin from Scratch

Master Cassandra Developer and Admin concepts and techniques for web development. Become a Master in NOSQL database.

Created byEasylearning guru

Last updated 12/2015

English

What you'll learn

Understand the fundamentals of Cassandra
Installation and Configuration of Apache Cassandra
Understand the Architecture of Cassandra and various components for configuring it.
Learn how to implement Cassandra Query Language
Understanding the various building blocks of Cassandra
Learn how to create a database and hence inserting data.
Understand the various data modelling techniques.
Learn the key concepts of reading and writing data.
Learn how to update and delete the data.
Get to know how Cassandra interacts with the clients.
Understand various Cassandra monitoring and administration techniques.
Get to know how to maintain Apache Cassandra
Creating an application using Apache Cassandra
Learn how to add nodes to the cluster.
Understand how to monitor the nodes in the Cluster.
Get an understanding of establishing the Cluster for multiple Datacenters.

Course content

17 sections • 100 lectures • 6h 9m total length

Introduction to Cassandra3:59
Explore Apache Cassandra, a masterless distributed white column store designed for high availability across commodity servers, offering native language drivers, fault detection and recovery, and easy operations.
Prerequisites1:00
Outline the prerequisites for mastering Apache Cassandra, noting familiarity with keys, indexes, and distributed systems can be advantageous and that you will learn them during the course.
What you will learn?4:14
Use Cases for Cassandra5:40

NoSQL Databases7:16
Explore the rise of NoSQL databases, their schema-less, highly scalable design for big data and real-time apps, and the four types—key-value, wide-column, document, and graph—using Cassandra as a key example.
CAP theorem6:52
Cap theorem explains the tradeoffs among consistency, availability, and partition tolerance, detailing the cp, ap, and ca combinations and contrasting them with acid and base properties in Cassandra.
NoSQL vs RDBMS1:47
What is Cassandra?4:28
Explore Cassandra, an open source distributed, decentralized database inspired by Bigtable, designed for high availability, no single point of failure, elasticity, and tunable consistency.

Downloading Cassandra1:55
Download Apache Cassandra from the official site, select the latest version, and use community or enterprise options via Planet Cassandra and mirror links for tarball downloads.
Ensuring Oracle Java 7 is Installed1:55
Install Oracle JDK 7 or higher, verify its presence with java -version, then install Cassandra, ensuring the environment uses Oracle Java rather than OpenJDK.
Installing Cassandra7:17
Install Cassandra across platforms, review the Cassandra.yaml configuration, inspect data and log directories, and set up directories and permissions while examining the partitioner settings.
Starting Cassandra2:26
Learn to start Cassandra from the command line, locate its running process ID in a new terminal, verify the process is active, and safely stop it with Control-C.

Cassandra – A Distributed Database2:10
Cassandra distributes data across multiple nodes to enable a distributed database with horizontal scalability. Its peer-to-peer architecture avoids a single point of failure and ensures data availability across data centers.
System Keyspaces1:49
Explore the system keyspace in Cassandra, which stores cluster operation data such as space usage and system settings, plus notes on node bootstrap, migrations, and dynamic loading.
Peer to Peer Model3:44
What Gossip Protocol is for?3:42
Explore how the gossip protocol powers Cassandra's peer-to-peer communication, enabling failure detection and replication through epidemic-style rounds where nodes exchange digests, acks, and state information.
Anti-entropy and Read Repair4:18
Explore anti-entropy and read repair in Cassandra, using the gossip protocol to update replicas to the newest version and a hash tree to summarize blocks for background repairs.
Memtables, SSTables and Commit Logs2:28
Discover how Cassandra uses memtables and commit logs to capture writes in memory, then flushes to SSTables on disk, where immutable files are compacted to support crash recovery and reads.
Compaction, Bloom Filters and Tombstones4:44
Explore how compaction frees space by merging sstables, how tombstones mark deletions until compaction, and how bloom filters reduce disk reads to boost Cassandra performance.

Components for Configuring Cassandra1:11
Explore how to configure Cassandra from default to customized setups, focusing on keyspace, replicas, replica placement strategy, replication factors, virtual nodes, petitioners, and niches.
Keyspaces1:14
Explore keyspaces, the basic unit in Cassandra, created using the create keyspace command, and how a keyspace defines a database-like hierarchy with tables, column names, and values.
Replicas2:05
Explore how Cassandra handles replicas, replication factor, and replica placement within the ring, including tokens, partitioners, and replication strategies.
Replica Placement Strategies5:00
Explore replica placement strategies in Cassandra, including SimpleStrategy, NetworkTopologyStrategy, and rack placement, to optimize data distribution across data centers and racks.
Replication Factor3:52
Learn how replication factor determines how many copies of each data are stored and distributed across Cassandra clusters, and how consistency levels relate to replication factor.
Virtual Nodes2:10
Explore virtual nodes in Cassandra, where each node handles many token ranges or slices, defaulting to 256, to ease adding nodes and keep the cluster balanced.
Partitioners2:51
Mastering Apache Cassandra: partitioners determine how keys are sorted and distributed across nodes, shaping range queries and performance; Cassandra offers random, order-preserving, and byte-order partitioners.
Snitches4:46
This lecture explains how Cassandra snitches determine node proximity and cluster topology, guiding routing and replication across data centers and racks, and reviews simple, dynamic, and property file snitches.

Describing Cassandra1:58
Describe how Cassandra stores its data: a cluster contains keyspaces, each keyspace holds column families, and each column family contains rows with multiple column names and their values.
Cluster1:14
Explore how a Cassandra cluster distributes data across multiple nodes, delivering a single logical view. Learn about nodes, replicas, and the replication factor that keeps data available.
Keyspaces and Column Family4:28
Explains keyspaces as the outer container for data and column families as ordered collections of rows and columns. Shows columns as basic units with a name, value, and clock.

What is a Cassandra Database?2:43
Discover what Cassandra is by examining keyspaces, including system and system traces keyspaces. Learn to list keyspaces and describe a keyspace to view its tables and definitions in the cluster.
Query differences between RDBMS and Cassandra2:41
Defining a KeySpace5:25
Create keyspaces with the create keyspace command, choose a replication strategy such as network topology strategy or simple strategy, and set replication factor per data center; verify with describe keyspace.
Data Types4:40
Mastering Apache Cassandra from scratch reveals a wide range of data types, including string, 64-bit long, blob, boolean, counters, lists, maps, sets, and inet, timestamp, uuid, and text.
Creating a Table4:04
Create a Cassandra table by defining columns like id, date_time, and text, with a primary key and clustering by date. Use the keyspace and drop tables as needed.
A Primary Key2:31
A Partition Key2:55
Specifying the Clustering Order5:25
Specify the clustering order for a Cassandra table by selecting the column and choosing ascending or descending. Understand defaults and when you must redefine the table to change the order.
Deleting a KeySpace2:09
Learn to delete a keyspace in Cassandra using the drop keyspace command and verify deletion with describe keyspace to confirm removal.

Different ways of Writing the Data1:43
Using the INSERT INTO Command6:18
Mastering Apache Cassandra: use insert into to add a single row with specified id, date time, and event, and use select to retrieve specific columns from the activity table.
Using the COPY Command6:03
Unpack the Cassandra copy command, showing how to copy data between tables by listing columns and matching order, then run a select to verify results.
Storing data in Cassandra4:47
Discover how Cassandra stores data, highlighting the partition key as the internal factor that uniquely identifies rows alongside the primary key, and inspect table contents with examples.

Design differences between RDBMS and Cassandra4:29
Discover how Cassandra differs from RDBMS: no standard query language but an API, range-based sorting instead of order by, secondary indexes, normalization, and no referential integrity or joins.
Design Patterns3:48
Explore cassandra design patterns such as materialized views with a second column family for denormalization techniques, valueless columns, and aggregate keys for efficient lookups.
Using the WHERE Clause3:56
Learn how the where clause in Cassandra retrieves data by specifying the partition key or primary key, and why queries on non-key columns fail without secondary indexes, with practical examples.
Using Secondary Indexes6:30
learn how Cassandra uses secondary indexes for non primary key lookups, with a per node hidden index, and that they don’t speed queries; consider a query-specific table.
A Composite Partition Key8:52
Explore how a composite partition key uses multiple columns to prevent endless partition growth in cassandra, with Wakil ID, date, and time.

Requirements

Prior knowledge of any database will be helpful.
Understanding of SQL syntax can also prove to be advantageous.

Description

In this course you will learn about Apache's NoSQL Database-Cassandra and how is it used to store the Big-Data.

It starts with an introduction to the database along with its prominent use cases, an insight view of its architecture and various components involved in configuring the database.
Also you learn the various operations that can be performed on the database such as creating the database, inserting data, deleting and updating data.
You will learn how to monitor your database and the concepts like adding nodes to the cluster and managing these nodes will also be explained.
This course contains lectures as videos along with the hands-on implementation of the concepts, additional assignments are also provided in the last section for your self practice, working files are provided along with the first lecture "Introduction to Cassandra" and some links for further reading are also provided for more help.

Who this course is for:

This Apache Cassandra course is meant for the learners who wish to understand the concepts of this NoSQL Database starting from the basics as well complex concepts such as adding nodes to the Cluster.

Mastering Apache Cassandra Developer and Admin from Scratch

What you'll learn

Explore related topics

Course content

Introduction to the Course4 lectures • 15min

Getting started with Apache Cassandra4 lectures • 20min

Installing Cassandra4 lectures • 14min

Architecture of Cassandra7 lectures • 23min

Configuring Cassandra8 lectures • 23min

Cassandra Query Language2 lectures • 8min

Building Blocks of Cassandra3 lectures • 8min

Creating a Database9 lectures • 33min

Inserting the Data4 lectures • 19min

Data Modeling in Cassandra5 lectures • 28min

Requirements

Description

Who this course is for: