Udemy
From 0 to 1: The Cassandra Distributed Database
Rating: 4.1 out of 5 (2,613 ratings)
16,175 students

A complete guide to getting started with cluster management and queries on Cassandra
Created by Loony Corn
Last updated 10/2016
English

What you'll learn

  • Set up a cluster, keyspaces, column families and manage them
  • Run queries using the CQL command shell
  • Design primary keys and secondary indexes with partitioning and clustering considerations
  • Use the Cassandra Java driver to connect and run queries on the cluster

Course content

10 sections • 46 lectures • 5h 54m total length
  • You, This Course and Us (1:45)

    Discover how Cassandra delivers scalable, high-performance data management as a distributed NoSQL database, using a column-oriented data model built on keyspaces and column families, and CQL, its SQL-like query language.

Requirements

  • The basics of SQL and traditional relational databases
  • The basics of Java in order to use the Cassandra Java library

Description

Taught by a team that includes two Stanford-educated ex-Googlers and two ex-Flipkart lead analysts. The team has decades of practical experience working with large-scale data processing.

Has your data gotten huge, unwieldy and hard to manage with a traditional database? Is your data unstructured with an expanding list of attributes? Do you want to ensure your data is always available even with server crashes? Look beyond Hadoop - the Cassandra distributed database is the solution to your problems.

Let's parse that.

  • Huge, unwieldy data: This course helps you set up a cluster with multiple nodes to distribute data across machines
  • Unstructured: Cassandra is a column-family (wide-column) store. Rows need not share the same set of columns, so no space is wasted on empty cells when you store data with variable and expanding attributes
  • Always available: Cassandra uses partitioning and replication to ensure that your data is available even when nodes in a cluster go down
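
To make the partitioning and replication idea concrete, here is a minimal toy sketch in Python. It is not Cassandra's actual implementation: the node names, the `token` and `replicas_for` helpers, and the use of MD5 as the hash are all assumptions made for illustration. It hashes a partition key onto a ring of nodes and keeps copies on the next few nodes around the ring, so a row stays reachable when one of its replicas goes down.

```python
import bisect
import hashlib


def token(value: str) -> int:
    """Map a string to a ring position (MD5 is only a stand-in hash here)."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


def replicas_for(partition_key: str, nodes: list[str], replication_factor: int) -> list[str]:
    """Return the hypothetical nodes that would hold copies of this partition."""
    ring = sorted((token(n), n) for n in nodes)
    tokens = [t for t, _ in ring]
    # First node clockwise from the key's position, wrapping around the ring.
    start = bisect.bisect_right(tokens, token(partition_key)) % len(ring)
    count = min(replication_factor, len(ring))
    return [ring[(start + i) % len(ring)][1] for i in range(count)]


nodes = ["node1", "node2", "node3", "node4"]  # illustrative node names
owners = replicas_for("product-42", nodes, replication_factor=3)
print(owners)

# If the first replica crashes, the other copies can still serve the row.
survivors = [n for n in owners if n != owners[0]]
print(survivors)
```

With a replication factor of 3, losing any single node still leaves two copies of every partition in this toy model.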


What's included in this course:

  • The Cassandra Cluster Manager (CCM) to set up and manage your cluster
  • The Cassandra Query Language (CQL) to create keyspaces and column families, perform CRUD operations on column families, and carry out other administrative tasks
  • Designing primary keys and secondary indexes, partitioning and clustering keys
  • Restrictions on queries based on primary and secondary key design
  • Tunable consistency using quorum and local quorum. Read and write consistency in a node
  • Architecture and Storage components: Commit Log, MemTable, SSTables, Bloom Filters, Index File, Summary File and Data File
  • A real-world project: a miniature catalog management system built with the Cassandra Java driver
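
The arithmetic behind tunable consistency with quorums can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `quorum` and `overlaps` helpers are hypothetical names, not part of any Cassandra API. A quorum is a majority of the replicas, and a read is guaranteed to touch at least one replica that saw the latest write when the read and write replica counts together exceed the replication factor. For local quorum, the same majority rule is applied to the replication factor of the local datacenter only.

```python
def quorum(replication_factor: int) -> int:
    """Majority of replicas: floor(RF / 2) + 1."""
    return replication_factor // 2 + 1


def overlaps(read_replicas: int, write_replicas: int, replication_factor: int) -> bool:
    """True when every read set must intersect every write set (R + W > RF)."""
    return read_replicas + write_replicas > replication_factor


rf = 3
q = quorum(rf)
print(q)                    # 2
print(overlaps(q, q, rf))   # True: quorum reads and quorum writes overlap
print(overlaps(1, 1, rf))   # False: reading and writing one replica may miss
```

The trade-off is that higher read or write counts give stronger guarantees but need more replicas to be up and responding.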

Who this course is for:

  • Yup! Engineers and analysts who understand traditional, relational databases and want to move to big data storage systems
  • Nope! Students who are just starting to learn about databases and have no prior experience with one