Taught by a team that includes 2 Stanford-educated ex-Googlers and 2 ex-Flipkart Lead Analysts. This team has decades of practical experience in large-scale data processing.
Has your data gotten huge, unwieldy and hard to manage with a traditional database? Is your data unstructured with an expanding list of attributes? Do you want to ensure your data is always available even with server crashes? Look beyond Hadoop - the Cassandra distributed database is the solution to your problems.
Let's parse that.
What's included in this course:
Using discussion forums
Please use the discussion forums on this course to engage with other students and to help each other out. Unfortunately, much as we would like to, it is not possible for us at Loonycorn to respond to individual questions from students :-(
We're super small and self-funded with only 2 people developing technical video content. Our mission is to make high-quality courses available at super low prices.
The only way to keep our prices this low is to *NOT offer additional technical support over email or in person*. The truth is, direct support is hugely expensive and just does not scale.
We understand that this is not ideal and that a lot of students might benefit from this additional support. Hiring resources for additional support would make our offering much more expensive, thus defeating our original purpose.
It is a hard trade-off.
Thank you for your patience and understanding!
Cassandra manages huge datasets using its columnar layout, which is more efficient and saves space.
What are the requirements of a product catalog system, and why do we need a distributed, columnar, decentralized database to manage it?
What use cases does Cassandra work with? When would you use Cassandra over other databases?
How does Cassandra stack up against HBase? HBase is the columnar store available in the Hadoop ecosystem.
Install and set up Cassandra on your machine.
If you are unfamiliar with software that requires working in a shell/command line environment, this video will be helpful for you. It explains how to update the PATH environment variable, which is needed to set up most Linux/Mac shell-based software.
All the configuration options available on a column family.
Collections and counters allow you to store rich data in your column family.
Primary keys are made up of partition and clustering keys. Partition keys determine how data is distributed across a cluster.
Primary keys are made up of partition and clustering keys. Clustering keys determine how data is laid out on a single node.
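To make the two roles concrete, here is a minimal Python sketch of the idea, not of Cassandra itself. It assumes a hypothetical three-node cluster and uses a simple hash-modulo rule as a stand-in for Cassandra's token ring (the real default partitioner is Murmur3 and assigns token ranges to nodes). The names `NODES`, `node_for`, `insert` and `read_partition` are made up for illustration.

```python
import hashlib
from bisect import insort
from collections import defaultdict

# Hypothetical three-node cluster (names are illustrative only).
NODES = ["node-a", "node-b", "node-c"]

def node_for(partition_key: str) -> str:
    """Map a partition key to a node via a hash.

    Simplified stand-in for Cassandra's token ring: the same key
    always lands on the same node.
    """
    digest = hashlib.sha256(partition_key.encode("utf-8")).digest()
    token = int.from_bytes(digest[:8], "big")
    return NODES[token % len(NODES)]

# node -> partition key -> rows, kept sorted by clustering key
storage = defaultdict(lambda: defaultdict(list))

def insert(partition_key: str, clustering_key: str, value: str) -> None:
    """Store a row on the node chosen by the partition key,
    ordered within the partition by the clustering key."""
    rows = storage[node_for(partition_key)][partition_key]
    insort(rows, (clustering_key, value))

def read_partition(partition_key: str):
    """Return all rows of one partition, already in clustering order."""
    return storage[node_for(partition_key)][partition_key]

# Rows arrive out of order but are stored sorted by clustering key.
insert("shoes", "2024-03-01", "running shoe")
insert("shoes", "2024-01-15", "sandal")
insert("phones", "2024-02-10", "smartphone")

for row in read_partition("shoes"):
    print(row)
```

The partition key ("shoes") decides which node holds the data; the clustering key (the date string) decides the order of rows inside that partition, so a read of one partition comes back sorted without extra work.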
The design of partition keys determines what queries are valid in your cluster. See the restrictions on queries based on partition keys.
The design of clustering keys determines what queries are valid in your cluster. See the restrictions on queries based on clustering keys.
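As a rough illustration of these restrictions, the Python sketch below checks a query's WHERE clause against a table's key design. It is a deliberately simplified model under stated assumptions: it ignores `IN`, `ALLOW FILTERING`, secondary indexes and other features that relax the rules in real Cassandra. The function name `is_valid_query` and the `category`/`brand`/`price` columns are hypothetical.

```python
def is_valid_query(partition_keys, clustering_keys, restrictions):
    """Return True if the restrictions fit this simplified rule set.

    restrictions maps a column name to an operator: "=" for equality,
    or a range operator such as ">", "<", ">=", "<=".
    """
    # Rule 1: every partition key column must be restricted by equality,
    # otherwise the query cannot be routed to a single partition.
    for col in partition_keys:
        if restrictions.get(col) != "=":
            return False

    # Rule 2: clustering columns must be restricted in declared order,
    # with no gaps, and nothing may follow a range restriction.
    seen_gap = False
    seen_range = False
    for col in clustering_keys:
        op = restrictions.get(col)
        if op is None:
            seen_gap = True
            continue
        if seen_gap or seen_range:
            return False
        if op != "=":
            seen_range = True

    # Rule 3: in this simplified model, non-key columns cannot be restricted.
    known = set(partition_keys) | set(clustering_keys)
    return all(col in known for col in restrictions)


# Assumed table design: PRIMARY KEY ((category), brand, price)
pk, ck = ["category"], ["brand", "price"]

print(is_valid_query(pk, ck, {"category": "="}))
print(is_valid_query(pk, ck, {"category": "=", "brand": "=", "price": ">"}))
print(is_valid_query(pk, ck, {"category": "=", "price": ">"}))  # skips brand
print(is_valid_query(pk, ck, {"brand": "="}))  # no partition key
```

The point of the sketch is that the key design is chosen around the queries you plan to run: a query that skips the partition key, or skips an earlier clustering column, does not match how the data is laid out.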
Allow querying on additional columns by enabling secondary indexes. There are trade-offs to using them, though!
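One of those trade-offs can be sketched in a few lines of Python. The sketch assumes, as in Cassandra, that each node indexes only its own rows, so a query on an indexed column that does not also specify a partition key has to ask every node. The cluster contents and the names `NODE_ROWS`, `build_local_index` and `query_by_indexed_column` are made up for illustration.

```python
# Hypothetical cluster: each node holds some product rows.
NODE_ROWS = {
    "node-a": [{"product_id": 1, "color": "red"}, {"product_id": 4, "color": "blue"}],
    "node-b": [{"product_id": 2, "color": "red"}],
    "node-c": [{"product_id": 3, "color": "green"}],
}

def build_local_index(rows, column):
    """Build an index over one node's own rows only."""
    index = {}
    for row in rows:
        index.setdefault(row[column], []).append(row)
    return index

# Each node maintains its own index on the "color" column.
LOCAL_INDEXES = {
    node: build_local_index(rows, "color") for node, rows in NODE_ROWS.items()
}

def query_by_indexed_column(value):
    """Look up rows by the indexed column without a partition key.

    Returns the matching rows and how many nodes had to be contacted.
    """
    matches = []
    nodes_contacted = 0
    for node, index in LOCAL_INDEXES.items():
        nodes_contacted += 1  # every node must be asked
        matches.extend(index.get(value, []))
    return matches, nodes_contacted

rows, contacted = query_by_indexed_column("red")
print(sorted(r["product_id"] for r in rows), "nodes contacted:", contacted)
```

A lookup by partition key goes to the node (and its replicas) that owns that partition; the indexed lookup above touches all three nodes, and that cost grows with the size of the cluster.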
Loonycorn is us, Janani Ravi and Vitthal Srinivasan. Between us, we have studied at Stanford, been admitted to IIM Ahmedabad and have spent years working in tech, in the Bay Area, New York, Singapore and Bangalore.
Janani: 7 years at Google (New York, Singapore); Studied at Stanford; also worked at Flipkart and Microsoft
Vitthal: Also Google (Singapore) and studied at Stanford; Flipkart, Credit Suisse and INSEAD too
We think we might have hit upon a neat way of teaching complicated tech courses in a funny, practical, engaging way, which is why we are so excited to be here on Udemy!
We hope you will try our offerings, and think you'll like them :-)