0102 Understanding What Cassandra Is

Infinite Skills
A free video tutorial from Infinite Skills

Getting Started With Apache Cassandra

Learn everything you need to know to use Apache Cassandra

08:08:18 of on-demand video • Updated January 2017

  • Gain a solid understanding of working with Apache Cassandra
So what is Cassandra? Cassandra is open source; it's an Apache project. Cassandra is a NoSQL database technology. You may or may not have heard of NoSQL. It's a relatively new expression for most people. It refers to "not only SQL," meaning an alternative to the traditional relational database technologies such as Oracle, MySQL, and SQL Server.
Apache Cassandra is a distributed database technology as well. Instead of the database only being able to live on one server, it can actually be spread across multiple servers. This is huge. This allows a database to grow almost infinitely, because it's no longer restricted to having to fit all on one server. Cassandra is a big data technology which provides massive scalability. I'm aware of some instances of Cassandra where the cluster is a thousand nodes: a thousand servers that make up a cluster that the database is spread across. That's a lot of data.
Apache Cassandra is commonly used to create a database that is spread across nodes in more than one data center for high availability. Not only can your database live across multiple servers, but those servers can be spread across more than one data center. This is a reason that people often choose Cassandra over other database technologies: they want to make sure that if one of their data centers goes down, another one can continue to serve requests so that their application is always available.
And finally, if some of this seems familiar to you, it could be that you are familiar with Amazon Dynamo, which is a distributed database technology, or Google's Bigtable, which was created for storing huge amounts of data. Cassandra is based on both of these technologies.
Let's go ahead and take a look at the Apache Cassandra home page so we can learn a little bit more about Cassandra. If you'd like to follow along with me, you can go ahead and open up a browser window to go to the home page. You can enter cassandra.apache.org.
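To make the multi-data-center idea a bit more concrete, here is a minimal Python sketch. It is a toy model, not Cassandra's actual placement logic: the node names, the `CLUSTER` layout, the `REPLICAS_PER_DC` setting, and the hashing scheme are all assumptions made up for illustration. It only shows why keeping copies of each row in more than one data center lets reads continue when one data center is lost.

```python
from __future__ import annotations

import hashlib

# Hypothetical cluster layout: two data centers with three nodes each.
CLUSTER = {
    "dc1": ["dc1-node1", "dc1-node2", "dc1-node3"],
    "dc2": ["dc2-node1", "dc2-node2", "dc2-node3"],
}
REPLICAS_PER_DC = 2  # assumed number of copies kept in every data center


def token(key: str) -> int:
    """Map a key to a stable integer (a toy stand-in for a partitioner)."""
    return int(hashlib.md5(key.encode("utf-8")).hexdigest(), 16)


def replicas_for(key: str) -> list[str]:
    """Pick REPLICAS_PER_DC consecutive nodes in every data center."""
    chosen = []
    for nodes in CLUSTER.values():
        start = token(key) % len(nodes)
        for i in range(REPLICAS_PER_DC):
            chosen.append(nodes[(start + i) % len(nodes)])
    return chosen


def live_replicas(key: str, down: set[str]) -> list[str]:
    """Return the replicas of a key that are still reachable."""
    return [node for node in replicas_for(key) if node not in down]


if __name__ == "__main__":
    keys = [f"user:{i}" for i in range(1000)]
    dc1_outage = set(CLUSTER["dc1"])  # simulate losing all of dc1
    still_served = sum(1 for k in keys if live_replicas(k, dc1_outage))
    print(f"{still_served} of {len(keys)} keys still readable with dc1 down")
```

Because every key has copies in both data centers in this model, taking all of `dc1` offline still leaves each key with live replicas in `dc2`.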
This is the home page for Apache Cassandra. Here it mentions linear scalability, which, like it sounds, is the case with Cassandra. You can add more and more nodes to the cluster and it will still be highly performant. There's not a degradation of performance as you're adding additional nodes.
Fault tolerance: each node, each server, in a Cassandra cluster has the same functionality as the others. You don't have to worry about a master node going down and the whole thing ceasing to work, because Cassandra doesn't work that way. Each node has the same capabilities as the others, which makes it easy to add additional nodes and means there is not a single point of failure.
Also here we see commodity hardware. That's another great selling point: you do not have to buy specialized servers in order to run Cassandra. You can use commodity hardware.
We're going to go ahead and scroll down a little bit here. There's a nice section down here. It talks about fault tolerance, which we were just talking about. Performant: Cassandra is highly performant, and it's for creating real-time applications. It's not to be confused with analytic technologies like Hadoop, which run at night, in batches. Those sorts of applications are not designed for real time. The distinction is between a real-time, highly available technology like Cassandra and a batch-process, analytic type of technology. Decentralized: as we mentioned, no single points of failure. Durable is the last one. Highly durable: as we were talking about, you could have an entire data center go down and your database could still go on.
So this movie was about getting you started with what Cassandra is: it's a highly scalable, fault-tolerant, distributed database technology that provides high availability.
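The scalability point from the home page can also be illustrated with a small Python sketch. This is a simplified model under stated assumptions, not how Cassandra assigns data: the `owner` and `keys_per_node` helpers are hypothetical, and plain modulo placement is used only because it is easy to read. It shows that when rows are spread evenly over peer nodes, each node's share shrinks roughly in proportion as nodes are added.

```python
import hashlib
from collections import Counter


def owner(key: str, node_count: int) -> int:
    """Assign a key to one of node_count equal peers (toy modulo placement)."""
    digest = hashlib.md5(key.encode("utf-8")).hexdigest()
    return int(digest, 16) % node_count


def keys_per_node(keys, node_count: int) -> list:
    """Count how many keys land on each node."""
    counts = Counter(owner(k, node_count) for k in keys)
    return [counts.get(n, 0) for n in range(node_count)]


if __name__ == "__main__":
    keys = [f"row:{i}" for i in range(12000)]
    for node_count in (3, 6, 12):
        load = keys_per_node(keys, node_count)
        print(f"{node_count} nodes -> busiest node holds {max(load)} keys")
```

Note that no node in this model is special: every node answers for its own share of the keys, which mirrors the "no master node" point made above.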