vSAN Basic Architecture

Rick Crisci
A free video tutorial from Rick Crisci
VMware Certified Instructor, Virtualization Consultant
4.5 instructor rating • 33 courses • 143,864 students

Lecture description

Understand the basic concepts and networking requirements of an ESXi host cluster. Learn about the vSAN VMkernel port, and what it is used for. Understand how vSAN objects are mirrored across multiple hosts for redundancy. Learn about the Hybrid and All-Flash vSAN architectures.

Learn more from the full course

Clear and Simple VMware vSAN 6.7 (Virtual SAN)

Configure, manage, troubleshoot, and optimize vSAN in your VMware vSphere environment

03:07:25 of on-demand video • Updated December 2020

  • Configure, Monitor, Optimize, and Design VMware vSAN deployments
  • Create shared storage for vSphere Clusters using the local capacity of ESXi hosts
English [Auto]

In this video, I'll walk you through some of the basic architecture of vSAN, and we'll start with the most basic building block: the host cluster. A cluster is simply a logical grouping of ESXi hosts. So let's say you have a group of ESXi hosts and you want to allow virtual machines to automatically fail over to another host if their host fails; that's High Availability, and we have to create a cluster in order to enable High Availability. We also have to create a cluster in order to enable DRS, which automatically vMotions virtual machines from host to host for load-balancing purposes. Those are a couple of features that require a host cluster, and another feature that requires one is vSAN. So step one of setting up vSAN is to create an ESXi host cluster; that's the very first step in our process.

Now, that being said, there are some prerequisites. We have to be at the right version of vSphere, we have to have the right version of vCenter, and we also have to have supported hardware. We also need to set up some VMkernel ports. On each of these ESXi hosts, you can see we've got a couple of things going for us. Let's focus on ESXi01 for a moment. ESXi01 has two vmnics, and a vmnic is a physical Ethernet port on the ESXi host. So this host has two physical Ethernet adapters; let's say they're 10 gigabit per second Ethernet adapters, and each one of those physical adapters is connected to a different physical switch. You can say the same thing about host ESXi02 and about ESXi03, so all three hosts have this in common: they have two physical 10-gig vmnics, and each of those vmnics is connected to a different physical switch.

Then, on each of these ESXi hosts, we have also created a VMkernel port, and we have tagged that VMkernel port for vSAN traffic. If you're not really familiar with VMkernel ports, what this basically means is that we've created this little port, we've given it an IP address, and we've said, hey, if there is traffic related to vSAN, if a virtual machine needs to transmit vSAN traffic from host to host, use this VMkernel port. We have to have that network under the surface in order for vSAN to work properly, and we'll see it in action here in a couple of slides.

One final thing that I want to note about the network I've shown you here: there are a couple of design best practices that I have incorporated. Number one, I've got physical redundancy. If either of these switches fails, there is still another switch up and running that can be used to pass all of the necessary traffic. Number two, I've got nothing else connected to these switches; this is a dedicated physical network, specifically just for vSAN traffic.

OK, so how are my virtual machine objects actually stored, and how do these VMkernel ports come into the picture? Here we see VM1, one of my virtual machines that is stored on vSAN, and as VM1 has reads or writes that need to be executed, they are going to be pushed over the physical network, using this vSAN VMkernel port, to the appropriate destination host. Here we can see the active VMDK for this particular virtual machine, and there's also going to be another copy of that VMDK over here. This is a mirror copy, just in case the primary copy is on a host that fails.
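If you'd like to see roughly what those setup steps look like when automated, here is a rough pyVmomi (Python) sketch of tagging an existing VMkernel adapter for vSAN traffic and then enabling vSAN on a cluster. The vCenter address, credentials, cluster name, and the vmk1 device name are placeholders for your own environment, and the exact options can vary between vSphere versions, so treat this as a starting point rather than a finished script.

```python
# Rough pyVmomi sketch: tag vmk1 for vSAN traffic on each host in a cluster,
# then enable vSAN on that cluster. Hostnames, credentials, and device names
# below are placeholders, not values from this course.
import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVmomi import vim

si = SmartConnect(host="vcenter.lab.local", user="administrator@vsphere.local",
                  pwd="password", sslContext=ssl._create_unverified_context())
content = si.RetrieveContent()

# Find the cluster created in step one (placeholder name "vSAN-Cluster").
view = content.viewManager.CreateContainerView(
    content.rootFolder, [vim.ClusterComputeResource], True)
cluster = next(c for c in view.view if c.name == "vSAN-Cluster")

# Tag an existing VMkernel adapter (vmk1 here) for vSAN traffic on every host.
for host in cluster.host:
    host.configManager.virtualNicManager.SelectVnicForNicType("vsan", "vmk1")

# Enable vSAN on the cluster; automatic disk claiming is just one possible mode.
vsan_cfg = vim.vsan.cluster.ConfigInfo(
    enabled=True,
    defaultConfig=vim.vsan.cluster.ConfigInfo.HostDefaultInfo(autoClaimStorage=True))
spec = vim.cluster.ConfigSpecEx(vsanConfig=vsan_cfg)
task = cluster.ReconfigureComputeResource_Task(spec, True)  # modify=True

Disconnect(si)
```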
So the vSAN VMkernel port is there to handle all of the traffic that's going to flow over this vSAN network. The virtual machine is running on one host and its virtual disk is on another host, so when it wants to read from and write to that virtual disk, we're going to leverage a VMkernel port to push that traffic over the network. And hopefully, what will end up happening is that the majority of the read operations will be satisfied by our flash capacity.

What we see here is something called a hybrid configuration. We're going to have a lesson that breaks down the difference between hybrid and all-flash, but for the moment we're focused strictly on what we call the hybrid configuration. So what does that mean? Well, on each of these ESXi hosts we have some traditional magnetic storage devices; these are what we call our capacity devices. We've got traditional hard disks, and then we've also got a cache here, which is an SSD, and the SSD is a lot faster than the traditional hard disks. So on each of these hosts I've got these big capacity devices, these hard disks that are going to store a whole lot of data, and then sitting in front of them I've got this cache, an SSD, which is much faster and more expensive.

Now let's look at what happens when virtual machine one wants to read some data from its virtual disk. The VMkernel port is used to push that read over the physical network, and it eventually hits the destination host where the active VMDK resides. And look what's happening: it's hitting this SSD on host ESXi02, and you'll notice it's happening very quickly. This read is happening very fast; it's hitting the SSD, and the SSD is acting as a read cache. The purpose of the read cache is to store the most frequently read data on SSD. Seventy percent of this SSD is going to be dedicated to read cache, and a copy of all of the most frequently read data is going to be located in that SSD. There's also a copy of that same data, along with a whole lot of other data, here on the capacity device. But the hope is that when data is read from the VMDK, most of the time it will get read from that SSD, because it's so fast. If the data is not present on the SSD, that's what we call a cache miss, and you can see this read operation is happening much more slowly. The virtual machine needed some data that was not present in the read cache, so the data had to get served up by the capacity device. In the hybrid configuration, our capacity device is a hard disk, and so this read is going to be much slower than a read from SSD.

How about writes? We've been talking about reads so far; what if my virtual machine needs to write some data to disk? Well, here's the first thing we have to consider: there are multiple copies of this VMDK. This virtual machine has one copy of the VMDK on ESXi02, but we have to prepare for the possibility that ESXi02 could fail. So in this case, another copy of that VMDK is being mirrored to ESXi03, and that way, if ESXi02 fails, my virtual machine's data is not lost. So when the write occurs, here's what's going to happen: the write is going to be sent to both of those ESXi hosts. It's going to be mirrored. If you're familiar with RAID, this is very similar to the way that writes are mirrored across a RAID array. One copy of the data is sent to each of those ESXi hosts.
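To make that read path and the mirrored writes concrete, here is a small, purely illustrative Python model (not vSAN code): reads that hit the SSD read cache return quickly, reads that miss fall through to the slower magnetic capacity tier, and every write is mirrored to two hosts so either copy survives a host failure. The latency numbers and block names are invented for illustration.

```python
# Purely illustrative model of a hybrid read path and mirrored writes.
# Latencies are made-up numbers, just to show the cache-hit vs. cache-miss idea.

class HybridDiskGroup:
    SSD_LATENCY_MS = 0.2    # read served from the SSD read cache (cache hit)
    HDD_LATENCY_MS = 8.0    # read served from the magnetic capacity tier (cache miss)

    def __init__(self):
        self.read_cache = {}   # hottest blocks kept on SSD (~70% of the cache device)
        self.capacity = {}     # full copy of the data on hard disks

    def write(self, block, data):
        self.capacity[block] = data

    def read(self, block):
        if block in self.read_cache:                  # cache hit: fast SSD read
            return self.read_cache[block], self.SSD_LATENCY_MS
        data = self.capacity[block]                   # cache miss: slow HDD read
        self.read_cache[block] = data                 # keep it hot for next time
        return data, self.HDD_LATENCY_MS

# Two hosts hold mirror copies of the same VMDK object.
esxi02, esxi03 = HybridDiskGroup(), HybridDiskGroup()

def mirrored_write(block, data):
    # Like RAID-1 mirroring: the write goes to both hosts, so either copy survives.
    for replica in (esxi02, esxi03):
        replica.write(block, data)

mirrored_write("block-42", b"guest OS data")
print(esxi02.read("block-42"))   # first read: cache miss, served from HDD
print(esxi02.read("block-42"))   # second read: cache hit, served from SSD
```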
That way, they both always have a current version of that virtual machine's VMDK, just in case one of the hosts fails. And the other thing that you may notice here is this: the write is going to hit the SSD first. That's what we call the write buffer. Any time these virtual machines that are on vSAN need to write some data, the writes are carried out against the write buffer; 30 percent of my SSD is dedicated to being a write buffer. I sort of equate this to checking a book back into the library. If I want to return a book to the library, I can just walk in, drop it on the front desk, and I'm done. The librarian is going to take that book and reshelve it; they're going to do the hard work, the time-consuming work. My experience is that I simply drop it on the desk and walk away; it's very quick for me. It's the same thing with this write operation: when the virtual machine needs to write some data to its VMDK, it's going to be written to the write buffer, and that's going to happen very quickly. So from the perspective of the virtual machine, once the write hits the write buffer, it's done. Then, on the back end, the data is actually written from the write buffer to the capacity device. So to our virtual machines it always feels like they're writing to SSD, the write speeds are always really quick, and then after the fact, vSAN handles getting that object written from the SSD to the capacity tier.

OK, so in review: vSAN can only be enabled on a cluster of ESXi hosts, and each one of those hosts has to have a VMkernel port that is marked for vSAN traffic. All of our vSAN reads and writes are going to flow over that VMkernel network. Virtual machine objects are striped and mirrored across hosts just in case we have a host failure, and read caches and write buffers are used to improve performance. Then, on the back end, we have the actual capacity devices.
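And here is the same kind of toy sketch for the write buffer, the "drop the book at the front desk" analogy: the write is acknowledged to the VM as soon as it lands in the SSD write buffer, and a background step destages it to the capacity device later. Again, this is a conceptual Python illustration of the idea, not how vSAN is actually implemented.

```python
# Conceptual illustration of a write buffer: acknowledge the write as soon as it
# hits the fast SSD buffer, then destage it to the slower capacity tier later.
from collections import deque

class WriteBufferedDiskGroup:
    def __init__(self):
        self.write_buffer = deque()   # ~30% of the SSD acts as the write buffer
        self.capacity = {}            # hard-disk capacity tier

    def write(self, block, data):
        # From the VM's point of view, the write is done once it is in the buffer.
        self.write_buffer.append((block, data))
        return "acknowledged"

    def destage(self):
        # Handled in the background by the storage layer, not by the VM.
        while self.write_buffer:
            block, data = self.write_buffer.popleft()
            self.capacity[block] = data

dg = WriteBufferedDiskGroup()
print(dg.write("block-7", b"new data"))   # fast: returns as soon as the buffer has it
dg.destage()                              # later: data lands on the capacity device
print(dg.capacity["block-7"])
```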