
Explore linux high availability clustering with pacemaker, deploying active-passive and active-active clusters using KVM labs, shared storage, quorum, resource groups, and other clustering concepts.
Explore Linux high availability clustering and pacemaker components; learn how clusters use active-passive and active-active failover across multiple nodes to keep web services available.
Explore how storage clusters use gfs2 cluster file system to enable simultaneous read and write across nodes, with resource groups including content, floating IP, and httpd configuration in shared storage.
Explore failover in high availability clusters by migrating web services to another node when a node fails, and learn fencing to prevent corruption by rebooting malfunctioning nodes.
Explore shared storage across a four-node cluster, enabling failover as nodes access website content. Understand quorum, a voting rule that preserves cluster integrity and prevents resource start without majority.
Explore the architecture of a four-node Linux high availability cluster, including hardware components, public and private networks, fencing, shared storage, and the Corosync and Pacemaker cluster services.
Establish a Linux high availability cluster using a KVM environment, creating three nodes on a server, configuring KVM, disk images, ISO storage, a bridge network, and passwordless ssh.
Configure a basic kvm-based multi-node cluster (node A, B, C) using tmux for synchronized commands, installing pacemaker, corosync, and fence agents, and enable firewall and pcsd for cluster communication.
Configure a basic Linux high availability cluster by installing packages, creating a cluster user, authenticating nodes, and setting up cluster zero with pcs. Note the missing stonith device for fencing.
Configure fencing in a Linux high availability cluster to isolate unresponsive nodes, prevent data corruption, and enable safe failover using an external fencing agent and stonith, such as fence_xvm.
Install and configure fencing components to enable high availability. Create a secret key, configure fence_wd with multicast, open port 1229 UDP, and deploy fence_xvm stonith agents on all nodes.
Practice configuring and testing fencing in a Linux high availability cluster by using fence_xvm to power cycle nodes A, B, and C, verify status, and confirm automatic node recovery.
Start and stop cluster services on individual nodes or across all nodes, and enable or disable auto-join behavior to manage node readiness in a Pacemaker Corosync cluster.
Learn to add a new node to a running high availability cluster online, without downtime, using pacemaker and fencing agents with firewall configuration and authenticated cluster node addition.
Configure fencing for a newly added node in Linux high availability clustering using pxe stonith and the fence_xvm agent, copying the secret key to the target and verifying the reboot.
Remove a node from a running four-node cluster with pxe cluster node remove and delete its fencing via pxe stonith delete, then use standby commands to suspend resources during maintenance.
Understand quorum operations through majority voting, how many votes keep a cluster online, and how fencing interacts to maintain operational services when nodes fail.
Practice lab demonstrates managing quorum in a four-node Linux high availability cluster using Corosync quorum, showing how votes determine cluster operation as nodes fail and rejoin.
Configure quorum with the wait_for_all option to require all cluster members online before calculating quorum. Demonstrate how this prevents fence race by enabling the switch in a four-node setup.
Illustrate quorum calculation and how auto_tie_breaker keeps a two-node cluster operational with 50% quorum. Demonstrate split brain and how the lowest node ID wins quorum when nodes cannot communicate.
learn to create and configure high-availability resources in a two-node cluster using pacemaker, grouping ip, file system, and web service resources for reliable failover.
Configure a resource group in a three-node cluster to run a clustered Apache web service, using a floating IP, NFS file system, and httpd resource, with ordered start and failover.
Learn to manage cluster resources and resource groups with start, stop, disable, move, and ban operations across nodes A, B, and C using the high availability add-on.
Inspect and configure cluster logging to troubleshoot high availability issues, using corosync and pacemaker logs, adjust log file locations, enable debugging, and restart the cluster to apply changes.
Troubleshoot resource failures by checking logs, fail counts, and configuration files, then use debug mode and update the resource to restore service.
Troubleshoot cluster network issues in a three-node Linux high availability cluster by pinging the offline node, verifying SSH, and inspecting Corosync quorum and logs to restore connectivity.
Configure an active/passive NFS resource group in a three-node pacemaker cluster with shared iSCSI storage, exporting a ZFS file system to clients and ensuring seamless failover.
Configure an active-passive NFS resource group on a three-node cluster to share iSCSI target storage, create a file system, export it via NFS, and enable failover with a floating IP.
Learn to set order constraints to define start sequence, location constraints to assign resources to specific nodes, and co-location constraints to control same-node placement for resource groups.
Learn to configure order and location constraints in a two-node cluster, guiding startup order and node preferences for resources like NFS and web server, using constraint order and constraint location.
Set location constraints and resource stickiness in linux high availability clustering to keep the web server resource group on node A during cluster restarts and avoid outages.
Explore co-location constraints in pacemaker, forcing or preventing two resources or resource groups from running on the same node, using A with B and minus infinity to never co-locate.
Explore two-node cluster challenges and how quorum prevents node failure, using the two_node_node setting to reduce required votes so a failed node doesn't crash the cluster.
Split brain in a two-node Linux cluster happens when nodes lose communication, each thinks it is online and activates shared resources, risking data corruption; fencing prevents by rebooting peers.
Learn how fencing and fence racing prevent split brain in two-node Linux high availability clusters. Apply delayed fencing to break the race and keep the web service on one node.
Explains that in a two-node cluster, the system waits for both nodes to start before forming quorum, enabling single-node operation if one node is down, via two_node and wait_for_all settings.
Configure and access storage on a Linux server using iSCSI targets and initiators over IP networks, sharing storage via LUNs and SCSI commands.
Learn the core iSCSI fundamentals, including initiators and targets, iqn naming, and access control with ACLs, plus discovery and login steps, all within the ESXi context.
Explore iSCSI component terminology, including iSCSI qualified names, initiators and targets, LUNs, login and discovery, and portal concepts, and learn how to configure ESXi iSCSI targets and initiators.
Log into the target server, install the target CLI package, and manage target services with the Stargate Seela utility to configure megastores, block, file, and ramdisk storage objects.
Create and expose a block store with iSCSI on the target, define a lun and acl, and configure the client initiator to access the shared lun.
Configure an iSCSI target server and connect a client by performing discovery, scanning LUNs, logging in, and verifying shared storage with partitioning and systemctl-managed services.
Learn how to manage high-availability logical volumes with LVM and clustered LVM on shared storage, configure LVM and iSCSI targets, and understand active-active versus active-passive setups.
Discover how to share a disk across cluster nodes using iSCSI. Create a storage object from /dev/sdc, attach a lun, configure ACLs, verify wwn across nodes, and prep for lvm.
Configure a vm for linux high availability clustering using an iSCSI shared disk, create a physical volume, cluster vg, and a 500 MB cluster lv with zfs, enabling cross-node activation.
Configure lvm resources in a clustering environment by creating an lvm resource for a vg, then a zfs file system resource mounted at a mount point, validating failover across nodes.
Clustered LVM provides active-active access to shared storage across all nodes. It uses gfs2; ext4 and xfs are not supported, with VMD and DLM propagating metadata to prevent corruption.
Configure a clustered LVM in a three-node high-availability cluster using the distributed lock manager and CLVM, install LVM cluster packages, and create DLM and kvmd resources.
Create a volume group from a shared disk across nodes using a physical volume, then create a logical volume. Show that the vg and lv reflect on all nodes.
This course helps you to understand high availability clustering by providing a strong concept and hands-on experience with Pacemaker and Corosync components of Red Hat / CentOS Enterprise High Availability Cluster.
Lab Design : KVM based lab setup in this course module.
Subtitle is not added in this course module.
Course Content:
Introduction
What is clustering & cluster types?
Advantages of Clustering Servers
Concepts and techniques
Resource and resource groups
Failover, Fencing, Shared Storage, Quorum
Cluster Architecture
Lab Setup using KVM Environment
Configuring a Basic Cluster
Configuring a fencing agent using KVM host mahcine
Troubleshooting fencing device
Managing Cluster Nodes
Starting & Stopping Cluster Services
Enabling & Disabling the Cluster Services
Adding & Removing A Cluster Node
The Standby & unstandby Nodes
Quorum Operations
Lab Session on quorum
Managing Quorum Calculations
Cluster Setup Switches
1) wait_for_all
2) auto_tie_breaker
Creating and Configuring Resources
Create and configure high-availability resources.
Creating a clustered Apache service
Managing Resources
Troubleshooting High-Availability Cluster
Inspect and configure cluster logging
Troubleshooting resource failures
Troubleshooting cluster network issues
Complex Resource Group
Configuring an Active/Passive NFS Resource Group
Lab Session
Managing Constraints
Types of constraints:
Order, Location & Colocation Constraint
Practice Lab Session
Two Node Cluster Issues
No room for node failure
Split Brain
Fence death/fence racing
The cluster does not start until both nodes have started.
Practice Lab Session
Managing iSCSI Initiators
iSCSI fundamentals
Configuring an iSCSI Server
Several types of backing Storage
block, fileio, pscsi & ramdisk
Creating iSCSI Target
Lab Session to create a block backstore from the targetcli shell
Managing High Availability Logical Volumes
Clustered LVM & HA-LVM
Lab Session to shared a disk (lun) to all cluster nodes
Practice Lab Session on HA-LVM
Managing Clustered Logical Volumes
Active/Active configuration of logical volumes
Distributed Lock Manager (DLM) for lock management
clvmd daemon
Practice Lab Session
Global File System 2 (GFS2)
GFS2 concepts
Creating a GFS2 formatted Cluster File Systems
Managing a GFS2 File System
Managing a GFS2 Resource in the cluster
Growing & Repairing a GFS2 File System