Teach on Udemy

Turn what you know into an opportunity and reach millions around the world.

Learn More

Your cart is empty.

Keep shopping

Linux High Availability Clustering

Name: Linux High Availability Clustering
Rating: 4.6 (1441 reviews)

Design and deploy a high availability cluster to provide active/passive or active/active services using HA Pacemaker

Created byShikhar Verma

Last updated 8/2024

English

What you'll learn

Students should understand HA cluster concepts and how to manage different nodes, start/stop services
Creating High-Availability Clusters
Managing Cluster Nodes and Quorum
Managing Fencing
Creating and Configuring Resources
Managing Constraints
About Order, Location & Colocation Constraints
Troubleshooting High-Availability Clusters
Controlling Complex Resource groups
Managing Two Node Clusters
Concept about Split Brain, Fence death/fence racing
Managing iSCSI Initiators
Configuring & Managing High-Availability Logical Volumes
Managing Clustered Logical Volumes
GFS2 Concepts
Creating a GFS2 formatted Cluster File Systems
Growing & Repairing a GFS2 File System

Course content

11 sections • 56 lectures • 6h 39m total length

Overview about the course module8:41
Explore linux high availability clustering with pacemaker, deploying active-passive and active-active clusters using KVM labs, shared storage, quorum, resource groups, and other clustering concepts.
Introduction7:25
Explore Linux high availability clustering and pacemaker components; learn how clusters use active-passive and active-active failover across multiple nodes to keep web services available.
Resource & Resource Group6:43
Explore how storage clusters use gfs2 cluster file system to enable simultaneous read and write across nodes, with resource groups including content, floating IP, and httpd configuration in shared storage.
About failover, fencing4:00
Explore failover in high availability clusters by migrating web services to another node when a node fails, and learn fencing to prevent corruption by rebooting malfunctioning nodes.
About shared storage & quorum3:33
Explore shared storage across a four-node cluster, enabling failover as nodes access website content. Understand quorum, a voting rule that preserves cluster integrity and prevents resource start without majority.
Cluster Architecture6:23
Explore the architecture of a four-node Linux high availability cluster, including hardware components, public and private networks, fencing, shared storage, and the Corosync and Pacemaker cluster services.
Lab Setup using KVM Environment12:50
Establish a Linux high availability cluster using a KVM environment, creating three nodes on a server, configuring KVM, disk images, ISO storage, a bridge network, and passwordless ssh.
Configuring a Basic Cluster - lab session 17:31
Configure a basic kvm-based multi-node cluster (node A, B, C) using tmux for synchronized commands, installing pacemaker, corosync, and fence agents, and enable firewall and pcsd for cluster communication.
Configuring a Basic Cluster - lab session -24:57
Configure a basic Linux high availability cluster by installing packages, creating a cluster user, authenticating nodes, and setting up cluster zero with pcs. Note the missing stonith device for fencing.
Fencing configuration4:17
Configure fencing in a Linux high availability cluster to isolate unresponsive nodes, prevent data corruption, and enable safe failover using an external fencing agent and stonith, such as fence_xvm.
Lab Session - fencing device10:35
Install and configure fencing components to enable high availability. Create a secret key, configure fence_wd with multicast, open port 1229 UDP, and deploy fence_xvm stonith agents on all nodes.
Troubleshooting fencing device8:04
Practice configuring and testing fencing in a Linux high availability cluster by using fence_xvm to power cycle nodes A, B, and C, verify status, and confirm automatic node recovery.
Quiz

Starting & Stopping Cluster Services8:48
Start and stop cluster services on individual nodes or across all nodes, and enable or disable auto-join behavior to manage node readiness in a Pacemaker Corosync cluster.
Adding & Removing A Cluster Node8:00
Learn to add a new node to a running high availability cluster online, without downtime, using pacemaker and fencing agents with firewall configuration and authenticated cluster node addition.
Fencing Configuration5:09
Configure fencing for a newly added node in Linux high availability clustering using pxe stonith and the fence_xvm agent, copying the secret key to the target and verifying the reboot.
Standby & unstandby Nodes6:07
Remove a node from a running four-node cluster with pxe cluster node remove and delete its fencing via pxe stonith delete, then use standby commands to suspend resources during maintenance.
Quorum Operations7:42
Understand quorum operations through majority voting, how many votes keep a cluster online, and how fencing interacts to maintain operational services when nodes fail.
Practice Lab Session4:47
Practice lab demonstrates managing quorum in a four-node Linux high availability cluster using Corosync quorum, showing how votes determine cluster operation as nodes fail and rejoin.
quorum options "wait_for_all"9:45
Configure quorum with the wait_for_all option to require all cluster members online before calculating quorum. Demonstrate how this prevents fence race by enabling the switch in a four-node setup.
quorum options "auto_tie_breaker"5:45
Illustrate quorum calculation and how auto_tie_breaker keeps a two-node cluster operational with 50% quorum. Demonstrate split brain and how the lowest node ID wins quorum when nodes cannot communicate.
Quiz

Create and configure high-availability resources5:43
learn to create and configure high-availability resources in a two-node cluster using pacemaker, grouping ip, file system, and web service resources for reliable failover.
Practice Lab Session11:38
Creating a clustered Apache service13:32
Configure a resource group in a three-node cluster to run a clustered Apache web service, using a floating IP, NFS file system, and httpd resource, with ordered start and failover.
Managing Resources5:25
Learn to manage cluster resources and resource groups with start, stop, disable, move, and ban operations across nodes A, B, and C using the high availability add-on.
Quiz

Inspect and configure cluster logging11:26
Inspect and configure cluster logging to troubleshoot high availability issues, using corosync and pacemaker logs, adjust log file locations, enable debugging, and restart the cluster to apply changes.
Troubleshooting resource failures10:05
Troubleshoot resource failures by checking logs, fail counts, and configuration files, then use debug mode and update the resource to restore service.
Troubleshooting cluster network issues6:36
Troubleshoot cluster network issues in a three-node Linux high availability cluster by pinging the offline node, verifying SSH, and inspecting Corosync quorum and logs to restore connectivity.

Configuring an Active/Passive NFS Resource Group5:33
Configure an active/passive NFS resource group in a three-node pacemaker cluster with shared iSCSI storage, exporting a ZFS file system to clients and ensuring seamless failover.
Practice Lab Session17:45
Configure an active-passive NFS resource group on a three-node cluster to share iSCSI target storage, create a file system, export it via NFS, and enable failover with a floating IP.

Managing Constraints10:07
Learn to set order constraints to define start sequence, location constraints to assign resources to specific nodes, and co-location constraints to control same-node placement for resource groups.
Order Constraints14:48
Learn to configure order and location constraints in a two-node cluster, guiding startup order and node preferences for resources like NFS and web server, using constraint order and constraint location.
Location Constraints7:00
Set location constraints and resource stickiness in linux high availability clustering to keep the web server resource group on node A during cluster restarts and avoid outages.
Colocation Constraints4:34
Explore co-location constraints in pacemaker, forcing or preventing two resources or resource groups from running on the same node, using A with B and minus infinity to never co-locate.

No room for node failure4:09
Explore two-node cluster challenges and how quorum prevents node failure, using the two_node_node setting to reduce required votes so a failed node doesn't crash the cluster.
Split Brain6:49
Split brain in a two-node Linux cluster happens when nodes lose communication, each thinks it is online and activates shared resources, risking data corruption; fencing prevents by rebooting peers.
Fence death/fence racing3:52
Learn how fencing and fence racing prevent split brain in two-node Linux high availability clusters. Apply delayed fencing to break the race and keep the web service on one node.
The cluster does not start until both nodes have started3:19
Explains that in a two-node cluster, the system waits for both nodes to start before forming quorum, enabling single-node operation if one node is down, via two_node and wait_for_all settings.

About iSCSI targets5:00
Configure and access storage on a Linux server using iSCSI targets and initiators over IP networks, sharing storage via LUNs and SCSI commands.
iSCSI Fundamentals3:33
Learn the core iSCSI fundamentals, including initiators and targets, iqn naming, and access control with ACLs, plus discovery and login steps, all within the ESXi context.
iSCSI Component Terminology3:11
Explore iSCSI component terminology, including iSCSI qualified names, initiators and targets, LUNs, login and discovery, and portal concepts, and learn how to configure ESXi iSCSI targets and initiators.
Practice Lab Session - 17:32
Log into the target server, install the target CLI package, and manage target services with the Stargate Seela utility to configure megastores, block, file, and ramdisk storage objects.
Practice Lab Session - 29:28
Create and expose a block store with iSCSI on the target, define a lun and acl, and configure the client initiator to access the shared lun.
Practice Lab Session - 35:32
Configure an iSCSI target server and connect a client by performing discovery, scanning LUNs, logging in, and verifying shared storage with partitioning and systemctl-managed services.

About Clustered LVM & HA-LVM3:30
Learn how to manage high-availability logical volumes with LVM and clustered LVM on shared storage, configure LVM and iSCSI targets, and understand active-active versus active-passive setups.
Shared a disk(lun) to all clusterd nodes using iSCSI Server9:30
Discover how to share a disk across cluster nodes using iSCSI. Create a storage object from /dev/sdc, attach a lun, configure ACLs, verify wwn across nodes, and prep for lvm.
Configured HA-LVM - 111:18
Configure a vm for linux high availability clustering using an iSCSI shared disk, create a physical volume, cluster vg, and a 500 MB cluster lv with zfs, enabling cross-node activation.
Configured HA-LVM - 27:40
Configure lvm resources in a clustering environment by creating an lvm resource for a vg, then a zfs file system resource mounted at a mount point, validating failover across nodes.

Active/Active configuration of logical volumes5:15
Clustered LVM provides active-active access to shared storage across all nodes. It uses gfs2; ext4 and xfs are not supported, with VMD and DLM propagating metadata to prevent corruption.
Configure a clustered LVM13:46
Configure a clustered LVM in a three-node high-availability cluster using the distributed lock manager and CLVM, install LVM cluster packages, and create DLM and kvmd resources.
Configure a volume group for clustered lvm2:55
Create a volume group from a shared disk across nodes using a physical volume, then create a logical volume. Show that the vg and lv reflect on all nodes.

Requirements

PC or Laptop with internet Connection
Red Hat Certified Engineer (RHCE) certification or equivalent experience.
CentOS 7, RHEL7 ISO Image
Vmware workstation

Description

This course helps you to understand high availability clustering by providing a strong concept and hands-on experience with Pacemaker and Corosync components of Red Hat / CentOS Enterprise High Availability Cluster.

Lab Design : KVM based lab setup in this course module.

Subtitle is not added in this course module.

Course Content:

Introduction

What is clustering & cluster types?
Advantages of Clustering Servers
Concepts and techniques
Resource and resource groups
Failover, Fencing, Shared Storage, Quorum
Cluster Architecture
Lab Setup using KVM Environment
Configuring a Basic Cluster
Configuring a fencing agent using KVM host mahcine
Troubleshooting fencing device

Managing Cluster Nodes

Starting & Stopping Cluster Services
Enabling & Disabling the Cluster Services
Adding & Removing A Cluster Node
The Standby & unstandby Nodes
Quorum Operations
Lab Session on quorum
Managing Quorum Calculations
Cluster Setup Switches
1) wait_for_all
2) auto_tie_breaker

Creating and Configuring Resources

Create and configure high-availability resources.
Creating a clustered Apache service
Managing Resources

Troubleshooting High-Availability Cluster

Inspect and configure cluster logging
Troubleshooting resource failures
Troubleshooting cluster network issues

Complex Resource Group

Configuring an Active/Passive NFS Resource Group
Lab Session

Managing Constraints

Types of constraints:
Order, Location & Colocation Constraint
Practice Lab Session

Two Node Cluster Issues

No room for node failure
Split Brain
Fence death/fence racing
The cluster does not start until both nodes have started.
Practice Lab Session

Managing iSCSI Initiators

iSCSI fundamentals
Configuring an iSCSI Server
Several types of backing Storage
block, fileio, pscsi & ramdisk
Creating iSCSI Target
Lab Session to create a block backstore from the targetcli shell

Managing High Availability Logical Volumes

Clustered LVM & HA-LVM
Lab Session to shared a disk (lun) to all cluster nodes
Practice Lab Session on HA-LVM

Managing Clustered Logical Volumes

Active/Active configuration of logical volumes
Distributed Lock Manager (DLM) for lock management
clvmd daemon
Practice Lab Session

Global File System 2 (GFS2)

GFS2 concepts
Creating a GFS2 formatted Cluster File Systems
Managing a GFS2 File System
Managing a GFS2 Resource in the cluster
Growing & Repairing a GFS2 File System

Who this course is for:

Linux High Availability Clustering course is aimed at senior system administrators responsible for maximizing resiliency though high availability clustering services

Linux High Availability Clustering

What you'll learn

Explore related topics

Course content

Introduction12 lectures • 1hr 25min

Managing Cluster Nodes8 lectures • 56min

Creating and Configuring Resources4 lectures • 36min

Troubleshooting High-Availability Cluster3 lectures • 28min

Complex Resource Group2 lectures • 23min

Managing Constraints4 lectures • 36min

Two Node Cluster Issues4 lectures • 18min

Configuring iSCSI Targets & Initiator6 lectures • 34min

Managing High Availabiity Logical Volumes4 lectures • 32min

Managing Clustered Logical Volumes3 lectures • 22min

Requirements

Description

Who this course is for: