
Develop a systematic approach to Linux diagnostics and troubleshooting, breaking problems into pieces, gathering evidence, and applying root cause analysis with proper documentation.
Learn to set up a centralized rsyslog server with syslog on CentOS 7, forwarding client logs to a central destination for easier troubleshooting and log management.
Learn to verify file and directory integrity with aid by creating a snapshot database, detecting changes or new files, updating the database, and monitoring permission changes.
Learn to audit a Linux system with auditd, create on-the-fly and permanent watch rules to monitor a directory and file changes, and search the audit log for events.
Learn to identify hardware on a Linux system using commands like ls cpu, /proc/cpuinfo, meminfo, dmidecode, lspci, sdparm, and lsusb to diagnose CPU, memory, disk, USB, and PCI hardware issues.
Understand the virtual file system (vfs) and the common file model that lets standard system calls access ext4, xfs, and nfs across file systems, with page, inode, and dentry caches.
Explore how to check and repair a zfs file system using zfs_repair, including dry-run mode, unmounted state, automatic use of backup super blocks, and handling corrupted super blocks.
Join a lab session to recover from LVM issues by practicing troubleshooting of volume groups, physical volumes, and logical volumes, using archive and backup files for volume group restore.
Explore recovering metadata of a physical volume in an lvm setup, including the header, uid, and volume group configuration entries, using backups from /etc/lvm and related tools.
Recover a corrupted disk header and lvm metadata in a volume group by recreating the physical volume with the same unique id, restoring from backup, and using vg cfg restore.
practice lab on iscsi target and initiator demonstrates creating block storage, luns, and acl mappings using the target cli, installing iscsi packages on the client, and preparing initiator connections.
Learn to manage IP version networking on Linux with nmcli commands, check interface status, add IP addresses, and activate or deactivate connections.
Understand how a missing grub configuration file blocks boot, then boot from grub by selecting the boot partition, kernel, and init ram fs, then restore grub.conf or use rescue mode.
boot xl7 into rescue mode, equivalent to single user mode, requiring the root password; use grub2 or edit the linux line to rescue.target, noting local mounts and no networking.
Boot into troubleshooting mode using installation media to reset the root password when rescue or emergency modes cannot boot the system.
Learn to locate Linux logs such as messages and audit.log on the server, and use the sundry Linux troubleshooter in graphical mode to analyze SELinux alerts and access denied events.
Explore pluggable authentication modules (pam) and how they sit between Linux applications and the native authentication system to unify and secure user login across ssh, ftp, and apache.
Senior system administrators who are interested in troubleshooting more are the target audience for this course.
Every section of this course has a different set of quizzes, and there is also many project lab assignment that will help you demonstrate what you have learned.
Linux Diagnostics And Troubleshooting
Course Content
Introduction
What is troubleshooting?
Troubleshooting a login issue
Collecting Information
System Journal
Troubleshoot a web server issue using the log files
Using Red Hat Resources
Collecting Information with SOSREPORT
Practice Lab Session
Troubleshoot a FTP Connectivity Issue
Quiz - MCQ
Monitoring Systems
System Monitoring with Cockpit
Performance Co-Pilot or pcp
Centralized log server using rsyslog
Practice Lab Session
Intrusion detection software to monitor changes
Advanced Intrusion Detection Environment (AIDE)
Practice Lab Session
System Auditing with auditd
Quiz - MCQ
Identifying Hardware Issues
Identify various hardware and their problems
Hardware Error Reporting using mcelog and rasdaemon
Memory Testing using memtest86+ package
Managing Kernel Modules
Loading and unloading modules
Managing module options
Troubleshooting to disable MSI-X interrupt handling in the driver
Quiz - MCQ
Troubleshooting Storage Issues
Overview of Linux Storage Stack
Virtual File System
Filesystems & Device Manager
Device Mapper Multipath
Recovering from File System Corruption
Checking & Repairing ext3/ext4 fs using e2fsck
Checking & Repairing xfs fs using xfs_repair
Recovering LVM Issues
Practice Lab Session - using vgcfgrestore
Migrating a Volume Group from one system to another
Recovering Metadata in LVM
Practice Lab Session
Configuration of Iscsi target & Initiator
Practice Lab Session on iscsi target & initiator - 1
Practice Lab Session on iscsi target & initiator - 2
Quiz - MCQ
Troubleshooting RPM Issues
Resolve package management dependency issues
Identify & fix dependency issue
Recover a corrupted RPM Database
Working with transaction history using Yum command
Reverting & repeating transaction using yum command
Practice Lab Session
Quiz - MCQ
Troubleshooting a Network Issues
Check Network Connectivity
Scanning Network Ports using nmap
Communicating with a remote service using nc command
Monitoring Network traffic using iptraf-ng
Troubleshooting a network issue (One network interface is not working properly)
Troubleshooting tips related with device names of ethernet interface
Disable consistent network device naming
Overview NetworkManager
Practice Lab Session
Capturing Packets with tcpdump
Practice Lab Sessions
Quiz - MCQ
Troubleshooting a Boot Issues
Linux Boot process
Booting of RHEL 7 with systemd
Overview of Grub2
Grub2 features
Booting into Grub Menu
Protect Grub by appying a password
Lab on Basic System Recovery (Initramfs file missing/Corrupted)
Lab on Basic System Recovery (Grub related issues like grub is missing)
Lab on Basic System Recovery (Master Boot Record missing/corrupted)
Rescue Modue in RHEL 7
Practice Lab Session
Reset the root passwd using installation disk
Reset the root passwd using rd.break
Repairing File System Issues at boot
Fix issues at /etc/fstab at boot - Lab Session
Assignment - This lab project assignment aims to explore some common Red Hat Linux boot issues and provide solutions to resolve them.
Troubleshooting a Security Issues
Troubleshooting a SELinux Issue
Changing SELinux context
troubleshooting ftp connectivity issue using booleans
SELinux Audit Logs & Troubleshooting
Overview of PAM Security
Concepts of PAM
PAM Modules & Configurations
PAM Module Groups
Control Flags in PAM
PAM Modules
last lecture