Synchronization in Linux Kernel Programming

Name: Synchronization in Linux Kernel Programming
Rating: 4.6 (206 reviews)

Linux Kernel Programming - Synchronization and Concurrency

Created by Linux Trainer

Last updated 11/2020

English

What you'll learn

Synchronization concepts in Linux Kernel

Course content

11 sections • 120 lectures • 4h 31m total length

Problem • 1:01
Illustrates a memory race in kernel programming: a function returns a pointer, two processes both see NULL and allocate memory, and one overwrites the other, leaking memory. The session discusses remedies.
Introduction to concurrency • 3:01
Explore concurrency and context switching, distinguishing the illusion of parallelism on a single core from true parallelism on multi-core systems, and learn to identify cores and processor usage.
Background of Multiprocessing • 7:53
Explain how multiprocessors evolved from private per-CPU operating systems to a single symmetric multiprocessing kernel with per-region locks, solving system-call bottlenecks and avoiding the big kernel lock.
Preemption and context switch in Linux Kernel • 3:43
Preemption in user and kernel space • 2:47
Preemption forcibly switches out the running process. User-space programs are always preemptible; with the CONFIG_PREEMPT option, code running in kernel space becomes preemptible too, which avoids lockups when a loop runs in kernel space.
When can kernel preemption happen • 0:46
Kernel preemption can happen when returning to kernel space from an interrupt handler, or when a kernel task calls schedule explicitly or blocks (which in turn calls schedule), causing a context switch.
Example of kernel preemption • 2:12
Reentrancy • 3:12
Explore the kernel control path, system calls and interrupts, and how the Linux kernel remains re-entrant to support concurrent kernel mode execution on a uniprocessor, using locking for shared data.
Synchronization Race Condition and critical regions • 2:10
Learn how synchronization prevents race conditions in the Linux kernel by protecting global data in critical regions, with examples of non-reentrant functions and preemption.
Causes of concurrency • 1:02
Identify the main causes of concurrency in the Linux kernel, including interrupts, softirqs and tasklets, preemption, sleeping, and symmetric multiprocessing.
Solution for concurrency • 2:11
Find out maximum number of processors in Kernel • 2:23
Determine the maximum number of CPUs the SMP kernel can support using NR_CPUS and the kernel configuration, override it with a kernel parameter, and check online CPUs via num_online_cpus.
Find out which processor is running kernel control path • 1:37
Identify the processor running the kernel control path by using smp_processor_id to obtain the current processor number, print it, and verify it against user-space observations.
Linux Kernel Module Example of processor id of Kernel Thread • 1:39
Explore how a Linux kernel thread prints its processor id across init, thread function, and exit, revealing scheduler-driven switching between processors.
Linux Kernel Module Example of processor id on uniprocessor system • 2:54
Explore how a uniprocessor system handles kernel and user processes through a Linux kernel module example, illustrating that /proc/cpuinfo shows one processor and how scheduling and preemption enable apparent multitasking.
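
The race described in the opening "Problem" lecture can be pictured with a small user-space sketch. This is not code from the course: `shared_buf` and `get_shared_buf` are hypothetical names, and `calloc` stands in for a kernel allocator. The function behaves correctly with one caller, and the comments mark the window in which two concurrent callers would both allocate.

```c
#include <stdlib.h>

/* Hypothetical shared state, standing in for a kernel global. */
static int *shared_buf = NULL;

/*
 * Lazily allocate the buffer on first use.
 * Fine with a single caller, racy with two: both can observe NULL at (1),
 * both allocate at (2), and the later store at (3) overwrites the earlier
 * one, so the first allocation is leaked.
 */
int *get_shared_buf(void)
{
    if (shared_buf == NULL) {              /* (1) check    */
        int *p = calloc(16, sizeof(*p));   /* (2) allocate */
        shared_buf = p;                    /* (3) publish  */
    }
    return shared_buf;
}
```

Called twice from one thread, the function returns the same pointer both times; the problem only appears when the check at (1) and the store at (3) can interleave with another caller.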

Introduction • 1:01
Explore per-CPU variables as a simple, efficient synchronization technique: each CPU gets its own array element, which prevents race conditions between CPUs, and the elements are aligned in main memory so that each falls on its own cache line.
Example of Per CPU variables • 4:58
Explore how per-CPU variables are implemented with a proc file per CPU, using read and write handlers, and get_cpu and put_cpu to disable and enable preemption while updating values.
New Interface of Per CPU Variables • 3:48
The 2.6 kernel introduced a per-CPU interface to simplify per-CPU data. This lecture explains compile-time static definitions, the get and put interfaces, preemption control, and lvalues.
Example of new interface of Per CPU Variables • 0:58
Define a per-CPU variable, assign an initial value of five, disable kernel preemption to obtain an lvalue, increment it, re-enable preemption, and verify the value increments from five to six.
Example of per cpu • 1:40
Use per_cpu with a CPU id to access other processors' copies (with locking), initialize per-CPU counters, and increment them to ten while reflecting updates across online CPUs.
Example of for_each_online_cpu • 0:44
Allocating per cpu data at runtime • 2:04
Allocate per-CPU data at runtime using the alloc_percpu wrapper, obtaining a pointer for each processor. Access the data through that pointer, disabling preemption first and re-enabling it afterwards.
Problems with Per CPU Variables • 1:19
Explore the problems with per-CPU variables, including the lack of protection against asynchronous functions and interrupt handlers. Apply additional synchronization primitives to safely share data across interrupt handlers and CPUs.
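
The "one array element per CPU" idea from the per-CPU introduction can be sketched in user space as follows. This is an analogy, not the kernel interface: `NSLOTS`, `CACHE_LINE`, `slot_inc`, and `slot_sum` are hypothetical names, the 64-byte cache line is an assumption, and a slot index stands in for the value that `smp_processor_id` would supply. In the kernel, preemption must also be disabled around the update (for example with `get_cpu` and `put_cpu`), which this sketch does not model.

```c
#include <stdalign.h>
#include <stddef.h>

#define NSLOTS     4    /* hypothetical number of "CPUs" */
#define CACHE_LINE 64   /* assumed cache-line size in bytes */

/* One counter per slot, each padded out to its own cache line. */
struct slot_counter {
    alignas(CACHE_LINE) long value;
};

static struct slot_counter counters[NSLOTS];

/* Each slot touches only its own element, so no lock is taken here. */
void slot_inc(int slot)
{
    counters[slot].value++;
}

/* Walking every slot is the analogue of visiting each CPU's copy. */
long slot_sum(void)
{
    long total = 0;
    for (int i = 0; i < NSLOTS; i++)
        total += counters[i].value;
    return total;
}
```

Because of the alignment, neighbouring elements never share a cache line, so one slot's updates do not invalidate the line another slot is using.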

Problem Read Modify Write • 5:16
Two kernel threads increment a shared global variable using non-atomic read-modify-write operations, causing race conditions and inconsistent results due to non-atomic memory access and bus arbitration.
Introduction to Atomic Operators • 3:15
Learn how atomic operators in the Linux kernel ensure race-free read-modify-write sequences using atomic_t, atomic.h, and lock instructions, with differences on SMP versus uniprocessor kernels.
Example of Atomic Operators • 2:58
Explore atomic operations in Linux kernel programming, using atomic.h macros such as ATOMIC_INIT, atomic_inc, atomic_dec, atomic_set, atomic_read, atomic_add, and atomic_sub to ensure read-modify-write safety across CPUs.
Common uses of Atomic Operations • 1:54
Atomic Operation and test • 4:34
Understand architecture-dependent atomic operations in the Linux kernel, including decrement, increment, subtract, and add with test variants, illustrated through atomic.h usage in arch and include directories.
Atomic add subtract and return • 1:14
Explore atomic add, subtract and return, which modify a value and read back the latest value in a single atomic call, building on the atomic add, increment, decrement, and test APIs.
More Atomic Operations • 4:02
64-bit Atomic Operations • 1:48
Explore 64-bit atomic operations in Linux kernel programming, using atomic64_t and the atomic64_* APIs, with hashed spinlocks as a fallback for unsupported architectures and applications in the performance counter subsystem.
Atomic bitwise operations • 4:02
Explore how the Linux kernel provides architecture-specific atomic bitwise operations via header implementations, using generic pointers to set, clear, and toggle bits (0–31 or 0–63) with practical examples.
Atomic bitwise operations with return value • 1:14
Explore atomic bitwise operations in Linux kernel programming, including test_and_set_bit, test_and_clear_bit, and test_and_change_bit, all returning the old value (0 or 1).
Non Atomic bitwise operations • 0:36
Explore non-atomic bitwise operations in Linux kernel programming, comparing them with the atomic versions to understand when the non-atomic ones may be faster, depending on the processor.
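
The "modify and return" and "test and set" behaviours described in this section can be approximated in user space with C11 `<stdatomic.h>`. The sketch below is an analogy rather than the kernel implementation: `add_return`, `dec_and_test`, `test_and_set`, and `test_and_clear` are hypothetical helpers that mimic the return conventions of `atomic_add_return`, `atomic_dec_and_test`, `test_and_set_bit`, and `test_and_clear_bit`.

```c
#include <stdatomic.h>
#include <stdbool.h>

/* Analogue of atomic_add_return(): add i and return the NEW value. */
int add_return(atomic_int *v, int i)
{
    return atomic_fetch_add(v, i) + i;   /* fetch_add yields the old value */
}

/* Analogue of atomic_dec_and_test(): true when the result reaches zero. */
bool dec_and_test(atomic_int *v)
{
    return atomic_fetch_sub(v, 1) - 1 == 0;
}

/* Analogue of test_and_set_bit(): set bit nr, return its old value (0 or 1). */
int test_and_set(atomic_ulong *word, unsigned nr)
{
    unsigned long mask = 1UL << nr;
    return (atomic_fetch_or(word, mask) & mask) != 0;
}

/* Analogue of test_and_clear_bit(): clear bit nr, return its old value. */
int test_and_clear(atomic_ulong *word, unsigned nr)
{
    unsigned long mask = 1UL << nr;
    return (atomic_fetch_and(word, ~mask) & mask) != 0;
}
```

Each helper performs its read-modify-write as one atomic step, so a second thread can never observe or overwrite the intermediate value, which is the property the "Problem Read Modify Write" lecture shows to be missing from a plain increment.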

Introduction • 2:49
Atomic operations are limited to word or doubleword sizes, so custom structures or larger shared data cannot be updated atomically; spinlocks protect short critical sections by allowing only one CPU in at a time.
Spinlock APIs • 3:19
Learn how spinlocks in the Linux kernel protect short critical sections to ensure atomicity and prevent race conditions by using spin_lock and spin_unlock with spinlock_t.
Initializing spinlock • 0:51
Learn how to initialize a spinlock at runtime in memory obtained from kmalloc, choosing between static and dynamic allocation. Initialize it to the unlocked state, then lock, unlock, and free the memory.
Spinlock Example of two kernel threads • 1:16
Demonstrates a spinlock controlling access to a shared counter between two kernel threads, forming a critical section where only one CPU updates the counter at a time.
What happens if I acquire a lock which is already held by CPU • 3:40
In the Linux kernel, spinlocks are not recursive; acquiring a spinlock already held by the same CPU causes busy spinning and deadlock, potentially stalling the CPU.
Implement busyloop using spinlock in char drivers • 2:59
Explore implementing a busy loop using a spinlock in a character device driver, guarding a shared buffer by taking the lock in open and unlocking afterwards.
spin_trylock • 1:16
Can I use spinlock if resource is shared between process and interrupt context • 4:42
Use spin_lock_irqsave to protect resources shared by process and interrupt contexts: disable interrupts, enter the critical section, and restore the prior interrupt state on unlock with spin_unlock_irqrestore.
Is kernel preemption disabled when spinlock is acquired • 2:34
Learn how kernel preemption interacts with spinlocks: preemption is disabled in the spinlock critical region and re-enabled on unlock, with uniprocessor and multiprocessor implications.
Important points to consider while using spinlock • 1:29
Example of calling msleep in critical section • 2:49
This lecture demonstrates calling sleep (msleep) inside a spinlock with preemption disabled, explaining potential deadlock and why sleep in a spinlock is not recommended.
Will spinlock exist on uniprocessor system • 1:29
Explain spinlock behavior on uniprocessor systems, showing how preemption settings turn spinlocks into empty operations, especially when used between interrupts or in process context.
Implementation of spinlock • 4:58
Explain how spinlocks implement mutual exclusion using a lock bit in a two-state model (locked and unlocked), with busy-wait loops and architecture-specific atomic operations in the Linux code.
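
The two-state lock bit with a busy-wait loop, as described under "Implementation of spinlock", can be sketched in user space with a C11 `atomic_flag`. This is a minimal illustration under assumptions, not the kernel's architecture-specific code: the `sketch_` names are hypothetical, and the kernel additionally disables preemption while the lock is held. `sketch_spin_trylock` follows the `spin_trylock` convention of returning non-zero on success.

```c
#include <stdatomic.h>

/* Two-state lock: the flag is set while locked and clear while unlocked. */
typedef struct {
    atomic_flag locked;
} sketch_spinlock;

#define SKETCH_SPINLOCK_INIT { ATOMIC_FLAG_INIT }

void sketch_spin_lock(sketch_spinlock *l)
{
    /* test_and_set returns the previous state: spin while it was already set. */
    while (atomic_flag_test_and_set_explicit(&l->locked, memory_order_acquire))
        ;   /* busy-wait */
}

/* Non-zero if the lock was acquired, zero if it was already held. */
int sketch_spin_trylock(sketch_spinlock *l)
{
    return !atomic_flag_test_and_set_explicit(&l->locked, memory_order_acquire);
}

void sketch_spin_unlock(sketch_spinlock *l)
{
    atomic_flag_clear_explicit(&l->locked, memory_order_release);
}
```

Calling `sketch_spin_lock` a second time from the same thread without unlocking would spin forever, which mirrors the non-recursive behaviour covered in the lecture on re-acquiring a held lock.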

Introduction • 0:55
Implementation of semaphore • 1:44
Learn how a Linux kernel semaphore uses an integer value and two operations, P and V, to control entry into critical sections; P blocks when the value is zero, and V wakes waiters.
Types of semaphores • 1:05
Can I use counting semaphores in critical section • 0:56
Explore whether counting semaphores can be used in a critical section and learn that binary semaphores are used in the kernel for mutual exclusion.
Semaphore API • 2:34
Learn the Linux semaphore API and its kernel implementation, including struct semaphore with a spinlock, usage count, and wait list, plus dynamic and static initialization and the use of down and up.
Linux Kernel Module Example using semaphore API • 0:48
Allocate memory for a semaphore with kmalloc and initialize it to one to create a binary semaphore. Call down to enter the critical section and up to leave it.
Linux Kernel Module Example using down and up • 1:32
Linux kernel module example using semaphore down and up to control access to a critical region, showing initialization, decrementing and incrementing the count, and printed values.
Linux Kernel Module Example calling down twice • 4:28
Demonstrate a Linux kernel module example of calling down twice on a binary semaphore, managing entry to a critical region and queuing when blocked, and examining uninterruptible task state.
Linux Kernel Module Example of producer consumer • 3:02
down_interruptible • 2:55
Explore the distinction between down and down_interruptible in Linux kernel synchronization: how interruptible sleep lets a waiting process receive signals and return, unlike uninterruptible sleep.
down_trylock • 1:05
Explore the semaphore down_trylock API: it acquires when available and returns non-zero if not. Starting with value one, down makes it zero, and a second down blocks.
down_timeout • 2:14
down_killable • 2:32
Explore how down_killable restricts signal delivery to fatal signals in Linux kernel programming, while down_interruptible allows any signal to be delivered, as shown with SIGKILL.
Important points while using semaphore • 1:49
Use semaphores for long-held locks, not short ones, because of their queueing and sleeping overhead. They cannot be acquired in interrupt context, do not disable preemption, and offer better CPU utilization than spinlocks.
spinlock vs semaphore • 1:29
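
The counting behaviour behind `down_trylock` and `up` can be illustrated with a user-space sketch built on a C11 compare-and-swap loop. This covers only the non-blocking path: a real `down` puts the caller to sleep on the wait list, which is not modelled here, and the `sketch_` names are hypothetical. The return value follows the `down_trylock` convention of 0 when acquired and 1 when not.

```c
#include <stdatomic.h>

typedef struct {
    atomic_int count;   /* number of holders still allowed in */
} sketch_sem;

void sketch_sema_init(sketch_sem *s, int val)
{
    atomic_init(&s->count, val);
}

/* Returns 0 if the semaphore was acquired, 1 if it was not available. */
int sketch_down_trylock(sketch_sem *s)
{
    int c = atomic_load(&s->count);
    while (c > 0) {
        /* On failure c is refreshed with the current count and re-checked. */
        if (atomic_compare_exchange_weak(&s->count, &c, c - 1))
            return 0;
    }
    return 1;
}

/* The V operation: release one unit. */
void sketch_up(sketch_sem *s)
{
    atomic_fetch_add(&s->count, 1);
}
```

Initialised to one, the sketch behaves as a binary semaphore: the first try succeeds, a second try fails until `sketch_up` is called.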

Introduction • 2:25
Mutex vs semaphore • 3:41
Mutex Implementation in Linux Kernel • 1:00
Mutex API • 2:00
Linux Kernel Module Example using mutex API dynamic initialization • 0:30
Demonstrates how to use a mutex to protect a critical section by including the header, allocating, initializing, locking, unlocking, and freeing the mutex.
Linux Kernel Module Example using mutex API static initialization • 0:20
Illustrate a Linux kernel module example using the mutex API with static initialization. Defining the mutex statically eliminates the separate initialization step.
Linux Kernel Module Example demonstrating calling sleep in critical section • 1:30
mutex_trylock • 0:42
What happens when other thread calls unlock mutex • 5:54
Trying recursive mutex locks • 1:33
Explore why recursive mutex locks trigger errors in Linux kernel synchronization, inspect debug mutexes, spinlocks, and semaphore behavior, and understand the owner field role in mutex structures.
mutex_is_locked • 1:15
Explore the mutex_is_locked API, which reports whether a mutex is locked or unlocked and helps avoid recursive locking. It returns 1 when locked and 0 when unlocked.
Which one do you choose between semaphore and mutex • 1:09
Which one do you choose between spinlock and mutex • 1:06
Compare spinlocks and mutexes by evaluating locking overhead and lock hold time, and decide based on interrupt context and whether sleeping is required; spinlocks are needed in interrupt contexts.

Problem Statement • 3:08
Demonstrates a mutex-protected shared counter in a proc file with read and write operations, showing how a single lock blocks concurrent reads.
Solution • 1:31
Introduction to ReadWrite Spinlock • 1:21
Introduce the read-write spinlock API, including the rwlock_t structure and read_lock/read_unlock and write_lock/write_unlock, and compare it to spinlock reader and writer variants.
Linux Kernel Module Example using RW Spinlock API • 4:06
Demonstrate a Linux kernel module using a rw spinlock with two readers and one writer, illustrating read and write locks, contention, and the impact of delays.
What happens when we call read lock and write lock one after another • 1:28
Acquire a read lock then request a write lock and see that upgrading does not occur here, causing a deadlock as the write lock waits for the reader to unlock.
Recursive read locks • 1:24
What happens when writer is waiting and reader arrives • 3:16
Explore how a read lock allows multiple readers to proceed while a writer waits for exclusive access, illustrating first-in, first-out fairness and writer starvation avoidance in the Linux kernel.
Linux Kernel Module Example using RWLOCKS with 3 kernel threads • 1:42
Illustrates a Linux kernel module example using rwlocks with three kernel threads performing read and write locks, showing how lock contention and access order evolve during sleeps.
Linux Kernel Module Example using RWLOCKS with 4 kernel threads • 2:14
Examine how four kernel threads contend for read and write access with rwlocks, highlighting non-deterministic ordering due to spin locks and the observed race in lock acquisition.
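
The reader/writer rules from this section (many readers or one writer, and no upgrade from read to write) can be sketched in user space with a single atomic state word. This is a simplified model under assumptions, not the kernel's rwlock_t: the `sketch_` names are hypothetical, only non-blocking try operations are shown, and fairness between readers and writers is not modelled. The try functions return 1 on success.

```c
#include <stdatomic.h>

/* state > 0: number of readers; state == 0: free; state == -1: one writer. */
typedef struct {
    atomic_int state;
} sketch_rwlock;

void sketch_rwlock_init(sketch_rwlock *l)
{
    atomic_init(&l->state, 0);
}

/* Readers may enter as long as no writer holds the lock. */
int sketch_read_trylock(sketch_rwlock *l)
{
    int s = atomic_load(&l->state);
    while (s >= 0) {
        if (atomic_compare_exchange_weak(&l->state, &s, s + 1))
            return 1;
    }
    return 0;
}

void sketch_read_unlock(sketch_rwlock *l)
{
    atomic_fetch_sub(&l->state, 1);
}

/* A writer needs the lock completely free: no readers, no other writer. */
int sketch_write_trylock(sketch_rwlock *l)
{
    int expected = 0;
    return atomic_compare_exchange_strong(&l->state, &expected, -1);
}

void sketch_write_unlock(sketch_rwlock *l)
{
    atomic_store(&l->state, 0);
}
```

A thread that already holds the read side and then asks for the write side is refused here; with the blocking `write_lock` it would wait on its own read lock, which is the deadlock shown in the lecture on calling read lock and write lock one after another.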

Introduction to ReadWrite Semaphores • 1:51
Explore how read-write semaphores in the Linux kernel use binary mutexes for writer-only mutual exclusion, with the rw_semaphore structure and an initial zero value.
ReadWrite Semaphore API • 1:46
Demonstrates how read and write locks are acquired and released using down_read, up_read, down_write, up_write, with upgrade and downgrade, including an uninterruptible sleep when locking.
down_read_trylock and down_write_trylock • 0:33
Learn how down_read_trylock and down_write_trylock in Linux kernel synchronization work: they return 1 when the lock is acquired, unlike the normal semaphore down_trylock, which returns 0 on success.
downgrade_write • 6:15
downgrade_write converts an acquired write lock to a read lock, enabling a quick write followed by longer read access in Linux kernel synchronization.
Recursive write locks • 0:38
Linux Kernel Module Example using multiple threads • 1:58

Problem • 1:08
Solution • 0:48
Sequence Locks • 3:06
Sequence locks, added in Linux 2.6, give many readers fast, lock-free access: writers may modify data during reads, and readers verify data validity with a sequence counter.
How Sequence Locks works • 1:20
Learn how a sequence lock uses a sequence counter and a spinlock to synchronize readers and writers, with the spinlock serializing writes and the counter starting at zero.
Write operation in sequence lock • 1:09
Explain how a write operation uses a write sequence lock to ensure mutual exclusion with a spin lock, incrementing the sequence number on both lock and unlock, starting from zero.
Read operation in sequence lock • 2:10
Linux Kernel Module Example using sequence lock • 1:31
Explore the Linux kernel sequence lock concept, using a sequence counter to guard reads and writes, illustrating retry on data invalidation and the behavior without locks.
Is Kernel Preemption Disabled using sequence lock • 1:12
Limitation of sequence locks • 1:09
Sequence Locks in Linux kernel • 0:25
Explore how sequence locks protect a 64-bit uptime value in the Linux kernel, with examples showing their use across kernel time operations and related functions.
Sequence locks in interrupts • 0:20
Explore sequence locks and their interrupt-ready variants, and learn how to handle contexts that run in both interrupts and process context by saving and restoring state.
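
The counter protocol from this section (incremented once on write lock and once on write unlock, with readers retrying if it changed) can be sketched in user space as follows. This is an illustration under assumptions: the `sketch_` names are hypothetical, a single writer is assumed (the kernel's `write_seqlock` also takes a spinlock to exclude other writers), and the protected fields are C11 atomics so the sketch stays free of data races. `sketch_read_begin` and `sketch_read_retry` play the roles of the kernel's `read_seqbegin` and `read_seqretry`.

```c
#include <stdatomic.h>

/* Even sequence = data stable; odd sequence = write in progress. */
typedef struct {
    atomic_uint seq;
    atomic_int a, b;    /* the protected pair of values */
} sketch_seq;

void sketch_seq_init(sketch_seq *s)
{
    atomic_init(&s->seq, 0);
    atomic_init(&s->a, 0);
    atomic_init(&s->b, 0);
}

/* Writer side; assumes only one writer at a time. */
void sketch_write(sketch_seq *s, int a, int b)
{
    atomic_fetch_add(&s->seq, 1);   /* sequence becomes odd */
    atomic_store(&s->a, a);
    atomic_store(&s->b, b);
    atomic_fetch_add(&s->seq, 1);   /* sequence becomes even again */
}

/* Wait until no write is in progress, then remember the sequence. */
unsigned sketch_read_begin(sketch_seq *s)
{
    unsigned v;
    while ((v = atomic_load(&s->seq)) & 1u)
        ;   /* writer active: spin */
    return v;
}

/* Non-zero if a write happened since start, so the read must be redone. */
int sketch_read_retry(sketch_seq *s, unsigned start)
{
    return atomic_load(&s->seq) != start;
}

/* Reader loop: repeat until a consistent snapshot of both fields is seen. */
void sketch_read(sketch_seq *s, int *a, int *b)
{
    unsigned start;
    do {
        start = sketch_read_begin(s);
        *a = atomic_load(&s->a);
        *b = atomic_load(&s->b);
    } while (sketch_read_retry(s, start));
}
```

Readers never block the writer; they only pay for a retry when a write overlaps their read, which is why the approach suits data that is read often and written rarely.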

Introduction • 7:31
Discover read-copy-update (RCU) in the Linux kernel: enable a single writer to update pointer-based data structures without blocking readers, by copying the structure, swapping the pointer, and freeing the old copy after pre-existing readers finish.
Linkedlist example of how to delete node lock free • 3:21
Demonstrates lock-free linked-list deletion by bypassing node B (set A's next to C), waiting for readers to move on, and safely freeing B after no readers remain.
RCU Design • 2:38
Why should I use rcu_assign_pointer • 1:22
Read operation in RCU • 2:52
Why should I use rcu_dereference • 1:31
Linux Kernel Module example 1 of read and write threads • 1:28
Demonstrate read and write threads using RCU to update a global pointer, with read side critical section and no locks, illustrating RCU’s speed.
Linux Kernel Module example 2 of read and write threads • 4:54
When should we free memory • 1:34
Explore RCU memory management: removal and reclamation occur after pre-existing readers finish, with reads inside RCU read-side critical sections and pointers updated by rcu_assign_pointer.
synchronize_rcu • 3:38
Block the calling process until all pre-existing read-side critical sections on all CPUs complete, then safely reuse or remove the old memory after synchronize_rcu returns.
call_rcu • 2:12
Use call_rcu to defer work until all read-side critical sections finish. Embed an rcu_head, register the rcu_free callback with call_rcu, and use container_of to recover the enclosing structure.
Can rcu read side critical sections be nested • 1:11
Shows that RCU read-side critical sections can be nested when there is no blocking or sleeping, using memory barriers instead of locks.
How does synchronize_rcu work internally • 2:22
Explore how synchronize_rcu works internally by tracking completion of read side critical sections through read lock, read unlock, and preemption, using context switches as the signal that readers finished.
RCU Terminology • 2:06
RCU variants for Linked Lists • 5:01
Learn how to implement a lock-free linked list using RCU variants for list APIs, addressing race conditions between readers and writers with RCU assigned pointers.
Linux Kernel Module example of rcu linked lists • 1:52
Advantages of RCU • 0:39
Question and Answer • 4:59
Explore synchronization in Linux kernel programming by comparing RCU and sequence locks, discussing readers and writers, grace period, copy operations, and memory costs.
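
The removal-then-reclamation pattern from the linked-list deletion and call_rcu lectures can be sketched in user space. This is a single-threaded illustration under loud assumptions, not the kernel API: `sketch_head` is a hypothetical stand-in for `struct rcu_head`, `sketch_call_after_grace` runs the callback immediately because no readers exist in the sketch (real `call_rcu` defers it until all pre-existing readers finish), and a release store stands in for `rcu_assign_pointer`.

```c
#include <stdatomic.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical stand-in for struct rcu_head: just a callback slot. */
struct sketch_head {
    void (*func)(struct sketch_head *);
};

struct node {
    int key;
    struct node *_Atomic next;
    struct sketch_head rcu;   /* embedded head, recovered via container_of */
};

#define sketch_container_of(ptr, type, member) \
    ((type *)((char *)(ptr) - offsetof(type, member)))

static int freed_key = -1;    /* records which node was reclaimed */

struct node *node_new(int key, struct node *next)
{
    struct node *n = malloc(sizeof(*n));
    if (n == NULL)
        abort();
    n->key = key;
    atomic_init(&n->next, next);
    n->rcu.func = NULL;
    return n;
}

/* Removal: make prev skip its successor. The victim keeps its own next
 * pointer, so a reader already standing on it can still move forward. */
struct node *sketch_unlink_next(struct node *prev)
{
    struct node *victim = atomic_load_explicit(&prev->next, memory_order_acquire);
    struct node *after = atomic_load_explicit(&victim->next, memory_order_acquire);
    atomic_store_explicit(&prev->next, after, memory_order_release);
    return victim;
}

/* Reclamation callback: find the enclosing node from its embedded head. */
static void node_free_cb(struct sketch_head *h)
{
    struct node *n = sketch_container_of(h, struct node, rcu);
    freed_key = n->key;
    free(n);
}

/* ASSUMPTION: with no concurrent readers the grace period is trivially
 * over, so the callback runs at once. */
void sketch_call_after_grace(struct sketch_head *h,
                             void (*func)(struct sketch_head *))
{
    h->func = func;
    h->func(h);
}
```

With a list A, B, C, unlinking after A leaves A pointing at C while B still points at C; B is only freed in the callback, which is the separation of removal from reclamation that the "When should we free memory" lecture describes.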

Requirements

Should be able to write/understand Hello World Linux Kernel Module
Should be able to write/understand Linux Kernel Modules for /proc filesystem

Description

Update: Sep 15: Added RCU Section

What you will learn in this course

Various concepts related to concurrency like: preemption, context switch, reentrancy, critical section, race condition
Various Synchronization techniques
- Per CPU Variables
- Atomic Variables
- Spinlocks
- Semaphores
- Mutexes
- Read Write Locks
- Sequence Locks
- Read Copy Update (RCU)

APIs/Macros/Structures:

spinlock_t, DEFINE_SPINLOCK, spin_lock, spin_unlock, spin_trylock, spin_lock_irqsave, spin_unlock_irqrestore, spin_lock_irq, spin_unlock_irq
atomic_t, atomic64_t, ATOMIC_INIT, atomic_inc, atomic_dec, atomic_set, atomic_read, atomic_add, atomic_sub
atomic_dec_and_test, atomic_inc_and_test, atomic_sub_and_test, atomic_add_negative, atomic_add_return, atomic_sub_return, atomic_inc_return, atomic_dec_return, atomic_fetch_add, atomic_fetch_sub, atomic_cmpxchg, atomic_xchg
set_bit, clear_bit, change_bit, test_and_set_bit, test_and_clear_bit, test_and_change_bit
NR_CPUS, num_online_cpus, smp_processor_id, get_cpu, put_cpu, DEFINE_PER_CPU, get_cpu_var, put_cpu_var, per_cpu, for_each_online_cpu, alloc_percpu, free_percpu, per_cpu_ptr
rcu_read_lock, rcu_read_unlock, synchronize_rcu, call_rcu, rcu_assign_pointer, rcu_dereference
seqlock_t, seqcount_t, DEFINE_SEQLOCK, seqlock_init, write_seqlock, write_sequnlock
struct rw_semaphore, DECLARE_RWSEM, init_rwsem, down_read, up_read, down_write, up_write, down_read_trylock, down_write_trylock, downgrade_write
rwlock_t, DEFINE_RWLOCK, rwlock_init, read_lock, read_unlock, write_lock, write_unlock
struct mutex, DEFINE_MUTEX, mutex_init, mutex_lock, mutex_unlock, mutex_trylock, mutex_lock_interruptible, mutex_is_locked
struct semaphore, sema_init, DEFINE_SEMAPHORE, down, up, down_interruptible, down_trylock, down_timeout, down_killable

Commands used in the course

nproc
ps -eaF
ps aux

Who this course is for:

Linux Kernel Developers interested in learning various synchronization techniques

Course content

Concurrency • 15 lectures • 39min

Per CPU Variables • 8 lectures • 17min

Atomic Operators • 11 lectures • 31min

Spin Locks • 13 lectures • 34min

Semaphore • 15 lectures • 29min

Mutex • 13 lectures • 23min

Read Write Locks • 9 lectures • 20min

Semaphores RWLocks • 6 lectures • 13min

Sequential Locks • 11 lectures • 14min

RCU Locks • 18 lectures • 51min
