Midterm Flashcards
Why can threads share the same program code and heap areas of memory, but not the stack area?
Each thread has its own flow of execution, with its own sequence of function calls, so a single stack could not track the call chains of multiple threads at once. Besides, a stack is accessed at its top (frames are pushed and popped as functions are called and return), so a single shared stack could only serve one thread's calls at a time. Each thread therefore gets its own stack, while code and heap have no per-thread state and can be shared.
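A minimal pthreads sketch of this layout (illustrative names, not course code): the global shared is one copy visible to both threads, while each thread's local lives in that thread's private stack.

    #include <pthread.h>
    #include <stdio.h>

    int shared = 0;                  /* data segment: one copy, reachable by all threads */

    void *worker(void *arg) {
        int local = *(int *)arg;     /* 'local' lives on this thread's private stack */
        shared += local;             /* shared is visible to both (unsynchronized here) */
        printf("local=%d shared=%d\n", local, shared);
        return NULL;
    }

    int main(void) {
        pthread_t t1, t2;
        int a = 1, b = 2;
        pthread_create(&t1, NULL, worker, &a);
        pthread_create(&t2, NULL, worker, &b);
        pthread_join(t1, NULL);
        pthread_join(t2, NULL);
        return 0;
    }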
Define Atomicity.
The state of being indivisible. An atomic operation executes as a single, uninterruptible unit: it either happens entirely or not at all, and no other thread can observe it half-finished.
Define Critical section.
A sequence of operations that must execute atomically, with mutual exclusion, to avoid a race condition. A critical section is a piece of code that accesses a shared variable (or more generally, a shared resource) and must not be concurrently executed by more than one thread.
Define Race condition.
Two or more operations that should run sequentially are instead run at the same time. A race condition arises when multiple threads enter a critical section at roughly the same time; both attempt to update the shared data structure, leading to a surprising (and perhaps undesirable) outcome. More generally, a race condition occurs when the outcome of a program depends on the timing or interleaving of events. The classic example is two threads incrementing a shared counter, sketched below.
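A minimal sketch of that counter race (illustrative, not course code): counter++ compiles to a load, an add, and a store, and the interleaving of those steps across threads decides the result.

    #include <pthread.h>
    #include <stdio.h>

    volatile int counter = 0;

    void *increment(void *arg) {
        for (int i = 0; i < 1000000; i++)
            counter++;               /* load, add, store: not atomic -> race */
        return NULL;
    }

    int main(void) {
        pthread_t t1, t2;
        pthread_create(&t1, NULL, increment, NULL);
        pthread_create(&t2, NULL, increment, NULL);
        pthread_join(t1, NULL);
        pthread_join(t2, NULL);
        printf("counter = %d (expected 2000000)\n", counter);
        return 0;
    }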
Define Mutual exclusion.
Only one thread can access a certain resource at a time. Mutual exclusion guarantees that if one thread is executing within the critical section, the others will be prevented from doing so; a lock that provides this guarantee is commonly called a mutex.
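Protecting the counter from the previous card with a pthread mutex makes the increment effectively atomic (a sketch, assuming the same counter):

    #include <pthread.h>

    pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;
    volatile int counter = 0;

    void *increment(void *arg) {
        for (int i = 0; i < 1000000; i++) {
            pthread_mutex_lock(&lock);    /* at most one thread past this point */
            counter++;                    /* the critical section */
            pthread_mutex_unlock(&lock);
        }
        return NULL;
    }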
Define Deadlock.
Indefinite waiting for resources due to cyclic dependencies across threads. Deadlock requires four conditions to hold at once: mutual exclusion, hold-and-wait, no preemption, and circular wait (see the gridlock card below). A minimal two-lock example is sketched below.
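A minimal sketch of the classic two-lock deadlock (illustrative names): if the two threads interleave so that each acquires its first lock, neither can ever proceed.

    #include <pthread.h>

    pthread_mutex_t A = PTHREAD_MUTEX_INITIALIZER;
    pthread_mutex_t B = PTHREAD_MUTEX_INITIALIZER;

    void *one(void *arg) {
        pthread_mutex_lock(&A);    /* holds A (mutual exclusion)... */
        pthread_mutex_lock(&B);    /* ...while waiting for B (hold-and-wait) */
        pthread_mutex_unlock(&B);
        pthread_mutex_unlock(&A);
        return NULL;
    }

    void *two(void *arg) {
        pthread_mutex_lock(&B);    /* holds B... */
        pthread_mutex_lock(&A);    /* ...while waiting for A: circular wait,
                                      and nothing preempts the held locks */
        pthread_mutex_unlock(&A);
        pthread_mutex_unlock(&B);
        return NULL;
    }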
How can we implement locks in a way that is guaranteed to be correct, efficient, and fair? Do we need assistance from the hardware to do so? What about the operating system?
For a lock to be correct, we need to obtain/release the lock in an atomic way. To do that, we need assistance from the hardware in the form of special instructions (e.g., test-and-set). For it to be efficient, we need to avoid spinning (busy waiting). To do that, we need assistance from the operating system to keep threads that cannot obtain a lock ‘sleeping’ until the lock is available. The operating system can also keep track of the threads waiting for the same lock and enforce some policy (e.g., FIFO) to ensure fairness.
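A sketch of the correct-but-inefficient half of that answer, using GCC's __sync_lock_test_and_set builtin as the hardware test-and-set (names are illustrative); the efficient version would additionally park waiting threads in the OS instead of spinning:

    typedef struct { volatile int flag; } spinlock_t;   /* flag = 1 means held */

    void spin_acquire(spinlock_t *l) {
        /* atomically write 1 and get the old value; old value 0 means we won */
        while (__sync_lock_test_and_set(&l->flag, 1) == 1)
            ;                                /* spin (busy-wait) until free */
    }

    void spin_release(spinlock_t *l) {
        __sync_lock_release(&l->flag);       /* atomically clear the flag */
    }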
Why would the given implementation of a condition not work?
The check of the condition and the act of going to sleep do not occur atomically. A thread may check the variable done, decide to sleep, and then be scheduled out before it actually sleeps. Another thread may then set done and send the wakeup signal while the first thread is not yet asleep; the signal is lost, and when the first thread runs again it goes to sleep forever. The fix is a mutual exclusion lock held around the check and released atomically with going to sleep, which is exactly what a condition variable's wait operation does.
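The standard fix, sketched with pthreads (the usual pattern, not the exact code from the question):

    #include <pthread.h>

    pthread_mutex_t m = PTHREAD_MUTEX_INITIALIZER;
    pthread_cond_t  c = PTHREAD_COND_INITIALIZER;
    int done = 0;

    void thr_exit(void) {
        pthread_mutex_lock(&m);
        done = 1;
        pthread_cond_signal(&c);         /* wakeup cannot be lost: the waiter */
        pthread_mutex_unlock(&m);        /* holds m while checking done       */
    }

    void thr_join(void) {
        pthread_mutex_lock(&m);
        while (done == 0)
            pthread_cond_wait(&c, &m);   /* atomically releases m and sleeps */
        pthread_mutex_unlock(&m);
    }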
What is the race condition in the given put function for a hash table?
Lines 23 and 24 (e->next = n; and *p = e;). Two insertions can happen at the same location at the same time, so one insert can overwrite the other's pointer update: nodes go missing, or the sorted linked list ends up out of order.
How can you change the put function using a single lock to ensure correct operation of concurrent calls?
We must obtain the lock before Line 17 and release it after Line 24. If we lock Lines 23-24 only, we avoid having missing nodes, but we can still have nodes out of order.
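The numbered listing itself isn't reproduced on these cards; a sketch of how that lock placement might look, assuming a sorted bucket list as the cards describe:

    #include <pthread.h>
    #include <stdlib.h>

    #define NBUCKET 5
    struct entry { int key, value; struct entry *next; };
    struct entry *table[NBUCKET];
    pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

    void put(int key, int value) {
        struct entry *e = malloc(sizeof(*e));
        e->key = key;
        e->value = value;

        pthread_mutex_lock(&lock);             /* before the bucket walk */
        struct entry **p = &table[key % NBUCKET];
        struct entry *n = table[key % NBUCKET];
        for (; n != NULL; p = &n->next, n = n->next)
            if (n->key > key)
                break;                         /* keep the bucket sorted */
        e->next = n;                           /* the two racy stores now */
        *p = e;                                /* sit in one critical section */
        pthread_mutex_unlock(&lock);           /* after the final store */
    }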
How can you change the put function using the atomic __compare_and_swap function to avoid a race condition without locks or semaphores?
    for (;;) {
        /* walk the sorted bucket to find the insertion point: *p is the
           pointer to patch, n is the node that will follow e */
        for (p = &table[key % NBUCKET], n = table[key % NBUCKET];
             n != NULL; p = &n->next, n = n->next) {
            if (n->key > key)
                break;
        }
        e->next = n;
        /* publish e only if *p still equals n; if another thread inserted
           here first, the CAS fails and we re-walk the list and retry */
        if (__compare_and_swap(p, n, e))
            break;
    }
What is the difference in performance between the single-lock solution and the compare-and-swap solution for the put function? In which scenario does one outperform the other?
The single-lock solution does not allow any concurrent access to the linked lists for insertion: every put is serialized, regardless of bucket. The compare-and-swap solution allows fully concurrent access and only slows down (by retrying) when two threads try to insert at the same location at the same time. So when insertions are spread across different buckets (few collisions), the compare-and-swap solution far outperforms the single lock; the more threads contend on the same bucket, the smaller its advantage becomes.
In the producer/consumer code with semaphores, what stops 2*MAX producer threads from producing more than MAX elements at a time and overwriting elements in the buffer? How?
The initialization of the 'empty' semaphore. It starts with the value MAX, so the first MAX producers each decrement the semaphore and still see a non-negative value, meaning they can all execute concurrently. But as soon as producer number MAX+1 tries to pass the semaphore, it decrements it to a negative value and sleeps until someone posts the 'empty' semaphore, which only a consumer does (after removing an element). See the sketch below.
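The usual bounded-buffer shape the card refers to (a sketch with assumed names, in the standard semaphore pattern):

    #include <semaphore.h>

    #define MAX 10
    int buffer[MAX];
    int fill = 0, use = 0;
    sem_t empty, full, mutex;
    /* init once: sem_init(&empty, 0, MAX); sem_init(&full, 0, 0);
                  sem_init(&mutex, 0, 1); */

    void produce(int value) {
        sem_wait(&empty);          /* producers MAX+1..2*MAX block here */
        sem_wait(&mutex);
        buffer[fill] = value;
        fill = (fill + 1) % MAX;
        sem_post(&mutex);
        sem_post(&full);           /* wake a waiting consumer */
    }

    int consume(void) {
        sem_wait(&full);
        sem_wait(&mutex);
        int v = buffer[use];
        use = (use + 1) % MAX;
        sem_post(&mutex);
        sem_post(&empty);          /* frees a slot: the only way a producer
                                      sleeping on 'empty' is ever woken */
        return v;
    }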
Explain how gridlock is the same as deadlock in an operating system by showing how each of the four conditions for deadlock hold.
In the gridlock analogy, each line of cars traveling in the same direction is considered a thread.
• Mutual exclusion: Only one line of cars can occupy an intersection at a time.
• Hold-and-wait: One line of cars can occupy an intersection while waiting for another intersection to become available.
• No preemption: When one line of cars is occupying an intersection, others cannot forcefully remove them to free the intersection.
• Circular wait: Because lines of cars pass through intersections in different orders, a cycle can form in which each line holds one intersection while waiting for the intersection held by the next.
What are the disadvantages of paging?
• Internal fragmentation: Page size may not match the size a process needs, leading to wasted memory inside the last page (see the worked example after this list).
• Additional memory reference to page table: Can be very inefficient without a TLB.
• Storage for page tables may be substantial: Especially with linear page tables.
• Page tables must be allocated contiguously in memory: Finding a large contiguous free region for a linear page table gets hard as memory fills.
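To make the first and third points concrete (illustrative numbers): with 4 KB pages, a process that needs 13 KB must be given four pages, i.e. 16 KB, wasting 3 KB to internal fragmentation. And a linear page table covering a full 32-bit address space with 4 KB pages has 2^20 entries; at 4 bytes per PTE that is 4 MB of contiguous memory per process.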
What is a TLB?
TLB stands for Translation Lookaside Buffer. It is a hardware cache of popular virtual-to-physical address translations inside the CPU's Memory Management Unit (MMU); in effect, it caches a small number of recently used page table entries.
Outline the paging translation steps with a TLB.
- Extract the VPN (virtual page number) from the VA (virtual address).
- Check the TLB for the VPN.
- If miss:
  a. Calculate the address of the PTE (page table entry).
  b. Read the PTE from memory.
  c. Replace some entry in the TLB with the new translation.
- Extract the PFN (page frame number).
- Build the PA (physical address) from the PFN and the page offset.
- Read the contents of the PA from memory into a register.
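A C-flavored sketch of those steps (VPN_MASK, SHIFT, PTBR, and the helpers are illustrative pseudocode, not a real API):

    uint64_t translate(uint64_t va) {
        uint64_t vpn = (va & VPN_MASK) >> SHIFT;            /* extract VPN      */
        struct tlb_entry *t = tlb_lookup(vpn);              /* check the TLB    */
        if (t == NULL) {                                    /* miss:            */
            uint64_t pte_addr = PTBR + vpn * sizeof(pte_t); /*  a. PTE address  */
            pte_t pte = read_mem(pte_addr);                 /*  b. read the PTE */
            t = tlb_insert(vpn, pte.pfn);                   /*  c. fill the TLB */
        }
        uint64_t pa = ((uint64_t)t->pfn << SHIFT)           /* extract PFN and  */
                    | (va & OFFSET_MASK);                   /* build the PA     */
        return pa;      /* the load/store then goes to this physical address */
    }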
How can the system improve TLB performance (hit rate) given a fixed number of TLB entries?
Increase page size. Fewer unique page translations are needed to access the same amount of memory.
Define TLB Reach.
TLB reach = number of TLB entries × page size.
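For example (illustrative numbers), a 64-entry TLB with 4 KB pages reaches 64 × 4 KB = 256 KB of memory without a miss; with 2 MB pages the same 64 entries reach 128 MB, which is why larger pages raise the hit rate.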
What access pattern will result in slow TLB performance?
Highly random access with no repeat accesses. Sequential array accesses almost always hit in the TLB and are very fast.
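A sketch of why sequential access behaves so well (assuming 4 KB pages and 4-byte ints):

    int a[2048];                  /* 8 KB array: spans two 4 KB pages */

    int sum(void) {
        int s = 0;
        for (int i = 0; i < 2048; i++)
            s += a[i];            /* 1024 ints fit on one page, so only the
                                     first access to each page can miss:
                                     at most 2 TLB misses in 2048 accesses */
        return s;
    }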
Explain temporal locality.
An instruction or data item that has been recently accessed will likely be re-accessed soon in the future.
Explain spatial locality.
If a program accesses memory at address x, it will likely soon access memory near x. TLBs improve performance due to spatial locality.
What TLB characteristics are best for spatial locality?
Accesses to the same page repeat, so the same VPN→PFN translation is needed again and the same TLB entry is re-used.
What TLB characteristics are best for temporal locality?
An address accessed now is accessed again soon, so its TLB entry is re-used before it is evicted. How near in the future must the re-access be? That depends on how many TLB entries there are (and on the replacement policy).