Midterm review Flashcards

Question

Describe the process state: Terminated

Answer 1

The process has completed its work or ends up with an error.

Answer 2

Once a process has become blocked (e.g., by initiating an I/O operation), the OS will keep it as such until some event occurs (e.g., I/O completion); at that point, the process moves to the ready state again (and potentially immediately to running again, if the OS so decides).

Answer 3

**Message-based IPC** Pros: OS managed, messages share APIs and system calls Cons: overhead, all information that we want to pass is copied **Shared-memory-based IPC** Pros: OS is not in the path of communication, no OS overhead Cons: no fixed and well defined APIs, more error prone

Answer 4

As long as the time to context switch T is such that T\_idle is greater than twice the time to context switch it makes sense to context switch to another thread and hide the idling time

Answer 5

If you have significantly more threads ready to run than there are processors, you will usually find that your performance degrades. This is partly because most thread schedulers are quite slow at making general re-scheduling decisions. If there is a processor idle waiting for your thread, the scheduler can probably get it there quite quickly. But if your thread has to be put on a queue, and later swapped into a processor in place of some other thread, it will be more expensive. A second effect is that if you have lots of threads running they are more likely to conflict over mutexes or over the resources managed by your condition variables.

Answer 6

By multithreading the OS’s kernel we allow the OS to support multiple execution contexts, and this is particularly useful when there are in fact multiple CPUs, so that the OS context can execute concurrently on different CPUs in a multiprocessor/multicore platform.

Answer 7

The boss assigns work to the workers. Each worker performs the entire task. The boss and the workers communicate via shared queue.

Answer 8

Throughput of the system is limited by the boss thread. We can use a queue to increase throughput by making the boss add tasks to the queue and the workers retrieve task from the queue which results in lower time per task that the boss needs.

Answer 9

A negative of this approach is that it ignores locality. The boss doesn’t keep track of what each worker is doing. If we have a situation where a worker just completed a similar type of task or identical type of task, it is more likely that the same worker will be more efficient in performing the exact same task in the future. Or it may be that it already has a tool that is required to build that particular type of toy nearby on its desk. But if the boss does not know that the workers are doing, it has no way to make these kind of optimizations.

Answer 10

Each thread does some work and passes the partial result to another thread.

Answer 11

The ideal is that the stages are equal: this will provide maximum throughput, by utilizing all your processors fully. Achieving this ideal requires hand tuning, and re-tuning as the program changes.

Answer 12

* hand tuning is needed for determining the right number of pipeline stages * hardwired limit in the degree of parallelization * overall throughput is limited by slowest stage * more hand-tuning needed

Answer 13

The simplest way that threads interact is through access to shared memory. In a high-level language, this is usually expressed as access to global variables. Since threads are running in parallel, the programmer must explicitly arrange to avoid errors arising when more than one thread is accessing the shared variables. The simplest tool for doing this is a primitive that offers mutual exclusion (sometimes called critical sections), specifying for a particular region of code that only one thread can execute there at any time.

Answer 14

You can view a mutex as a simple kind of resource scheduling mechanism. The resource being scheduled is the shared memory accessed inside the LOCK clause, and the scheduling policy is one thread at a time. But often the programmer needs to express more complicated scheduling policies. This requires use of a mechanism that allows a thread to block until some event happens. This is achieved with a condition variable. A condition variable is always associated with a particular mutex, and with the data protected by that mutex.

Answer 15

If you keep your use of condition variables very simple, you might introduce the possibility of awakening threads that cannot make useful progress. This can happen if you use “Broadcast” when “Signal” would be sufficient, or if you have threads waiting on a single condition variable for multiple different reasons.

Answer 16

if() only checks once, while() keeps checking until it's satisfied.

Answer 17

Probably the most practical prevention technique (and certainly one that is frequently employed) is to write your locking code such that you never induce a circular wait. The most straightforward way to do that is to provide a total ordering on lock acquisition. For example, if there are only two locks in the system (L1 and L2), you can prevent deadlock by always acquiring L1 before L2.

Answer 18

To have a multithreaded OS kernel, it must maintain: * thread abstraction (data structure to rep. threads) * scheduling, sync To support threads at the user level, it must have: * a user-level library that is linked with the application * The library supports a data structures, scheduling, synchronization and other mechs that's needed to make resource mgmt decisions for the threads User level threads can be mapped to underlying kerner-level threads: * 1:1 * M:1 * M:M To the user level library, kernel level threads look a lot like virtual CPUs The user threading library keeps track of all the threads that represent a single process. There is a relationship between the threads and a PCB that represents that address space. For each process, we need to keep track of what kernel level threads that execute on behalf of the process, and for each kernel level thread, we need to know what address space within which that thread executes. If the system has multiple CPUs, we need a data structure to represent the CPU and maintain relationships with KLTs.

Answer 19

PCB: * Virtual Address mapping User Level library data structure for user level thread (ULT): * UL thread ID * UL registers * thread stack Kernel level thread data structure (KLT): * stack * register pointer

Answer 20

* events generated externally by components other than the CPU (I/O devices, timers, other CPUs) * determined based on the physical platform * appear asyncronously * compare to "snowstorm warning"

Answer 21

* events triggered by the CPU & software running on it * determined based on the operating system * appear syncronously or asynchronously * compared to a low battery warning

Answer 22

Hard process state: * information relevant for all of the user level threads that execute within the process * ie virtual address mapping Light process state: * information relevant for a subset of user-level threads that are currently associated with a particular kernel level thread

Answer 23

* On thread creation, the library returns a thread id, which is not a direct pointer to the actual thread data structure * it's an index in a table of pointers * table pointers point to per thread data structure * The thread data structure contains a number of fields, execution context, registers, signal mask priority, stack pointer, thread local storage, stack * The size of the data structure is known up front at compile time so we can create these thread data structures, and layer them in a contiguous way, which can help us achieve locality, make it easier for the scheduler to find the next thread (it just has to multiply the thread integers with the size of the data structure)

Answer 24

If there were a problem with the thread, if the thread ID were a pointer, then the pointer would point to some corrupt memory. By having a thread ID index into a table entry, we can encode some info into the table entry that can provide meaningful feedback or an error message

Answer 25

This includes the variables that are defined in the thread functions that are known at compile time, so the compiler can allocate private storage on a per-thread basis for each of them

Answer 26

* stack growth can be dangerous * the thread library doesn't really control the stack growth, and the OS itself doesn't know that there are multiple user level threads. * it's possible that as the stack is growing, that one thread will end up overwriting the data structure of another thread. If that happens the error will only be detected when the offended thread is run, and tracking down the offending thread is difficult.

Answer 27

Create a red zone A red zone separates information about different threads. It is a portion of the virtual address space that is not allocated. If a thread is running and its stack is increasing, if it tries to write to an address that falls into the red zone region, the OS will cause a fault. This makes it much easier to debug what happened because the fault because it is directly caused by the thread that was executing.

Answer 28

* Process * Lightweight Process (LWP) * Kernel Level Threads * CPU

Answer 29

* list of kernel-level threads * virtual address space * user credentials * signal handlers

Answer 30

* user level registers * system call args * resource usage info * signal mask * similar to ULT, but visible to kernel * not needed when the process is not running

Answer 31

* kernel-level registers * stack pointer * scheduling info * pointers to associated LWP, processes, CPU structures * information needed even when process is not running =\> NOT SWAPPABLE

Answer 32

* current thread * list of kernel-level threads * dispatching & interrupt handling information

Answer 33

1:1 Threads + OS can see / understand sync, sked, etc - User must use kernel for all ops, expensive, not portable, policies may be limited M:1 Threads + Portable, does not depend on OS policies - If kernel thread is blocked, entire process is blocked. No OS insight into process MxN Threads + Can be the best of both worlds - process can have one or multiple kernel threads - unbound or bound - Requires coord btwn kernel and user

Answer 34

They introduced system calls and special signals to allow kernal and ULT library to interact and coordinate

Answer 35

pthread\_setconcurrency 0

Answer 36

* user level threads * available kernel level threads

Answer 37

* kernel level threads * CPUs * kernel level scheduler

Answer 38

A bound thread is when a user level library requests that one of its user level threads be bound to a kernel-level thread. if a kernel level thread is to be permanently associated with a CPU in a multi CPU system, that thread is "pinned".

Answer 39

Process jumps to UL library scheduler when: * ULTs explicitly yield * timer set by UL library expires * ULTs call library functions like lock/unlock * block threads become runnable it also runs on ULT operations and signals from timer or directly from the kernel

Answer 40

Send signal to other thread running on other CPU to run library code locally, which will indicate that the low priority thread will need to be stopped and the high priority thread will start running instead

Answer 41

If the critical section is short, it would be better to spin for a few cycles on the CPU and wait for T1 to release the mutex. If it takes less time for T1 to release the mutex, we're better off spinning than taking the thread and context switching it. Fo long critical section, use the default blocking behavior

Answer 42

* "n signals pending == 1 signal pending" : at least once * must be explicitly re-enabled

Answer 43

* "if n signals raised, then handler is called n times" *

Midterm review Flashcards

(74 cards)