Dependability Flashcards
(17 cards)
Dependability 4 Means
- Fault Prevention
- Fault Tolerance
- Fault Removal
- Fault Forecasting
Dependability Attributes
Availability
Reliability
Safety
Confidentiality
Integrity
Maintainability
Dependability Impairments
Faults
Errors
Failures
Fault Types
transient fault
Permanent faults : remain int he system until they are repaired. broken wire or a software design error
Intermittent faults: are transient faults that occur from time to time. hardware component that is heat sensitive, it works for a time, stops working, cools down and then starts to work again, corrosion on connectors
What are Errors
- wrong state in system+ / subsystem (wrong data element in memory register)
- error activated, when computation accesses the faulty data
- two types of software errors 1. Bohrbugs - can be reproduced by providing a specific input sequence, value domain
- Heisenbugs - can only be observed when input data and timing in relation to all other timing activities can be exactly reproduced
Failure Modes
Constaint failure
Value failure
Fail silent
Fail sotp
Fail controlled
2 Stages for fault Prevention
Fault avoidance and fault removal
Fault tolerance Levels
- Full Fault Tolerance - the system continues to operate in the presence of faults, albeit for a limited period, with no significant loss of functionality or performance
- Graceful Degradation - the system continues to operate in the presence of errors, accepting a partial degradation of functionality or performance during recovery or repair
- Fail safe - the system maintains its integrity while accepting a temporary halt in its operation
How to detect fault (fault tolerance)
- requires fault detection and prior specification of intended behavior
- temporal failures should be decected by communication system
- logical failures have to be detected by the receiver of a message
Never give up NGU strategy
stategy to deal with rare faults outside the fault hypothesis
Differences between Fault Prevention and Fault Tolerance
Nessessary conditions for the mutual exclusion
- Only one process is allowed to enter the critical section
- No assumption can be made about the speed and the amount of processors
- No process is allowed to block another process outside the critical section
- every process has only to wait a finite time to enter the critical section
What are the 3 scheduling problems?
Race conditions
Starvation
Priority inversion
4 Resource access protocols
- Non-Preemptive Protocol
- Highest Locker Priority
- Priority Inheritance Protocol
- Priority Ceiling Protocol
[Priority inheritance protocol] direct blocking
occurs when higher priority processes tries to acquire a resource held by a lower priority process
[Priority inheritance protocol] push-through blocking
occurs when medium priority process is blocked by a lower priority process which inherited a higher priority from a process it directly blocks
In this context, briefly explain why a fault does not always lead to an error. Give an example.
(ii) Briefly describe the difference between ’fault prevention’ and ’fault tolerance’.
A fault wrong state in system / subsystem (wrong data element in memory, register. Error is activated, when computation accesses the faulty data (this can be quite a bit later). If the faulty memory location is not accessed, this may not lead to an error.
(ii)
Fault prevention attempts to eliminate any possibility of faults creeping into a system before it goes operational.
Fault tolerance enables a system to continue functioning even in the presence of faults