Security Flashcards

Question

What does the overall security of the software rely on?

Answer 1

The security of the Secure monitor code, along with the secure boot code. If there is a bug there, we can gain access to everything, because the NS bit does not mean anything here.

Answer 2

Secure boot Accessing hardware features - user applications must go through the secure world to access these - crypto engine - credential storage(key store) - true random number generator) Digital rights management Protecting/monitoring the normal world - real time kernel protection - periodic kernel measurement (go in and check if any modifications has been done to the kernel code)

Answer 3

The separation between non-secure and secure state Non secure: Rich OS (linux) Secure: Secure app/libs, -OS and -monitor

Answer 4

Separates the secure and non-secure world

Answer 5

Request from CPU -> SAU The SAU either accesses the non-secure MPU or secure MPU (memory protection unit). Which MPU is accessed depends on the NS bit

Answer 6

Memory fault

Answer 7

The NON-secure has some regions (memory addresses) that cannot be accessed, whereas these would be accessible in the secure state.

Answer 8

Code executed from a non-secure region can only access non-secure regions, whereas code executed from a secure region can access memory in both regions.

Answer 9

Can use function calls to access resources/devices in the non-secure state, e.g. I/O driver

Answer 10

Because this too must be done in a secure manner.

Answer 11

By calling a non-secure function (BLXNS): The secure world is aware of what functions exist in the non-secure world. Can simply just access these. Returning from an entry function (BXNS): The non-secure world has called into the secure world, and finished what it needed to do there, it can return from this call. In this case too, the secure state need to know where to return to.

Answer 12

A branch to a secure gateway (BL to SG): Not allowed to call/jump into whatever region in the secure region we want. To limit this, a specific instruction is used. This instruction jumps into known places in the secure world. Branch to the reserved value FNC_RETURN (BX to FNC): The secure state has called a function in the non-secure state, when this function completes it wants to return to the secure state. The non-secure state executes a function return. The function return does not have a specific address, but this address is set when the insecure function is called. Store the return address in the secure world and this can't be modified by the insecure state. This allows the secure world to have the full control of where it again will be entered, when the non-secure function returns.

Answer 13

A occurrence of the secure gateway instruction (SG) in a special type of secure region, named non-secure callable region (NSC)

Answer 14

Secure gateway instruction

Answer 15

A special type of secure region, named a non-secure callable region. In the secure boot when address spaces are set up, one of these are the NSC. In this region, the only thing we have is the secure gateway (SG) instructions, and the address into the secure world.

Answer 16

Causes the HW to switch to secure state., read an address from the top of the secure stack, and branch to that address

Answer 17

Import the "Secure gateway import library", this contains the addresses of the secure gateways. - Defines symbols for all the secure gateways A toolchain must support generating a secure gateway veneer for each entry function

Answer 18

It contains a vector of secure gateway veneers. The veneers contain the secure gateway instruction that can call specific entry functions in the secure regions. Each gateway has a specific address to call into specific entry functions.

Answer 19

Entry functions Secure code (internal functions) Secure data (Stack, heap, global data)

Answer 20

Memory Protection Unit A simpler version of the MMU (Memory management unit). MPU handles more corse grained partitioning, and MMU enables virtual memory and page tables. MPUs are normal in micro controllers where a complex OS and memory management are not needed, though the properties of the MPU and MMU are similar.

Answer 21

Programmable unit Allows privileged software to define memory access permissions for different memory regions. Monitors transactions, including fetches and data accesses from the processor. Checks if an access is allowed or not.

Answer 22

Supports a configurable number of programmable regions. Typical implementation supports 0-8 regions per security state. Smallest size programmed for an MPU region is 32 bytes. Maximum is 4GiB, must be a multiple of 32 bytes. All regions must begin on a 32-byte aligned address. Regions have independent rd/wr access permissions for privileged and unprivileged code.

Answer 23

To have one set of MPU configuration registers for the secure world and another set for the non-secure world.

Answer 24

Normal memory Device memory

Answer 25

Instructions and data

Answer 26

Used to access peripheral registers and Memory Mapped I/O (MMIO)

Answer 27

Cacheability Shareability (for data and instructions) eXecute Never (XN)

Answer 28

Shareability (for data and instructions): - Non-shareable: no one else uses it, memory accesses don't need to be synchronised with other processors. Relates to interconnects, and having clusters of CPUs - Inner shareable: Data/instructions are shared within the cluster. The shareability domain can contain multiple masters, but not necessarily all the agents in a system. - Outer shareable: Share between multiple clusters. This knowledge helps with optimisation done for cache coherency, for example. If data is non-shareable, we don't need to worry about coherency, and if it is inner we only need to have coherency within a cluster. An operation that affects an outer shareable domain also implicitly affects all inner shareable domains inside it.

Answer 29

Cacheability: - Cache policy (write-through, write-back) - Allocation (if miss, should allocate a new line?) - Transient Hint (Hint regarding temporal locality)

Answer 30

Separates between instructions and data. Marks data non-executable. Make sure an attacker cannot write code into data memory, which are actual instructions. And then make the CPU jump to this data which would make it execute it.

Answer 31

Must be used for memory regions that cover peripheral control registers. For example, if control registers was cached, we could never detect changes in the registers themselves. If we wanted to write to it, we would not write to the device itself, but the cache. This shows that some of the optimisations that are permitted for normal memory would not be safe for peripheral registers.

Answer 32

G or nG (Gathering or non-Gathering): Multiple accesses to a device can be merged into a single transaction. This is likely to be done if it is part of the data transfer, but not if it is part of the control. R or nG (Reordering or Non-reordering) E or nE (Early write Acknowledge - similar to bufferable): Can you tell a CPU immediately that a request has been done, and that it can continue. Or do you need to go all the way to a device to make sure the request has taken affect, and that the device is in the correct state before we continue.

Answer 33

Spectre & Meltdown Spectre attacks exploit core features of modern architectures. They are special because there weren't any immediate ways of mitigating these, as they exploit the design itself.

Answer 34

Speculation Speculation is heavily relied on in modern architectures to hide latency Types: - Control flow speculation - Memory disambiguation speculation - Exception speculation

Answer 35

branch resolution (taken/non-taken) BTB targets - where to branch to

Answer 36

This is when we assume that stores and loads do not alias. Have a younger load, issue it by assuming it is not dependent on an earlier store that has an unresolved address. If the store turns out to have a matching address with the load, the load will have gotten the wrong stale data from the cache and has propagated it through the system. This must then be squashed.

Answer 37

This is when we assume that instruction will not normally cause and exception. E.g. floating-point operations will cause an exception when dividing by 0. When they do, we need to roll-back and handle the exception before continuing.

Answer 38

Speculation can set the processor in an illegal state, in which secrets can leaks, or normally inaccessible data can be retrieved. This state is always squashed before it becomes architecturally visible. This means that the execution will always be EVENTUALLY correct, but there can be mistakes during execution. The exploitation needs to find a way to access this illegal data before it gets squashed. This can be done using "transmitters"

Answer 39

Transmitters are certain operations within the processor, that affects the micro-architectural state. They are called transmitters because they in some ways transmit information about the data that is used for them. The problem is, that certain aspects of micro-architectures are observable for the architectural states. It is possible to explicitly address certain operations in such a way as to retrieve information about the micro-architectural state. If we use a transmitter operation, with illegal data, we can get that illegal data back afterwards, after it has been squash. Resulting in the illegal data now being visible in the architectural state. Through select execution dependent on data, we can change the micro-architectural state to hold data (best way of doing this is by modifying caches). This data can be brought back into the architectural state afterwards.

Answer 40

The state that is part of the system, but not explicitly defined by the ISA. Not well-defined A matter of implementation There is no security guarantees for these states.

Answer 41

State explicitly defined by the ISA

Answer 42

- a load only defines that data must be retrieved from memory according to certain semantic, it does not say anything about the timing. - depending on the address of the load, we can have different latencies (L1, L2, L3, memory) - the timing is a part of the micro-architectural state - the implementation of caches, what they store and don't, how they prefetch, are all micro-architectural state.

Answer 43

Assumptions: - Have a sequence of data, where each data occupy a different cache line - All our data lives in an inclusive L2 cache (meaning all lines in L1 is also in L2). Firstly we flush all data elements in L1, leaving an empty L1 cache. Then, we access a single data element. We know that the secret is a value between 1 and 20. The base address, is the first array element that is stored in the L2 cache. So, we fetch the element at *(base + secret). If secret=3 we access the 4th element of the data-array in the L2 cache, and bring it to the L1 cache. The access time of L1 is 5 cycles and 15 for L2. This is an observable difference when using timers in the system. What we do, is time all possible cache lines that the secret can be in. Every cache line will take 15 cycles, except from the one containing the secret, as we have brought this up to the L1 cache. Since we know that the access pattern is (base + secret), we can, by looking at the access-time-graph, deduce that the secret=3.

Answer 44

0: Mistrain a branch predictor, and clear a selection of cache (make sure all data elements is in the same level of cache) 1: Acquire illegal data (secrets) through bypassing bounds checking after mistraining (accessing data elements outside the bounds of our array) 2: Transmit the data to the architectural state (in to the cache), by using the secret as an address 3: Wait for squash, and then time cache lines to retrieve secret

Answer 45

In contrast to v1, instead of you being the only process running, the spectre v2 has a victim process. Step 0: Locate a gadget, which is a series of operations associated with a function, that inadvertently transmit some data Step 1: The attacker then figures out some place where there is a branch in the victims code, then takes that branch to figure out what its index into the Branch Target Buffer is. Then, we retrain the BTB to index into the gadget instead of the actual target. Step 2: Now we let the victim program execute. This will now speculate using the BTB and jump to gadget where it will transmit secret. Step 3: As the attacker will begin by flushing out the cache, they can now time the cache line to retrieve the secret. In this case, we are looking for the access with the longest access time. The reason for this is that we used to have all the cache lines in a shared cache. The victim process then hoisted the cache line into their core, making it take a longer time to access.

Answer 46

More an exploitation of an implementation error, than exploiting speculation within the core. Step 0: Set up a long chain of instructions, and clear possible cache targets Step 1: Access kernel data. This will trigger an exception, but the exception handling is delayed (need to reach the head of the ROB). You are still given the data, though. A user process should not be able to access the data that the kernel uses, but in some systems this data is mapped into the user space so that when system calls are made, you don't have to map the data in externally. This gives a better response time from the kernel. Step 2: The kernel data is then transmitted using the secret as an address. Index into it based on some value based out of possible cache targets. Step 3: Time cache lines to find the fastes line, and retrieve secret.

Answer 47

If you can detect that we have an exception, you should give back a dud value instead of the real data. Doing this is a suggested way of mitigating this problem.

Answer 48

1: Making execution invisible 2: Delaying dangerous operations 3: Cleaning up micro-architecture

Answer 49

Don't show what you are doing For example, in regards to caches, cache lines don't move from L2 to L1 and seem to be invisible (not modified at all), until speculation is done. Caches are the most prevalent source of transmitting, but not the only one. The idea is to "hide" changes to these various side-channels until they are non-speculative. Con: - requires extra storage - the work must be placed somewhere - Need to make changes to coherence and consistency models to account for this - Requires you to think of every possible visible angle and their complications, changes done to achieve this can for example move the problem such that secrets are transmitted through different components in another part of the micro architecture.

Answer 50

Define a category of dangerous operations, and delay these while we do useful work elsewhere. Either restrict the propagation of secrets (NDA), or delay transmitters that depend on secrets (DoM, STT) NDA: Does not let potential secrets propagate through the system, won't give anyone the data until we know it's safe to do so. This is generally safe, but loses out on some potential performance The next two both prevent execution of certain operations, that they believe will create an observable difference. Delaying transmitters requires knowing all transmitters ahead of time, this can be really hard as new attacks are discovered. Delay-on-Miss: Does this on speculation on operations it knows will modify the L1/L2 cache state. Tracks general speculation. Speculative Taint Tracking: Does this on data it knows comes from a potential illegal access. Need to track dangerous data, and by this also track general speculation.

Answer 51

CleanupSpec Need to track everything that was done during speculation, and be able to revert back to previous state. This can be very expensive, as we need to keep track of every attack vector. After mispredict, restore state to how it was before speculation started. Some of this can be handled by only updating state on commit (same as invisible), but some of it must be tracked. Requires knowing all possible states that can be altered during speculation and used as transmitter, for the cleanup system to be efficient.

Security Flashcards

(76 cards)