Memory and Cache Flashcards

Question 1

Q

What is cache?

Answer

A

A small amount of fast memory that holds copies of data recently fetched from or written to main memory.

Question 2

Q

Why is the memory hierarchy useful?

Answer

A

It provides a balance between capacity, cost, and access time.

Question 3

Q

What is spatial locality?

Answer

A

Data located near recently accessed data in memory is likely to be used soon.

Question 4

Q

What is temporal locality?

Answer

A

Recently accessed data is likely to be re-used within a relatively short amount of time.

Question 5

Q

How is spatial locality exploited?

Answer

A

Data is transferred from main memory to cache in fixed-size blocks called cache lines. When adjacent memory locations are addressed, they are likely already in cache.
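
As a rough illustration of why cache lines reward adjacent accesses, the sketch below sums the same matrix in two loop orders. It assumes a row-major (C layout) array; the function names are hypothetical and chosen only for this example.

```c
#include <stddef.h>

/* Sum an n-by-n matrix stored in row-major order.
   The inner loop walks consecutive addresses, so every element of a
   cache line fetched from main memory is used before moving on. */
double sum_row_order(const double *a, size_t n)
{
    double total = 0.0;
    for (size_t i = 0; i < n; i++)
        for (size_t j = 0; j < n; j++)
            total += a[i * n + j];   /* stride of 1 element */
    return total;
}

/* Same sum with the loops swapped. The inner loop now jumps n elements
   per step, so for large n successive accesses tend to land on
   different cache lines and spatial locality is lost. */
double sum_column_order(const double *a, size_t n)
{
    double total = 0.0;
    for (size_t j = 0; j < n; j++)
        for (size_t i = 0; i < n; i++)
            total += a[i * n + j];   /* stride of n elements */
    return total;
}
```

Both functions return the same value; only the order of memory accesses differs, which is where any timing difference on large matrices would come from.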

Question 6

Q

What is a cache hit?

Answer

A

When the data being read is already in cache and doesn’t have to be loaded from main memory.

Question 7

Q

What is a cache miss?

Answer

A

When the next piece of data is not in cache and needs to be loaded from main memory.

Question 8

Q

Why is cache useful for speeding up finite difference calculations?

Answer

A

Finite differences in the j direction access memory sequentially, resulting in spatial locality.
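
A minimal sketch of such a difference, assuming a row-major grid in which j is the fastest-varying index (in a column-major language such as Fortran the roles of i and j would swap). The function name `diff_j` and the forward-difference stencil are illustrative choices, not taken from the cards.

```c
#include <stddef.h>

/* Forward difference in the j direction on an ni-by-nj row-major grid:
   du[i][j] = (u[i][j+1] - u[i][j]) / h  for j < nj - 1.
   The last column of du is left untouched. */
void diff_j(const double *u, double *du, size_t ni, size_t nj, double h)
{
    for (size_t i = 0; i < ni; i++)
        for (size_t j = 0; j + 1 < nj; j++)
            /* u[i][j] and u[i][j+1] are neighbours in memory, so they
               usually sit in the same cache line. */
            du[i * nj + j] = (u[i * nj + j + 1] - u[i * nj + j]) / h;
}
```

A difference in the i direction would instead read elements nj apart, which is why it tends to benefit less from cache lines.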

Question 9

Q

What is cache blocking?

Answer

A

Splitting the work into cache-sized blocks and working on one block at a time before moving on to the next. This allows all of the data required for a block to be held in cache, speeding up access.
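
One common place this idea is applied is a matrix transpose, sketched below. The tile size `BLOCK` is a hypothetical value picked for illustration; a suitable size depends on the cache of the machine in question.

```c
#include <stddef.h>

/* Hypothetical tile edge: two 32x32 tiles of doubles take 16 KiB,
   which is assumed here to fit in a first-level cache. */
#define BLOCK 32

/* Transpose an n-by-n row-major matrix one tile at a time, so the
   source tile and destination tile stay in cache while in use. */
void transpose_blocked(const double *in, double *out, size_t n)
{
    for (size_t ii = 0; ii < n; ii += BLOCK)
        for (size_t jj = 0; jj < n; jj += BLOCK) {
            /* Clamp the tile at the matrix edge when n is not a
               multiple of BLOCK. */
            size_t i_end = ii + BLOCK < n ? ii + BLOCK : n;
            size_t j_end = jj + BLOCK < n ? jj + BLOCK : n;
            for (size_t i = ii; i < i_end; i++)
                for (size_t j = jj; j < j_end; j++)
                    out[j * n + i] = in[i * n + j];
        }
}
```

Without blocking, either the reads or the writes of a transpose have a stride of n elements; working tile by tile keeps both within a small region of memory.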

Question 10

Q

How does test case size affect testing?

Answer

A

Usually, working with smaller test cases allows tests to run in a manageable time. However, reducing the problem size can sometimes change performance characteristics, such as when a significant data structure can or can’t fit in cache.

Question 11

Q

What is compiler optimisation?

Answer

A

The process of the compiler making adjustments to the code to produce equivalent results faster. This includes techniques such as loop interchange and cache blocking.

Question 12

Q

How are compiler optimisations turned on?

Answer

A

By passing an optimisation flag such as -O3 on the gcc command line.

Question 13

Q

What is arithmetic intensity?

Answer

A

The ratio of floating point operations performed to the amount of data moved to and from memory, usually expressed in FLOPs per byte.
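
As a worked example, consider the loop body y[i] = a * x[i] + y[i] in double precision. Assuming x and y are streamed from main memory with no cache reuse, and ignoring any extra write-allocate traffic, each iteration performs 2 floating point operations (one multiply, one add) and moves 3 values of 8 bytes each (load x[i], load y[i], store y[i]):

```latex
I = \frac{\text{floating point operations}}{\text{bytes moved}}
  = \frac{2\ \text{FLOPs}}{3 \times 8\ \text{bytes}}
  = \frac{1}{12}\ \text{FLOP/byte}
  \approx 0.083\ \text{FLOP/byte}
```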

Question 14

Q

What does the roofline model tell us?

Answer

A

The roofline model tells us about floating point performance based on peak performance, memory bandwidth and arithmetic intensity.
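
In its basic form the model takes the attainable performance to be the smaller of the peak floating point rate and the product of memory bandwidth and arithmetic intensity. The sketch below encodes just that relation; the function name and the choice of units (GFLOP/s, GB/s, FLOP/byte) are assumptions made for this example.

```c
/* Basic roofline estimate: attainable performance in GFLOP/s is
   min(peak, bandwidth * intensity). */
double roofline_gflops(double peak_gflops, double bandwidth_gbs,
                       double intensity_flop_per_byte)
{
    /* Performance the memory system can sustain at this intensity. */
    double memory_roof = bandwidth_gbs * intensity_flop_per_byte;
    return memory_roof < peak_gflops ? memory_roof : peak_gflops;
}
```

For a hypothetical machine with a 1000 GFLOP/s peak and 100 GB/s bandwidth, `roofline_gflops(1000.0, 100.0, 0.25)` gives 25 GFLOP/s (the memory roof), while an intensity of 16 FLOP/byte gives the 1000 GFLOP/s peak (the compute roof).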

Question 15

Q

What are the two types of performance bound?

Answer

A

Memory Bound and Compute Bound

Question 16

Q

What is a memory bound algorithm?

Answer

A

An algorithm with lower arithmetic intensity that is limited by memory bandwidth.

Question 17

Q

What is a compute bound algorithm?

Answer

A

An algorithm with higher arithmetic intensity that can make more efficient use of floating point hardware and is limited by floating point performance.

Question 18

Q

What is NUMA?

Answer

A

NUMA (non-uniform memory access) is the phenomenon that memory at various points in the address space of a processor has different performance characteristics.

Question 19

Q

How does NUMA affect compute nodes?

Answer

A

Compute nodes commonly have two sockets, each with its own memory controller, so accessing memory attached to the other socket's controller is slower than accessing memory on the local controller.

Question 20

Q

What is first touch memory allocation?

Answer

A

Physical memory is assigned the first time it is accessed (touched) rather than when it is allocated. Each memory page is placed on the memory controller local to the core that first touches it.

Question 21

Q

What effect does NUMA have on OpenMP?

Answer

A

If an array is first used in a serial (non-parallel) loop, all of its memory is touched by one thread and so placed on one memory controller, slowing down later parallel loops whose threads run on the other socket.
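
A usual way to avoid this is to perform the first write to the array inside a parallel loop that uses the same schedule as the later compute loops. The sketch below assumes a first-touch page placement policy and that threads stay on the cores they start on (for instance with thread binding requested through the OMP_PROC_BIND environment variable); the function names are hypothetical.

```c
#include <stddef.h>
#include <stdlib.h>

/* Allocate n doubles and initialise them in parallel so that, under a
   first-touch policy, each page is placed near the thread that first
   writes to it. Returns NULL if the allocation fails. */
double *alloc_first_touch(size_t n)
{
    double *a = malloc(n * sizeof *a);
    if (a == NULL)
        return NULL;

    #pragma omp parallel for schedule(static)
    for (long i = 0; i < (long)n; i++)
        a[i] = 0.0;   /* first touch happens here, spread over threads */
    return a;
}

/* A later loop with the same static schedule, so each thread mostly
   reads the pages it touched first. */
double sum_parallel(const double *a, size_t n)
{
    double total = 0.0;
    #pragma omp parallel for schedule(static) reduction(+:total)
    for (long i = 0; i < (long)n; i++)
        total += a[i];
    return total;
}
```

With gcc this would be built with the -fopenmp flag; without it the pragmas are ignored and the code runs serially with the same results.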
