Lesson 4--Datapaths Flashcards

1
Q

What is a datapath with respect to VLIW

A

the collection of its execution units, which performs data transformation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

In these two RSA, all operands reside in memory

A

CISC and DSP

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Where do operands reside in VLIW and RISC

A

Operands are in the registers before any work is done.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

True or False: 16 bit buses are much more efficient than 32 bit buses

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

ISA minimum requirements

A
  • -Minimum commands for accessing memory ­ load, store
  • -Minimum commands to perform arithmetic functions ­ subtraction
  • -Minimum commands for control functions ­ less than zero, equals zero, branch unconditional
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Choosing Operations tradeoffs

A
  • -Efficiency
  • -Performance
  • -Complexity
  • -Design
  • -Silicon cost
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Other architectures

A

SIMD
MIMD
microSIMD

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

True or False: VLIW is more efficient than microSIMD for a media processor

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

True or False: For the same number of addressible registers, microSIMD holds more multimedia data than superscalar or VLIW.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

True or False: microSIMD supports data parallelism, but at a much higher complexity of register ports

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Min instructions for accessing memory

A
  • -Load a value from mem into register

- -Store a value from a register into mem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Min instructions set to perform arithmetic functions of a processor:

A

Subtract

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Min instructions set to perform control functions of a processor:

A
  • -Result is zero
  • -Result is less than zero (no need for greater than zero)
  • -Branch unconditional (goto)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

VLIW datapath

A

Collection of its execution units, which performs data transformation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

CISC and DSP

A

All operands reside in memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

VLIW and RISC

A

Operands are in the registers before any work is done.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

True or False: 16 bit buses are much more efficient than 32 bit buses

A

TRUE

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Choosing Operation Repertoire tradeoffs

A
  • -Efficiency
  • -Performance
  • -Complexity
  • -Design
  • -Silicon cost
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Other architectures

A
  • -SIMD
  • -MIMD
  • -microSIMD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

True or False: VLIW is more efficient than microSIMD for a media processor

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

True or False: For the same number of addressible registers, microSIMD holds more multimedia data than superscalar or VLIW.

A

True

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

True or False: microSIMD supports data parallelism, but at a much higher complexity of register ports

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Min instructions for accessing memory

A
  • -Load a value from mem into register

- -Store a value from a register into mem

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Min instructions set to perform arithmetic functions of a processor:

A

Subtract

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Min instructions set to perform control functions of a processor:
- -Result is zero - -Result is less than zero (no need for greater than zero) - -Branch unconditional (goto)
26
What is the controller's role in the Datapath
The controller makes sure the data is latched properly into the buffer, coordinates ALU operations, IO devices, and Memory and checks for hazards.
27
CISC and DSP all the operands reside in memory. So there must be ______ to______operations and complex addressing modes.
Memory to memory
28
Accumulators are target registers of the ALU
This means the compiler is forced to make binding choices and optimizations too early to reduce the memory traffic.
29
True or false: VLIW and RISC use lost of registers
True
30
True or false: VLIW and RISC compiler does not need to be very aggressive with register allocation
False
31
True or false: VLIW and RISC compiler must decouple scheduling and register allocation. So first scheduling is performed, the register allocation is done.
True
32
Datapath Operations Cycles
The 32 bit processors take way more cycles than the 16 bit versions because the 32 bit operations break down the operations into 16 bit operations and use the carry bit to extend the size.
33
True or False: Datapath Operations Cycles that use 16 bit buses operate much less efficiently than 32 bit buses.
False
34
Datapath Width
The width of the datapath equals the width of the registers that hold int and float.
35
Datapath Width
The width of the datapath equals the width of the registers that hold int and float.
36
False
True or False: The initial versions for ARM supported floating point operations, was done through a software library.
37
True
True or False: The initial versions for ARM processors were extremely efficient, but the time for floating point operations was 100s of clock cycles
38
False, there was usually two
True or False: CISC and RISC usually have only one datapaths, each the width of the datapath.
39
Narrower
With CISC and RISC, the integer datapath is narrower or wider than the floating point one?
40
DSPs datapaths
These are likely to be 40 bits or 56 bits, these are ADC widths
41
8 - 32
VLIW Datapath Widths had __ to __ bit independent datapaths
42
True
True or False: VLIW Datapath Widths could be reconfigured to support floating point operations
43
Operation Repertoire
Choosing which operations to include in the ISA is difficult
44
Additional Operation Repertoire tradeoffs
--application analysis --execution frequency --implementation complexity all must be considered for this operation
45
The characteristics of application domain are:
- -Simple integer and compare operations for the basic units of execution of any program. - -datapaths are extremely important. - -In CISC Carry, Overflow, and other flags are set by the arithmetic operations
46
CISC
In CISC or RSIC: Overflow, and other flags are set by the arithmetic operations?
47
VLIW
With CISC or VLIW: Which one has more than one arithmetic operations occurring at the same time?
48
VEX
With VEX or VLIW: Where are flags stored in branch registers?
49
large and slow
Are Multipliers small and fast or large and slow?
50
smaller operations
Are VEX the multiplication is broken down into larger or smaller operations
51
upper and lower
Integer Multiplication: The 32 bit multiplication is divided into two 16 bit multiplications? upper and lower or upper only or lower only?
52
NOP and multiplication
Integer Multiplication: There is a NOP or Branch inserted to allow for the delay created with the multiplication or division?
53
Fixed Point Multiplication
Most embedded systems need short fixedpoint to represent important data types
54
In VEX there are 3 multiplication forms
In ____ there are 3 multiplication forms�low 16 * low 16, low 16 * high 16, high 16 * high 16 (number of bits)
55
T/F: Higher precision fixedpoint multiplication is cheap
False, it is expensive
56
True
True or False: Interger division is more expensive than multiplication?
57
Integer Division is more complex
With this type math, it is more complex because the answer may be an integer or a FP value
58
VEX
In ___, divs instruction is provided for basic component for an integer division.
59
35 cycles
Nonrestoring 32bit division can take __ cycles
60
Shorter and Constant
The compiler may optimize shorter or loger divisions or divisions by constants or varibles?
61
code size
Division is rarely critical and many systems favor code size or code quality to hardware design.
62
Saturated Arithmetic
This arithmetic occurs when you try to exceed the precision that is allowed for the implementation
63
It becomes an an overflow ; the result is 0X00000000
What is the result if we add 1 to a a 32 bit int 0XFFFFFFFF wrap around?
64
No
Are embedded domains overflows acceptable?
65
What is used instead of embedded domains overflows?
Saturated arithmetic is used. | Example: 60 + 43 ? 100. (not the expected 103.)
66
In SIMD or VLIW; instruction sets, the same instruction works on a large quantity of data?
SIMD
67
In SIMD or VLIW; instruction sets, each instruction works on only one data set?
VLIW
68
In SIMD or VLIW; can be multiple instructions and multiple data sets?
VLIW
69
MicroSIMD Parallel Subword Architecture
With this Architecture, it has 64 bit FU can process words as 1 64 bit, 2 32 bit, or 4 16 bit
70
microSIMD Parallel Subword Architecture
With this Architecture, data can be compacted and be fit into a long 64 bit word
71
True or False: With microSIMD, data can be compacted and be fit into a long 64 bit word.
True
72
It speeds up processing and reduces data size requirements
With microSIMD, it speeds up processing and reduces data size requirements or does it reduce processing and speed up data size?
73
MIMD
MIMD or SIMD has 4 instructions on 4 cores?
74
SIMD
MIMD or SIMD operates on four different data items with one instruction?
75
Superscalar
Superscalar or SIMD Operates on 4 instructions, four cores?
76
VLIW
VLIW or Superscalar has one instruction, four sub operations?
77
False; x86 are popular
True or False: MicroSIMD Operations are not popular
78
MicroSIMD
In embedded systems MicroSIMD or VLIW manipulates subwords
79
8 bits, 16 bits, 32 bits, 64 bits
MicroSIMD subwords are configurable bits of ___bits, ___bits, ___bits, ___bits
80
False; They need small precision qualities
True or False: Multimedia applications usually need large precision quantities
81
PADD4
PADD4 or 4PADD breaks down larger words into sub words. Then adds 4 sub words together. Then operate on the four sub words.
82
False, they have difficulties.
True or False: In practice, microSIMD operations have no difficulties.
83
microSIMD operations difficulties
- -Alignment issues are a problem when breaking into subwords. - -Structures that contain subword elements rarely align cleanly to word boundaries. - -Unoptimized pre/post loop codes are needed. - -Precision Issues there may need to be a few extra bits for holding intermediate stages of an algorithm
84
False, they have to respect control flow
True or false: MicroSIMD operations does not need to respect the control flow.
85
MicroSIMD
MicroSIMD or VLIW must mimic existing branches
86
Two operations control flow in MicroSIMD
- -it must mimic existing branches extensions | - -include partial predication
87
PCMPGT4
PCMPGT4 or PADD4; does the compare at a boundary. Then take the subwords on the integer boundaries do the add and subtraction. The PSELECT then chose the branch to take.
88
SIMD
The sequential program can be parallelized using VEX or SIMD?
89
MicroSIMD
MicroSIMD or VLIW: The data is divided into subelements and then stored in a register. The entire register is operated on at the same time, leading to 4 data points being operated on at the same time. Reducing the processing time of the operation.
90
MicroSIMD
MicroSIMD or VLIW: Can achieve impressive results with a minimal hardware complexity
91
False
True or False: complete set of microSIM extensions do not costs too much
92
True
True or False: automatic extrications microSIMD w/o hints by the compiler is still unproven.
93
False
With Manual code resturcturing is no longer needed to exploit micro SIMDparallelism
94
True
True or False: microSIMD can pack more data than VLIW
95
True
True or False: microSIMD holds more multimedia data then superscalar or VLIW architectures
96
VLIW
The complexity of register ports is higher in VLIW or microSIMD?
97
True or False: microSIMD is able to support a large number of operands.
True
98
The number of register files is much less or more for microSIMD.
Less
99
True or False: microSIMD is more powerful than MIMD, SIMD, VLIW
True
100
True
True or False: VLIW can do four different kinds of instructions, while SIMD cannot.
101
VLIW
VLIW or VEX is better for more general parallelism.
102
Constants
- -Specifically the immediate operands and literals. | - -Are known at compile time, others at load time.
103
Short or Long immediate constants tend to be used in addressing modes and fit in a single encoded operation
Short
104
Long
Short or Long immediate constants can be the width of the datapath.
105
2
There are 2, 8 or 64 methods for long immediates
106
Two methods for long immediates
- -partial immediate load - -memory allocated immediate in this case the compiler allocates long immediate in memory and emits an instruction that loads the immediate. (emits?)
107
True or False: With Constants Branch offsets are not immediate
False; They are immediate
108
Constants
MIPS or VLIW have a jump instruction has 26 bit wide offsets.
109
MIPS
PC is word aligned in MIPS or VLIW, so you can jump 28 bits.
110
T/F: Most embedded processes are 32-bit ISAs include a branch offset large enough to cover local branches
True
111
What are the special needs of datapaths to be supported in embedded computing?
find out.
112
What are the minimum instructions for accessing memory?
load value from memory to register | store a value in a register to memory
113
What the minimum instructions to perform the arithmetic functions of a processor?
subtract
114
What is the minimum instruction set to perform the control functions of a processor?
result is zero result is less than zero branch conditional