Common network architectures Flashcards
(10 cards)
Inception: State the multiplication cost for one convolution path and why bottlenecks help.
Mults = H·W·C·K²·F (output map H×W, C input channels, K×K kernel, F filters). 1×1 bottlenecks shrink C before the larger kernels, slashing FLOPs.
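A quick worked comparison of the formula, with hypothetical Inception-style sizes chosen for illustration:

```python
# Multiplications for one conv: H * W * C * K^2 * F (output map H x W).
def conv_mults(h, w, c, k, f):
    return h * w * c * k * k * f

# Assumed sizes: 28x28 feature map, 192 input channels, 32 output filters.
direct = conv_mults(28, 28, 192, 5, 32)         # 5x5 conv straight on 192 channels
bottleneck = (conv_mults(28, 28, 192, 1, 16)    # 1x1 squeezes 192 -> 16 channels
              + conv_mults(28, 28, 16, 5, 32))  # 5x5 conv on only 16 channels
print(direct, bottleneck)   # ~120M vs ~12M multiplications: roughly 10x cheaper
```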
ResNet: Write the residual output equation and its purpose.
y = F(x) + x. The block only needs to learn the residual F(x); the identity mapping is trivially available (F ≈ 0), so very deep nets avoid degradation.
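A minimal PyTorch sketch of a residual block (layer choices are illustrative, not the exact configuration from the ResNet paper):

```python
import torch.nn as nn
import torch.nn.functional as F

class ResidualBlock(nn.Module):
    def __init__(self, channels):
        super().__init__()
        self.conv1 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn1 = nn.BatchNorm2d(channels)
        self.conv2 = nn.Conv2d(channels, channels, 3, padding=1, bias=False)
        self.bn2 = nn.BatchNorm2d(channels)

    def forward(self, x):
        out = F.relu(self.bn1(self.conv1(x)))
        out = self.bn2(self.conv2(out))
        return F.relu(out + x)   # y = F(x) + x: residual plus identity skip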
Precision formula and plain‑language meaning.
Precision = TP / (TP + FP): fraction of predicted boxes that are correct.
Recall formula and plain‑language meaning.
Recall = TP / (TP + FN): proportion of ground‑truth objects the detector finds.
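Both formulas in one small sketch (the counts are made up for illustration):

```python
def precision(tp, fp):
    return tp / (tp + fp)   # of everything predicted, how much was right

def recall(tp, fn):
    return tp / (tp + fn)   # of everything real, how much was found

# Hypothetical detector run: 8 correct boxes, 2 false alarms, 4 missed objects.
print(precision(8, 2))  # 0.8
print(recall(8, 4))     # 8/12 = 0.667
```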
Difference between precision and recall in one sentence.
Precision asks ‘How many predictions are right?’; recall asks ‘How many real objects did I catch?’
Why use 1×1 convs before 3×3/5×5 in Inception modules?
They reduce channel depth, lowering compute while keeping representational power.
How do residual skips help gradient flow?
They create a direct path so gradients bypass deep stacks, making optimisation easier for very deep networks.
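The chain rule makes this concrete: with y = F(x) + x, ∂y/∂x = ∂F/∂x + I, so the gradient always contains an identity term. A tiny autograd check with a deliberately near-dead toy branch (assumed for illustration):

```python
import torch

x = torch.randn(4, requires_grad=True)
residual_branch = lambda t: 1e-6 * t.pow(2)   # branch gradient is ~0

y = (residual_branch(x) + x).sum()
y.backward()
print(x.grad)  # ~1.0 everywhere: the skip connection alone carries the gradient
```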
Give the IoU formula and common TP threshold.
IoU = intersection area / union area; a detection counts as a TP when IoU ≥ 0.5 (the most common threshold).
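IoU for axis-aligned boxes as a short sketch (corner format (x1, y1, x2, y2) assumed):

```python
def iou(box_a, box_b):
    """Boxes as (x1, y1, x2, y2). Returns intersection-over-union."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter)

print(iou((0, 0, 10, 10), (5, 5, 15, 15)))  # 25/175 = 0.143 -> not a TP at 0.5
```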
Define AP and mAP briefly.
AP = area under precision‑recall curve for one class; mAP = mean AP over all classes.
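A simplified AP sketch for one class, ignoring the interpolation details that benchmarks like PASCAL VOC and COCO layer on top:

```python
import numpy as np

def average_precision(tp_flags, num_gt):
    # Detections pre-sorted by confidence, descending.
    # tp_flags[i] = 1 if detection i matched a ground truth (IoU >= 0.5).
    tp_flags = np.asarray(tp_flags, dtype=float)
    tp = np.cumsum(tp_flags)
    fp = np.cumsum(1.0 - tp_flags)
    recall = tp / num_gt
    precision = tp / (tp + fp)
    # Area under the PR curve: precision summed at each recall step.
    steps = np.diff(np.concatenate(([0.0], recall)))
    return float(np.sum(steps * precision))

print(average_precision([1, 1, 0, 1, 0], num_gt=4))  # 0.6875
```

mAP is then just the plain mean of these per-class AP values.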
Conv output size formula (no padding) and example for 28×28, F=3, S=2.
H_out = ⌊(H_in−F)/S⌋+1 ⇒ ⌊(28−3)/2⌋+1 = 13.
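The same arithmetic as a one-liner, using Python's floor division:

```python
def conv_out(h_in, f, s):
    return (h_in - f) // s + 1   # no padding: floor((H_in - F) / S) + 1

print(conv_out(28, 3, 2))  # 13, matching the worked example above
```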