CNN Flashcards

Question 1

Q

CV problems

Answer

A

classifications, object detection, neural style transfer

Question 2

Q

how to detect edges? (vertical, horizontal, dia)

Answer

A

use filter that contains 1s and 0s like
1 0 -1
1 0 -1
1 0 -1
to detect vertical edges
the values for pixels at the ver edge will be v large/small (lighter/darker color) –> diff from other pixels
similar for horizontal

Question 3

Q

what is sobel or sehorr filter in edge detection?

Answer

A

1 0 -1
2 0 -2
1 0 -1
focus on the center pix
sehorr
3 0 -3
10 0 -10
3 0 -3

Question 4

Q

what is padding

Answer

A

To prevent losing information at the edges of the image (convolution shrinks mattrix), add 0s around the orignal matrix to pad
p can be 0 1 2…

Question 5

Q

what the result dimension when apply a fxf filter on nxn mat?

Answer

A

n-f+1xn-f+1

Question 6

Q

type of padding?

Answer

A

valid: no padding
same: output size same as input size
calculated based on input and filter size (f is usually odd)

Question 7

Q

what is strided conv?

Answer

A

moving filter s steps
s can be 1 2 3…

Question 8

Q

formula to cal output dimension

Answer

A

(n+2p-f)/s+1 round down

Question 9

Q

how to do cross correlation (deconvolution)

Answer

A

rotate the filter clockwise then flip it horizontally
then do inputxfilter

Question 10

Q

what is the condtion of input and filter when the are more than 1 channel?

Answer

A

the number of channels of input and filter must be the same
the number of channels of the output will be number of filters used

Question 11

Q

what is pooling layee? what are the types of pooling?

Answer

A

at pooling layer, instead of performing multiplication (*), use MAX, MIN, AVG operation instead.

Question 12

Q

diff between conv layer and pool layer?

Answer

A

conv layer has params but pool layer doesnt
in NN, conv1+pool1–> layer 1, conv2+pool2–> layer 2…

Question 13

Q

what is parameter sharing?

Answer

A

a feature detecter eg vertical edge detecter can be applied to other image to detect vertical edge

Question 14

Q

why doing convolution?

Answer

A

parameter sharing
sparsity of connections

Question 15

Q

what is sparsity of connections in convolution?

Answer

A

in each layer, each output value depends on small number of inputs

Question 16

Q

what are some classic neural networks?

Answer

A

lenet5: conv1 avgpool1 conv2 avgpool2 fc1 fc2 softmax –> very simple, common type of arrangement

alexnet: bigger, use maxpool instead of avgpool, same arrangement

vgg16: use padding (same) to preserve output dim,

Question 17

Q

what is residual block?

Answer

A

short cut from early input to later output (before ReLU sum up the ealier and the current)

Question 18

Q

advantage of resnet

Answer

A

allow to train deeper NN without hurting generability

Question 19

Q

why resnet work well without hurting performance?

Answer

A

identity function is ez to learn –> get back past result
g(wa+b +pasta)
because wa+b is small –> past a is large –> got a

Question 20

Q

feature of resnet?

Answer

A

have skip connections

Question 21

Q

what does an 1x1 convolution do?

Answer

A

shrink the number of channels or increase it

Question 22

Q

motivation for inception network?

Answer

A

improve computational cost

Question 23

Q

what is inception block?

Answer

A

apply different filters on 1 input and concat them as the output (bottle neck) –> input to other layer

Question 24

Q

what is advantage of mobilenet?

Answer

A

no need large computational resources

Question 25

Q

what is the feature of mobilenet?

Answer

A

depthwise separable conv
depthwise: filter will be fxf (not x nc) and nc filters ( 1 channel filter but no filters = no. channels) –> output has same nc
followed by pointwise conv with 1x1xnc’ filter
–> final output is n’xn’xnc’

Question 26

Q

what is effnet?

Answer

A

width, height and resolution can be scaled uniformly (compound scaling)

Brainscape's Knowledge GenomeTM

CNN Flashcards

Brainscape's Knowledge Genome^TM