CNN Flashcards

1
Q

CV problems

A

classifications, object detection, neural style transfer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

how to detect edges? (vertical, horizontal, dia)

A

use filter that contains 1s and 0s like
1 0 -1
1 0 -1
1 0 -1
to detect vertical edges
the values for pixels at the ver edge will be v large/small (lighter/darker color) –> diff from other pixels
similar for horizontal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is sobel or sehorr filter in edge detection?

A

1 0 -1
2 0 -2
1 0 -1
focus on the center pix
sehorr
3 0 -3
10 0 -10
3 0 -3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is padding

A

To prevent losing information at the edges of the image (convolution shrinks mattrix), add 0s around the orignal matrix to pad
p can be 0 1 2…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what the result dimension when apply a fxf filter on nxn mat?

A

n-f+1xn-f+1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

type of padding?

A

valid: no padding
same: output size same as input size
calculated based on input and filter size (f is usually odd)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is strided conv?

A

moving filter s steps
s can be 1 2 3…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

formula to cal output dimension

A

(n+2p-f)/s+1 round down

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

how to do cross correlation (deconvolution)

A

rotate the filter clockwise then flip it horizontally
then do inputxfilter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what is the condtion of input and filter when the are more than 1 channel?

A

the number of channels of input and filter must be the same
the number of channels of the output will be number of filters used

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what is pooling layee? what are the types of pooling?

A

at pooling layer, instead of performing multiplication (*), use MAX, MIN, AVG operation instead.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

diff between conv layer and pool layer?

A

conv layer has params but pool layer doesnt
in NN, conv1+pool1–> layer 1, conv2+pool2–> layer 2…

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what is parameter sharing?

A

a feature detecter eg vertical edge detecter can be applied to other image to detect vertical edge

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

why doing convolution?

A

parameter sharing
sparsity of connections

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is sparsity of connections in convolution?

A

in each layer, each output value depends on small number of inputs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

what are some classic neural networks?

A

lenet5: conv1 avgpool1 conv2 avgpool2 fc1 fc2 softmax –> very simple, common type of arrangement

alexnet: bigger, use maxpool instead of avgpool, same arrangement

vgg16: use padding (same) to preserve output dim,

17
Q

what is residual block?

A

short cut from early input to later output (before ReLU sum up the ealier and the current)

18
Q

advantage of resnet

A

allow to train deeper NN without hurting generability

19
Q

why resnet work well without hurting performance?

A

identity function is ez to learn –> get back past result
g(wa+b +pasta)
because wa+b is small –> past a is large –> got a

20
Q

feature of resnet?

A

have skip connections

21
Q

what does an 1x1 convolution do?

A

shrink the number of channels or increase it

22
Q

motivation for inception network?

A

improve computational cost

23
Q

what is inception block?

A

apply different filters on 1 input and concat them as the output (bottle neck) –> input to other layer

24
Q

what is advantage of mobilenet?

A

no need large computational resources

25
Q

what is the feature of mobilenet?

A

depthwise separable conv
depthwise: filter will be fxf (not x nc) and nc filters ( 1 channel filter but no filters = no. channels) –> output has same nc
followed by pointwise conv with 1x1xnc’ filter
–> final output is n’xn’xnc’

26
Q

what is effnet?

A

width, height and resolution can be scaled uniformly (compound scaling)