CNN Flashcards

(25 cards)

1
Q

What is the main advantage of CNNs over traditional ML for image tasks?

A

They learn features directly from raw images, removing the need for manual feature engineering.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the three main types of layers in a CNN?

A

Convolutional layers, pooling layers, and fully connected layers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does a convolutional layer do?

A

Applies filters that extract local patterns from the input data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a filter (or kernel) in a CNN?

A

A small matrix that slides across the input to detect specific patterns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What operation does a filter perform on the input?

A

A dot product between the filter and a local region of the input.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does the stride hyperparameter control in a convolution?

A

How many pixels the filter moves at each step.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the effect of increasing stride in a CNN?

A

Reduces the spatial size of the output feature map.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does padding do in convolutional layers?

A

Adds borders to the input to control output dimensions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a feature map in a CNN?

A

The output produced by applying a filter across an image.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What kind of features do early CNN layers learn?

A

Low-level features like edges and textures.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What kind of features do deeper CNN layers learn?

A

High-level features like object parts or full shapes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the purpose of pooling layers?

A

To reduce spatial dimensions and make features more robust to translation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is max pooling?

A

A pooling method that takes the maximum value in each region.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the downsides of excessive pooling?

A

Loss of spatial detail and important information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the benefits of pooling?

A

Dimensionality reduction and translation invariance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does a fully connected layer in a CNN do?

A

Connects all activations to final outputs, used for classification or regression.

17
Q

What happens before data reaches the fully connected layer?

A

It is flattened from 2D feature maps into a 1D vector.

18
Q

What kind of activation function is typically used in CNNs?

A

ReLU (Rectified Linear Unit).

19
Q

What determines the size of the receptive field in a CNN?

A

The size of the filters and depth of the network.

20
Q

What does a stride of 2 mean in a convolutional layer?

A

The filter moves 2 pixels at a time, reducing resolution more quickly.

21
Q

What is one use of CNNs beyond image classification?

A

They can be used on time series (1D) and video data (3D convolutions).

22
Q

What does ‘transfer learning’ typically modify in a CNN?

A

The fully connected layers.

23
Q

What is the typical input to a CNN?

A

An image represented as a matrix of pixel values.

24
Q

What is the role of filters in learning?

A

Each filter detects a specific visual pattern relevant to the task.

25
What is a common visualisation of CNN operations?
A filter sliding over an image to produce a feature map.