Key terms Flashcards

1
Q

What is supervised learning?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is unsupervised learning?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is unsupervised learning?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is an example?

A

A feature-label pair (sometimes, when the context is clear, we may use the term examples to refer to a collection of inputs, even when the corresponding labels are unknown)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

types of supervised learning tasks (5)

A
  1. Regression
  2. Classification
  3. Searching
  4. Recommending
  5. Sequence learning
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the difference between regression and classification?

A

Regression predicts a continuous value whilst classification predicts a categorical outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is optimisation?

A

the process of fitting the model to our observed data by altering the model parameters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the value produced by the loss function?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is a particular model parameter updated after calculating the loss?

A

The original value subtract the derivative of the loss with respect to the model parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the gradient vector?

A

A vector that contains all the partial derivates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

In order to calculate the gradient with respect to some input features, what must the output be?

A

A scalar

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Common regression problem examples (3)

A
  1. Predicting prices
  2. Predicting length of stay
  3. Forecasting demand
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the bias in linear regression?

A

a value that represents the value of y when all inputs are 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What do the weights represent in linear regression?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is gradient descent?

A

an optimisation algorithm that iteratively reduces the error by updating the model parameters in the direction that incrementally lowers the loss function

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Consequence of a learning rate which is too small?

A

the optimisation process may converge very slowly

17
Q

Consequence of a learning rate which is too large?

A

the optimisation process may become unstable

18
Q

Output dimensions of Xw? (linear regression)

A

n x 1 (one output for every observation/example)

19
Q

What is the purpose of the softmax operation?

A

convert them to probabilities

20
Q

What is the purpose of activation functions?

A

to introduce non-linearity into the model, enabling it to learn and represent complex relationships in the data

21
Q

What exactly do activation functions do?

A

Decide whether whether a neuron should be activated or not by calculating the weighted sum and further adding bias to it.

22
Q

Commonly used activation functions (3)

A
  1. ReLU
  2. Sigmoid
  3. Tanh
23
Q

What happens if a neuron is not activated?

A

the neuron does not send its output signal to neurons in the next layer of the neural network

24
Q

What is generalisation?

A

the ability of a trained model to perform well on new, unseen data that it hasn’t encountered during training

25
Q

What are convolutional neural networks?

A
26
Q

Why are fully connected layers not suitable for images?

A
27
Q

What is spatial invariance?

A

a property in convolutional neural networks (CNNs) where the network learns to recognize patterns or features in the input data regardless of their specific spatial location within the input image.

28
Q

What is the channel dimension?

A

The number of channels for an image?

29
Q

What is the most popular example of sequence data?

A

Text

30
Q
A