Deep nets overview Flashcards

1
Q

What is the sigmoid function?

A

The logistic function σ(x) = 1 / (1 + e^(-x)); it squashes any real input into the range (0, 1)
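
A minimal sketch in Python (NumPy assumed; the code is not part of the original deck):

```python
import numpy as np

def sigmoid(x):
    # Logistic function: maps any real input into (0, 1)
    return 1.0 / (1.0 + np.exp(-x))

print(sigmoid(0.0))                          # 0.5
print(sigmoid(np.array([-5.0, 0.0, 5.0])))   # ~[0.0067, 0.5, 0.9933]
```
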
2
Q

What are layers in a deep neural net?

A

The building blocks of the net: each layer takes the previous layer's outputs as its inputs, transforms them, and passes its own outputs on to the next layer

3
Q

What is the first layer called?

A

The input layer

4
Q

What is the main rationale behind a deep net?

A

We can use the outputs of one layer as the inputs of the next

5
Q

What is the last layer called?

A

The output layer

The layer whose outputs we compare to the targets

6
Q

What are the layers between the input and output layers called?

A

The hidden layers

7
Q

What are the building blocks of hidden layers called?

A

Hidden units or hidden nodes

8
Q

What is the width of a layer?

A

The number of hidden units in a hidden layer

9
Q

What are some examples of hyperparameters?

A

Width, depth, learning rate

10
Q

What are examples of parameters?

A

Weights (w)
Biases (b)

11
Q

What are the differences between Parameters and Hyperparameters?

A

Hyperparameters are pre-set by us

Parameters are found by optimizing the model

12
Q

Why is non-linearity needed?

A

So we can model more complicated relationships

Non-linearity also gives us the ability to stack layers: with only linear relationships, stacking is meaningless, because any stack of linear layers collapses into a single linear transformation

To have deep nets that find complex relationships through arbitrary functions, we need non-linearities

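
A quick numeric sketch of why purely linear layers collapse (NumPy assumed; the weights are random placeholders):

```python
import numpy as np

rng = np.random.default_rng(0)
W1 = rng.normal(size=(4, 3))   # first linear layer
W2 = rng.normal(size=(2, 4))   # second linear layer
x = rng.normal(size=3)

# Two stacked linear layers are equivalent to a single one
# with weights W2 @ W1, so the stacking adds no expressive power.
stacked = W2 @ (W1 @ x)
single = (W2 @ W1) @ x
print(np.allclose(stacked, single))  # True
```
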
13
Q

In machine learning, what are non-linearities called?

A

Activation functions or transfer functions

14
Q

What do activation functions do?

A

Transform inputs into outputs of a different kind

15
Q

What are the 4 common activation functions?

A
  1. Sigmoid (logistic function)
  2. TanH (hyperbolic tangent)
  3. ReLU (rectified linear unit)
  4. Softmax
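
Minimal sketches of all four (NumPy assumed; subtracting the max inside softmax is a common numerical-stability convention, not from the deck):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))   # range (0, 1)

def tanh(x):
    return np.tanh(x)                 # range (-1, 1)

def relu(x):
    return np.maximum(0.0, x)         # range [0, inf)

def softmax(x):
    e = np.exp(x - np.max(x))         # subtract max for stability
    return e / e.sum()                # outputs sum to 1
```
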
16
Q

What are the properties of softmax?

A

Range: (0, 1)
The outputs always sum to 1

17
Q

What does the softmax transformation do?

A

Transforms a vector of arbitrarily large or small numbers into a valid probability distribution
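
For instance, a self-contained sketch (NumPy assumed; the logits are made up):

```python
import numpy as np

def softmax(x):
    e = np.exp(x - np.max(x))   # subtract max for numerical stability
    return e / e.sum()

logits = np.array([2.0, 1.0, -3.0])   # arbitrary scores
p = softmax(logits)
print(p)          # ~[0.727, 0.268, 0.005], a valid distribution
print(p.sum())    # 1.0
```
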

18
Q

Where is the softmax activation function used?

A

As the activation of the output layer in a classification problem

19
Q

What is Backpropagation?

A

The algorithm for computing the gradient of the loss with respect to every weight and bias, by applying the chain rule backward through the net, from the output layer toward the input

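A minimal hand-rolled sketch for a one-hidden-layer net with squared loss (NumPy assumed; shapes and data are made up):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
x = rng.normal(size=3)                 # one input example
t = np.array([1.0])                    # its target
W1 = rng.normal(size=(4, 3))
W2 = rng.normal(size=(1, 4))

# Forward pass
h = sigmoid(W1 @ x)                    # hidden activations
y = W2 @ h                             # output
loss = 0.5 * np.sum((y - t) ** 2)      # squared loss

# Backward pass: chain rule applied from the output back
dy = y - t                             # dL/dy
dW2 = np.outer(dy, h)                  # dL/dW2
dh = W2.T @ dy                         # dL/dh
dz = dh * h * (1 - h)                  # sigmoid'(z) = h * (1 - h)
dW1 = np.outer(dz, x)                  # dL/dW1
```
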
20
Q

How do we optimize the objective function?

A

The training process updates the parameters through gradient descent to optimize the objective function, i.e. to minimize the loss
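
A minimal gradient-descent sketch on a toy linear model (NumPy assumed; the data and learning rate are made up):

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))                  # toy inputs
true_w = np.array([2.0, -1.0, 0.5])
y = X @ true_w + 0.1 * rng.normal(size=100)    # toy targets

w = np.zeros(3)
lr = 0.1                                       # learning rate hyperparameter
for _ in range(200):
    grad = 2 * X.T @ (X @ w - y) / len(y)      # gradient of the mean squared loss
    w -= lr * grad                             # step against the gradient
print(w)   # close to [2.0, -1.0, 0.5]
```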

21
Q

What is Forward Propagation?

A

The process of pushing inputs through the net
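
A minimal forward-pass sketch with one hidden layer (NumPy assumed; all shapes and values are made up):

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

rng = np.random.default_rng(0)
x = rng.normal(size=4)                          # one input example
W1, b1 = rng.normal(size=(5, 4)), np.zeros(5)   # hidden layer parameters
W2, b2 = rng.normal(size=(2, 5)), np.zeros(2)   # output layer parameters

h = sigmoid(W1 @ x + b1)   # hidden layer: linear step, then activation
out = W2 @ h + b2          # output layer
print(out)
```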