9 - The Man Who Set Back Machine Learning (Not Really) Flashcards
(46 cards)
What is deep learning?
The process of training neural networks that have three or more layers (one input layer, one output layer, and one or more hidden layers).
Who is George Cybenko?
A professor of engineering at Dartmouth College known for his work on neural networks.
What significant event occurred in 2017 related to deep learning?
A summer school on deep learning in Bilbao, Spain, attended by nearly thirteen hundred people.
What is the universal approximation theorem?
A theorem showing that a neural network with just one hidden layer, given enough neurons, can approximate any continuous function to arbitrary accuracy.
What did Cybenko’s landmark paper demonstrate?
It proved that a feed-forward network with a single hidden layer of sigmoidal neurons can approximate any continuous function (on a compact domain) as closely as desired.
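The one-hidden-layer approximator in Cybenko's theorem is a weighted sum of sigmoids, G(x) = Σᵢ αᵢ σ(wᵢx + bᵢ). A minimal sketch of that form (the particular weights below are illustrative, not taken from the paper; they show how one steep sigmoid neuron already approximates a step function):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def one_hidden_layer(x, weights, biases, alphas):
    # G(x) = sum_i alpha_i * sigmoid(w_i * x + b_i)
    return sum(a * sigmoid(w * x + b)
               for w, b, a in zip(weights, biases, alphas))

# With a large weight, one sigmoid neuron approximates a step at x = 0.5:
# sigmoid(50 * x - 25) is near 0 for x < 0.5 and near 1 for x > 0.5.
def step_approx(x):
    return one_hidden_layer(x, weights=[50.0], biases=[-25.0], alphas=[1.0])
```

Summing many such shifted, scaled steps is how the construction builds up an arbitrary continuous function.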
What did Minsky and Papert’s book ‘Perceptrons’ conclude?
That single-layer perceptrons cannot solve problems that are not linearly separable (such as XOR), and it conjectured that extending the approach to multi-layer networks would likely be fruitless.
What is backpropagation?
An algorithm for training multi-layer neural networks: it propagates the output error backward through the layers to compute how each weight should change.
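A minimal backpropagation sketch for a tiny two-layer sigmoid network trained on XOR, the function a single-layer perceptron cannot represent. The architecture, learning rate, and random seed here are illustrative choices, not details from the 1986 paper:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(0)

# Toy data: the XOR function.
X = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
y = np.array([[0.], [1.], [1.], [0.]])

# Two weight matrices: input -> hidden and hidden -> output.
W1 = rng.normal(size=(2, 4)); b1 = np.zeros(4)
W2 = rng.normal(size=(4, 1)); b2 = np.zeros(1)

def mean_squared_error():
    out = sigmoid(sigmoid(X @ W1 + b1) @ W2 + b2)
    return float(np.mean((out - y) ** 2))

loss_before = mean_squared_error()
lr = 0.5
for _ in range(5000):
    # Forward pass.
    h = sigmoid(X @ W1 + b1)            # hidden activations
    out = sigmoid(h @ W2 + b2)          # network output
    err = out - y

    # Backward pass: push the error through each layer in turn.
    d_out = err * out * (1 - out)       # sigmoid' = out * (1 - out)
    d_h = (d_out @ W2.T) * h * (1 - h)

    # Gradient-descent updates for both weight matrices.
    W2 -= lr * h.T @ d_out; b2 -= lr * d_out.sum(axis=0)
    W1 -= lr * X.T @ d_h;   b1 -= lr * d_h.sum(axis=0)

loss_after = mean_squared_error()
```

The key idea is the backward pass: the output-layer error is multiplied by each layer's weights and activation derivatives to obtain the hidden-layer error, which is exactly what the perceptron rule could not do.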
Who were the authors of the seminal 1986 backpropagation paper?
David Rumelhart, Geoffrey Hinton, and Ronald Williams.
What is the role of a single-layer perceptron?
It takes input values, calculates weighted sums, and produces an output based on a thresholding function.
What is the perceptron training algorithm used for?
To train a single-layer neural network by finding optimal weights and biases.
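The classic perceptron rule updates the weights only on mistakes, nudging them toward (or away from) the misclassified input. A minimal sketch on the AND function, which is linearly separable, so the algorithm converges (the dataset, epoch count, and learning rate are illustrative):

```python
def predict(w, b, x):
    # Weighted sum of inputs plus bias, passed through a hard threshold.
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if z > 0 else 0

def train_perceptron(samples, epochs=20, lr=1.0):
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for x, target in samples:
            error = target - predict(w, b, x)   # -1, 0, or +1
            # Update only when the prediction is wrong.
            w = [wi + lr * error * xi for wi, xi in zip(w, x)]
            b += lr * error
    return w, b

# AND is linearly separable, so a single-layer perceptron can learn it.
and_data = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]
w, b = train_perceptron(and_data)
```

Running the same loop on XOR never converges, which is the limitation Minsky and Papert highlighted.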
What differentiates a deep neural network from a single-layer network?
A deep neural network has multiple weight matrices due to the presence of hidden layers.
True or False: The perceptron training algorithm can be used for networks with hidden layers.
False.
What does training a neural network involve?
Finding optimal values for the weight matrices to approximate a desired function.
Fill in the blank: Cybenko’s work is often associated with _______.
delaying deep learning by twenty years.
What can a function represented by a neural network achieve?
It can represent a decision boundary or perform regression.
What is an activation function?
A function that determines the output of a neuron based on its input.
What is the sigmoid activation function?
A smooth, S-shaped function that transitions from values near 0 (for large negative inputs) to values near 1 (for large positive inputs).
What type of neuron did Cybenko use in his proof?
A nonlinear neuron based on the sigmoid activation function.
What is the significance of having multiple weight matrices in a network?
It allows the network to be classified as a deep neural network.
What does the term ‘AI winter’ refer to?
A period of reduced funding and interest in artificial intelligence research.
What is a hidden layer in a neural network?
A layer of neurons that is not directly exposed on the output side.
What is the output of a bipolar neuron?
+1 or -1.
What activation function is used in Cybenko’s neurons?
The sigmoid activation function, a(z) = σ(z)
The sigmoid function smoothly transitions from almost 0 to almost 1.
What does the equation z = wx + b represent in the context of a neuron?
The weighted sum of the inputs plus the bias term
Here, z is the dot product of the weight vector w with the input x, plus the bias b.
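Putting the last two cards together, a sigmoid neuron first computes z = w·x + b and then applies a(z) = σ(z). A minimal sketch (the weight and bias values below are illustrative):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def neuron(w, b, x):
    # z = w . x + b: the weighted sum of the inputs plus the bias.
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return sigmoid(z)

# When the weighted sum cancels out, z = 0 and the sigmoid outputs exactly 0.5.
output = neuron(w=[1.0, -1.0], b=0.0, x=[0.5, 0.5])  # z = 0.5 - 0.5 + 0 = 0
```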