Deep Learning - Dr Bashivan Flashcards
(49 cards)
why should you not include every detail in a neural model?
- more difficult to interpret
- lower feasibility
- more difficult optimization
what is the current practical sweet spot for amount of detail integration?
deep neural nets
what is the up and down of verbal explanations?
- easy to communicate!
- has a narrow bandwidth :/
what is the up and down of quantitative explanations? (code)
- easily transferrable, easy communication, can answer questions without costly experiments
- requires coding literacu
what is the classic approach for studying neuroscience?
identify and characterize individual elemnts in the brain (bottom->up approach)
what is the difference between machine learning and deep learning?
machine: figure out a template/feature of what you are looking for and then classify
deep learning: feature extraction + classification happen at the same time
why is the classic approach for studying the brain not so efficient?
only considers one of few tasks at a time, and only a few neurons
give an example of the classic approach
surround modulation and two-interval discrimination
what components is the deep learning framework based on?
- architecture
- learning objective (cost functions)
- learning rule
- dataset (secondary axis)
what are 3 principles of holistic deep learning approach?
- units have ubiquitous functionality
- units’ function diversity comes from autonomous learning
- groups of units are orchestrated to facilitate internalized or external objectives
name 2 static architecture models
- multilayer perceptrons
- convolutional neural network
what is multilayer perceptrons?
each unit in a layer is connected to all the units in the previous and following layer
what in convolutional neuron network?
units are locally connected to subgroups of units
name the 2 dynamic architecture models
- recurrent neural network
- transformers
what is recurrent neural network?
internal memory gets updated based on observations
what are the 3 types of cost-functions strategies?
unsupervised, supervised, reward-based
what is unsupervised objective (cost) functions?
- learn from observations, model reproduces what it sees: predicting errors, continuity, sparsity
- has generative consistency: wake-sleep algorithm, generative neural networks
give an example (allegory?) of unsupervised objective functions
finishing someone’s sentence
what is a downside of unsupervised learning algorithms?
it may fail to discover properties of the world that ae statistically weak but important for survival
how can we solve the problem of unsupervised objective functions not discovering essential properties?
supervised objective functions
give examples of supervised objective functions
object recognition
object detection
source localization
what is reward-based cost functions?
agents try to maximize reward
how are costs encoded in the brain vs in neural net?
brain : genes
neural net:
- cost-encoding neural net (small)
- task-performing neural net (large)
what are the 3 learning rules you can use?
following a gradient, not following a gradient, partially following a gradient