Learning Flashcards

(39 cards)

1
Q

Learning

A

the acquisition, from experience, of new knowledge, skills, or responses

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Animal model of learning: Aplysia (sea slug)

A
  • has only 20,000 neurons, but still shows simple forms of learning
    Habituation: decrease in responding when a painless stimulus is repeated
  • first touch to gill = withdrawal of gill and siphon
  • repeated touches = withdrawal fades

Sensitization: increase in responding when a (noxious/painful) stimulus is repeated
- first shock to tail = withdrawal of gill
- repeated shocks = stronger withdrawal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Animal model of learning: C elegans (small nematode worms)

A
  • have only 302 neurons
  • show simple forms of memory such as chemotaxis towards/away from salt based on past experiences. E.g., if previously fed in the presence of salt, it navigates towards salt (positive taxis to NaCl)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Classical (also: Pavlovian) Conditioning

A

A neutral stimulus gains significance through association:
1. Before conditioning:
Food is the unconditioned stimulus which leads to the unconditioned response of salivation in dogs

  1. Before conditioning:
    Tuning fork is a neutral stimulus that elicits no conditioned response
  2. During conditioning:
    Tuning fork (conditioned stimulus) is paired with food (unconditioned stimulus). Result is still the unconditioned response of salivation.
  3. After conditioning:
    Tuning fork becomes a conditioned stimulus that elicits the conditioned response of salivation (without the presence of food)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Ivan Pavlov

A

Russian physiologist initially interested in digestion and salivation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Watson & Rayner (1920): The Little Albert experiment

A
  • Demonstration of Pavlov’s effects in humans:
  • Demonstration of an emotional response: after pairing the previously neutral stimulus of a rat with the unconditioned stimulus of loud noise, the rat became a conditioned stimulus and elicited fear in Little Albert
  • Was interested in generalization (e.g., to other white objects like the rat such as Santa Claus) but didn’t get that far
  • Ethically dubious
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Role of Classical Conditioning in Phobias

A
  • In anxiety clinics, spider, dog and snake phobias predominate over electrical socket and gun phobias
  • This suggests an evolutionary biases in conditioned fear –> preparedness
    Garcia & Koelling (1966):
  • Quinine taste (tonic water, CS) could be associated with nausea (US) from just one pairing, but not with shock (US)
  • Light + tone (CS) could be associated with shock (US), but not nausea (US)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Extinction: inhibition or erasing?

A

Extinction = presentation of the conditioned stimulus alone (not in association with the US) leading to a decrease in the CR.
The learning of the connection between CS and US is not erased because the conditioned response (CR) can return, either after a break (spontaneous recovery) or after 1 new pairing - 1 CS-US pairing brings back the CR (reinstatement). This shows that relearning is fast, suggesting that the original learning wasn’t erased.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Generalization & Discrimination

A

A generalization function when initial training was to a 1050 Hz tone (the CS) - a CR is still observed at frequencies close to 1050 Hz.
But the observation that the CR is reduced to similar stimuli is also evidence of (psychological) discrimination.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Second-order conditioning

A

If a stimulus is presented alongside a conditioned stimulus, it can still elicit a conditioned response - this is second-order conditioning. E.g.,
1. Tone (CS) + Food (US) = Salivation (CR)
2. Picture (second-order stimulus) –> Tone (CS) = Salivation (CR)
3. Picture (second-order stimulus) = Salivation (CR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Classical conditioning in drug overdoses

A
  1. Drug paraphernalia (needles, ashtrays, environments) become conditioned stimuli that precede drug effects
  2. Siegel model of drug overdose:
    - for heroin users, CR is opposite to the UR
    - heroin: slows heart rate/breathing (UR)
    - drug paraphernalia: increase heart rate/breathing (CR)
  3. Risk of overdose if drug administered in novel environment (fewer CS) where the preparatory and compensatory CR does not occur
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Law of Effect (who & what)

A

Thorndike - law of effect is the principle that behaviours that are followed by a “satisfying state of affairs” tend to be repeated, and those that produce an “unpleasant state of affairs” are less likely to be repeated.
- invented the “puzzle box” for cats
- if the behaviour was “correct”, the animal was rewarded with food (rewarded actions are “stamped in”)
- incorrect behaviour = no result, animal was stuck in the box (profitless actions are “stamped out”)
- over time, the number of ineffective behaviours decreased and animals escaped from the box more and more quickly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Operant Conditioning (who & what)

A

Skinner - operant behaviour refers to behaviour that an organism performs that has some impact on the environment
- operant conditioning is based on the idea of reinforcement: any stimulus or event that increases the likelihood of the behaviour that led to it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Operant Conditioning Terminology

A

increases the likelihood of behaviour:
- Stimulus is presented = Positive Reinforcement
- Stimulus is removed = Negative Reinforcement

decreases the likelihood of behaviour
- Stimulus is presented = Positive Punishment
- Stimulus is removed = Negative Punishment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Shaping

A

Learning that results from the reinforcement of successive steps to a final desired behaviour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Superstitious learning

A

Attributing reward to the wrong response (occurs when the reward is presented after a fixed time interval regardless of what action the animal performs, leading to a random behaviour being reinforced)

17
Q

Immediate vs Delayed Reinforcement and Punishment in Operant Conditioning

A

Long delays between the occurrence of a behaviour and the reinforcer create a ‘credit assignment’ problem - the more time elapses, the less effective the reinforcer because it makes it difficult for the animal to figure out the exact behaviour they need to perform in order to obtain it.

18
Q

Immediate vs Delayed Reinforcement and Punishment in Classical Conditioning

A

Longer the delay between the CS and the UR, the harder it is to associate the UR with the CS and elicit a CR whenever the CS is presented. Exception:
Conditioned taste aversion: powerful (1 trial) learning of connection of CS with sickness (UR) occurring hours later

19
Q

Schedules of Reinforcements

A
  • Fixed-Interval
  • Variable-Interval
    ^ produce slow, methodical responding because reinforcements follow a time scale that is independent of how many responses occur
  • Fixed-Ratio
  • Variable-Ratio
20
Q

Fixed-Interval Schedule

A

reinforcers presented at fixed time periods, provided that the appropriate response is made –> show a scalloping effects: after each reinforcement, responding pauses, but as the next time interval draws to a close, there is a burst of responding

21
Q

Variable-Interval Schedule

A

a behaviour is reinforced on the basis of an average time that has expired since the last reinforcement –> produce steady, consistent responding because the time until the next reinforcement is less predictable

22
Q

Fixed-Ratio Schedule

A

reinforcement is delivered after a specific number of responses have been made.
Special case - presentation of reinforcement after every response = continuous reinforcement

23
Q

Variable-Ratio Schedule

A

delivery of reinforcement is based on a particular average number of responses, although the ratio of responses to reinforcements is variable.
- these produce slightly higher rates of responding than fixed-ratio schedules because the organism never knows when the next reinforcement is going to appear. higher the reinforcement-response ratio = higher the response rate

24
Q

Which schedules (ratio/interval) yield higher overall responding?

25
____ schedules are resistant to extinction
Variable-ratio and variable-interval
26
Vicarious conditioning
learning associations from watching others
27
Observational learning in monkeys (who & what)
Mineka - young lab-reared rhesus monkeys watch video of parent behaving fearfully, spliced with either a snake (prepared CS) or a flower (unprepared CS) - two stimuli these monkeys have never seen before - after videos, monkeys displayed fear to snake but not flower (evolutionary basis: learn to fear something relevant to their species)
28
Observational learning in children (Bandura)
An adult *model* - someone whose behaviour might be a guide for others - played with a Bobo doll aggressively and children observed these actions, thereby becoming twice as likely to interact with the doll aggressively - social learning theory - ubiquitous type of learning: more common than learning first-hand
29
Observational learning in children (Meltzoff & Marshall; Gergely)
- Even young children (14 months) are capable of sophisticated, deferred **imitation** - supports bonding and survival - imitation is rational
30
Mirror neurons (who & what)
Rizzolatti - discovered neurons in monkeys that fire when performing an action or observing that same action performed by another individual - it is argued that mirror neurons are a mechanism for observational learning
31
Cognitive aspects of conditioning
cognitive factors like expectations, surprise, informativeness are involved in operant and classical conditioning. E.g., the Rescorla-Wagner model, Tolman's cognitive maps --> the stimulus does not directly evoke a response; rather, it establishes an internal cognitive state that then produces the behaviour
32
Rescorla-Wagner model
CS functions by setting up an expectation about the arrival of a US - involves cognitive processes
33
Latent learning
Tolman: knowledge acquired without reinforcement, becoming evident as behavioural change only when it becomes relevant
34
Cognitive maps (Tolman)
Beyond simply learning "start here, end here", his rats had created a mental representation of the local geography that enabled them to find food in a novel radial arm maze. ^ pure stimulus-response training cannot explain this; higher-level abstraction of information is present
35
Intra-cranial self-stimulation paradigm
- Olds & Milner implanted electrodes in a rat's brain that electrically stimulated nucleus accumbens (where dopamine is released) when a lever is pressed - Rats repeated lever pressed to drive electrical stimulation of their own dopamine system, even to the point of exhaustion dopamine release = a reward
36
Dopamine pathway
- Dopamine released in nucleus accumbens - Medial forebrain bundle is the dopamine pathway from midbrain to basal ganglia
37
Dopamine and Classical Conditioning
Schultz - implanted recording electrodes in dopamine neurons of monkeys - pair visual stimulus (CS) with juice reward (US) - dopamine neurons (in the midbrain) signal reward **expectancy** - once the pairing has been learned, the neuron only fires to the CS, **not the reward itself** - so, dopamine is involved in **prediction and information** rather than 'pleasure' --> the "wanting" rather than the "liking"
38
Excessive dopamine in substance use disorder
- drug abuse can increase dopamine temporarily - chronic use leads to homeostatic changes in the brain (i.e., lower dopamine levels) - leads to withdrawal symptoms + increased drug-seeking as patients are looking to offset the homeostatic changes in the dopamine system ^ positive reinforcement = operant conditioning as the reinforcer of increased dopamine leads to the increase in the behaviour of drug use
39
Excessive dopamine in gambling
Environmental cues (such as bright flashing lights) can trigger dopamine release in people addicted to gambling and lead to reward-seeking = classical conditioning