Learning Flashcards
(39 cards)
Learning
the acquisition, from experience, of new knowledge, skills, or responses
Animal model of learning: Aplysia (sea slug)
- has only 20,000 neurons, but still shows simple forms of learning
Habituation: decrease in responding when a painless stimulus is repeated
- first touch to gill = withdrawal of gill and siphon
- repeated touches = withdrawal fades
Sensitization: increase in responding when a (noxious/painful) stimulus is repeated
- first shock to tail = withdrawal of gill
- repeated shocks = stronger withdrawal
Animal model of learning: C. elegans (small nematode worms)
- have only 302 neurons
- show simple forms of memory such as chemotaxis towards/away from salt based on past experiences. E.g., if previously fed in the presence of salt, it navigates towards salt (positive taxis to NaCl)
Classical (also: Pavlovian) Conditioning
A neutral stimulus gains significance through association:
1. Before conditioning:
Food is the unconditioned stimulus (US) which leads to the unconditioned response (UR) of salivation in dogs; the tuning fork is a neutral stimulus that elicits no response
2. During conditioning:
Tuning fork (neutral stimulus) is paired with food (unconditioned stimulus). The result is still the unconditioned response of salivation.
3. After conditioning:
Tuning fork has become a conditioned stimulus (CS) that elicits the conditioned response (CR) of salivation (without the presence of food)
Ivan Pavlov
Russian physiologist initially interested in digestion and salivation
Watson & Rayner (1920): The Little Albert experiment
- Demonstration of Pavlov’s effects in humans:
- Demonstration of an emotional response: after pairing the previously neutral stimulus of a rat with the unconditioned stimulus of loud noise, the rat became a conditioned stimulus and elicited fear in Little Albert
- Was interested in generalization to other white, furry objects similar to the rat (e.g., a Santa Claus mask) but didn't get that far
- Ethically dubious
Role of Classical Conditioning in Phobias
- In anxiety clinics, spider, dog and snake phobias predominate over electrical socket and gun phobias
- This suggests an evolutionary bias in conditioned fear –> preparedness
Garcia & Koelling (1966):
- Quinine taste (tonic water, CS) could be associated with nausea (US) from just one pairing, but not with shock (US)
- Light + tone (CS) could be associated with shock (US), but not nausea (US)
Extinction: inhibition or erasing?
Extinction = presentation of the conditioned stimulus alone (not in association with the US) leading to a decrease in the CR.
The learning of the connection between CS and US is not erased, because the conditioned response (CR) can return, either after a break (spontaneous recovery) or after a single new CS-US pairing (reinstatement). Relearning is fast, which suggests that the original learning was inhibited rather than erased.
Generalization & Discrimination
A generalization gradient: when initial training was to a 1050 Hz tone (the CS), a CR is still observed at frequencies close to 1050 Hz.
But the observation that the CR is reduced to similar stimuli is also evidence of (psychological) discrimination.
Second-order conditioning
If a new stimulus is paired with an established conditioned stimulus (without the US), the new stimulus can come to elicit the conditioned response - this is second-order conditioning. E.g.,
1. Tone (CS) + Food (US) = Salivation (CR)
2. Picture (second-order stimulus) –> Tone (CS) = Salivation (CR)
3. Picture (second-order stimulus) = Salivation (CR)
Classical conditioning in drug overdoses
- Drug paraphernalia (needles, ashtrays, environments) become conditioned stimuli that precede drug effects
- Siegel model of drug overdose:
- for heroin users, the CR is opposite to the UR:
- heroin: slows heart rate/breathing (UR)
- drug paraphernalia: increase heart rate/breathing (compensatory CR)
- Risk of overdose if the drug is administered in a novel environment (few conditioned cues), where the preparatory, compensatory CR does not occur
Law of Effect (who & what)
Thorndike - law of effect is the principle that behaviours that are followed by a “satisfying state of affairs” tend to be repeated, and those that produce an “unpleasant state of affairs” are less likely to be repeated.
- invented the “puzzle box” for cats
- if the behaviour was “correct”, the animal was rewarded with food (rewarded actions are “stamped in”)
- incorrect behaviour = no result, animal was stuck in the box (profitless actions are “stamped out”)
- over time, the number of ineffective behaviours decreased and animals escaped from the box more and more quickly
Operant Conditioning (who & what)
Skinner - operant behaviour refers to behaviour that an organism performs that has some impact on the environment
- operant conditioning is based on the idea of reinforcement: any stimulus or event that increases the likelihood of the behaviour that led to it
Operant Conditioning Terminology
increases the likelihood of behaviour:
- Stimulus is presented = Positive Reinforcement
- Stimulus is removed = Negative Reinforcement
decreases the likelihood of behaviour:
- Stimulus is presented = Positive Punishment
- Stimulus is removed = Negative Punishment
Shaping
Learning that results from the reinforcement of successive steps to a final desired behaviour
Superstitious learning
Attributing reward to the wrong response (occurs when the reward is presented after a fixed time interval regardless of what action the animal performs, leading to a random behaviour being reinforced)
Immediate vs Delayed Reinforcement and Punishment in Operant Conditioning
Long delays between the occurrence of a behaviour and the reinforcer create a 'credit assignment' problem: the more time elapses, the less effective the reinforcer, because it becomes harder for the animal to figure out which exact behaviour it needs to perform in order to obtain it.
Immediate vs Delayed Reinforcement and Punishment in Classical Conditioning
The longer the delay between the CS and the US, the harder it is to associate the two and elicit a CR whenever the CS is presented. Exception:
Conditioned taste aversion: powerful (1 trial) learning of connection of CS with sickness (UR) occurring hours later
Schedules of Reinforcements
- Fixed-Interval
- Variable-Interval
^ interval schedules produce slow, methodical responding because reinforcements follow a time scale that is independent of how many responses occur
- Fixed-Ratio
- Variable-Ratio
Fixed-Interval Schedule
reinforcers are presented at fixed time periods, provided that the appropriate response is made –> shows a scalloping effect: after each reinforcement, responding pauses, but as the next time interval draws to a close, there is a burst of responding
Variable-Interval Schedule
a behaviour is reinforced on the basis of an average time that has elapsed since the last reinforcement –> produces steady, consistent responding because the time until the next reinforcement is less predictable
Fixed-Ratio Schedule
reinforcement is delivered after a specific number of responses have been made.
Special case - presentation of reinforcement after every response = continuous reinforcement
Variable-Ratio Schedule
delivery of reinforcement is based on a particular average number of responses, although the ratio of responses to reinforcements is variable.
- these produce slightly higher rates of responding than fixed-ratio schedules because the organism never knows when the next reinforcement is going to appear; the higher the response-to-reinforcement ratio, the higher the response rate
Which schedules (ratio/interval) yield higher overall responding?
Ratio