lesson 4 Flashcards

(60 cards)

1
Q

Rescorla-Wagner Model (RW)
Q1: What is the purpose of a learning theory in psychology?

A

To explain or generate learning behavior using a set of rules, often expressed mathematically.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What type of conditioning does the Rescorla-Wagner model explain?

A

Pavlovian (classical) conditioning only.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the main idea of the RW model?

A

What is the main idea of the RW model?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What does the RW equation describe?

A

: It describes how associative strength (V) changes over time as learning occurs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is associative strength (V)?

A

A measure of how strongly a conditioned stimulus (CS) is associated with an unconditioned stimulus (US).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does ΔV represent in the RW model?

A

The change in associative strength; how much learning happens on a given trial.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does α (alpha) represent in the RW equation?

A

The salience of the CS – how noticeable or attention-grabbing it is.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does β (beta) represent in the RW equation?

A

The salience of the US – typically set to 1 in most experiments.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What does λ (lambda) represent in the RW equation?

A

The intensity or value of the US – what actually happens in the world.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the equation used in the RW model?

A

ΔV = αβ(λ − V)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What does the term (λ − V) represent in the equation?

A

Prediction error – the difference between what happens (λ) and what is expected (V).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When does learning occur according to the RW model?

A

When prediction error is large; the more surprise, the more learning.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What happens when λ = V in the RW model?

A

Prediction error is zero; ΔV = 0, so no learning occurs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the RW model say about multiple CSs?

A

Each CS has its own associative strength (e.g., VA, VB, etc.), which is updated individually.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the term “asymptote of learning” mean in this context?

A

It’s the maximum level of associative strength (V) the CS can reach, which equals λ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the performance rule in RW?

A

The stronger the associative strength (V), the stronger the conditioned response (CR).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Jimmy has lived his entire life in a tiny village where there are no traffic lights. He has now moved to the metropolis of Waterloo to attend university. He is standing at a junction with a traffic light when it turns red and all the cars stop. Jimmy is very surprised, because he did not know that red lights mean stop.
Jimmy’s prediction error is?

Large

Small

Zero

Infinity

A

large

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

As a result of this experience, Jimmy will

not learn anything

learn a little

learn a lot

A

learn a lot

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

In Jimmy’s brain, the strength of the association between red lights and stopping will

Increase

Decrease

Become zero

Give him an aneurysm

A

increase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

V =

A

associative strength

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

salience of the us

A

B

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

intensity of the US

A

upside down y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

delta V

A

change in associateive strength

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

a

A

salience of the CS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Associative Strength
The strength of the association between a Conditioned Stimulus (CS) and an Unconditioned Stimulus (US). Represented by the variable V.
26
Asymptote of Learning
The maximum associative strength that a CS can reach. This is often determined by the intensity or value of the US (λ).
27
cue competition
Phenomena like blocking and overshadowing, where multiple CSs present during training interact and affect the learning about each other. The RW model was specifically developed to explain these.
28
Overshadowing (in terms of RW)
When two CSs (A and B) are paired with a US, they compete for associative strength. The presence of one CS (A) reduces the prediction error associated with the other (B), leading to less learning about B compared to when B is trained alone.
29
Blocking (in terms of RW)
Prior learning about one CS (A predicting the US) reduces the prediction error when a second CS (B) is added to A and paired with the US. Because the animal already expects the US due to A, it learns less about the new CS B.
30
Conditioned Inhibition (in terms of RW)
A CS (B) that is paired with the absence of the US in the presence of another CS (A that predicts the US) develops a negative associative strength (V B ​ <0). This negative value represents that B predicts the absence of the US.
31
RW Equation for A+US trials (Conditioned Inhibition Training)
ΔV A =α A (λ−V A )Where λ>0 (US is present).
32
RW Equation for AB [+ nothing] trials (Conditioned Inhibition Training)
For CS A: ΔV A =α A (0−[V A +V B]) For CS B: ΔV B=α B (0−[V A+V B]) Where λ=0 (no US is present).
33
Extinction (in terms of RW)
Repeated presentation of a CS without the US (λ=0) leads to a decrease in the associative strength (V) of that CS. The model suggests that the animal is "unlearning" the association until V approaches 0.
34
RW Equation for Extinction
ΔV=α(0−V) Where λ=0 (no US is present), and V is the current associative strength of the CS.
35
Limitation of RW in explaining Extinction
The RW model incorrectly predicts that extinction is simply the "unlearning" of the original association, leading to a CS becoming like a neutral stimulus again (V→0). It fails to account for phenomena like spontaneous recovery, suggesting that the original association is not entirely lost.
36
RW Prediction for Extinction of a Conditioned Inhibitor
When a conditioned inhibitor (B with V B ​ <0) is presented alone without the US (λ=0), the RW model predicts that its negative associative strength will increase towards 0. This is because the animal expects "negative US" and receives no US, resulting in a positive prediction error.
37
Behavioral Findings on Extinction of a Conditioned Inhibitor
Contrary to the RW model's prediction, conditioned inhibitors often do not extinguish readily when presented alone without the US. Their inhibitory properties tend to persist.
38
Latent Inhibition - Phase 1 (Experimental Group)Latent Inhibition - Phase 2 (Experimental Group) Latent Inhibition - Control Group
Repeated presentations of a Neutral Stimulus (NS) followed by nothing (NS → nothing). The same stimulus from Phase 1 is now paired with a US (NS → US). Does not receive Phase 1. In Phase 2, the same NS is paired with the US (NS → US).
39
Result of Latent Inhibition Experiment
The Experimental group learns the association between the NS and the US more slowly in Phase 2 compared to the Control group.
40
RW Prediction for Latent Inhibition - Phase 1
According to RW, no learning should occur in Phase 1 because there is no US (λ=0) and the initial associative strength of the NS is V=0. Therefore, ΔV=α(0−0)=0.
41
Limitation of RW in explaining Latent Inhibition
RW fails to predict latent inhibition because it assumes that learning only occurs in the presence of a prediction error driven by the US. In the absence of a US (Phase 1), there is no prediction error and thus no learning, even though the animal is clearly learning something about the CS (that it predicts nothing).
42
Latent Learning (as a broader concept)
Learning that occurs without any obvious reinforcement and is not immediately expressed in behavior. RW struggles to account for such learning.
43
Sensory Preconditioning
A phenomenon where prior pairing of two neutral stimuli (A-B) can lead to learning about one stimulus (B-US) affecting the response to the other (A) later. RW does not predict this.
44
General Limitation of RW regarding Inhibition
RW has difficulties explaining several phenomena related to inhibition, including the persistence of inhibition during extinction and the initial learning that occurs when a stimulus predicts the absence of a US (as seen in latent inhibition).
45
Core Idea of the Rescorla-Wagner Model
Learning depends on the surprisingness of the US. Learning occurs when there is a prediction error (the difference between what is expected and what is received).
46
How Associative Strengths Combine in RW
The associative strengths of all CSs present on a trial are summed to determine the total expectation of the US.
47
RW Explanation of Excitatory vs. Inhibitory CSs
Excitatory CSs acquire a positive associative strength (V>0), predicting the occurrence of the US. Inhibitory CSs acquire a negative associative strength (V<0), predicting the absence of the US.
48
Influence of Background Context in RW
Background contextual stimuli also acquire associative strength and contribute to the overall prediction of the US, influencing learning about other CSs.
49
Key Insight from RW-Stimulated Research
Conditioning is not a simple matter of pairing two events. Learning depends on the existing associations and the prediction error.
50
Shortcomings of the Rescorla-Wagner Model (Summary)
List: Incorrectly predicts the loss of inhibition when an inhibitor is presented without a US. Does not explain latent inhibition. Fails to fully account for blocking, suggesting a role for attention.
51
Mackintosh Model
Core Idea: Animals pay attention to relatively good predictors of the US and tune out bad or redundant ones. Learning is influenced by attention to the CS.
52
Pearce-Hall Model
Core Idea: Animals attend to uncertain predictors. Attention to a CS on a given trial depends on how poorly the US was predicted on the previous trial.
53
Hybrid Learning Models
Models that propose both attentional processes (as described by Mackintosh and Pearce-Hall) can operate and play complementary roles in learning.
54
Wagner's Short-Term Memory Model
Expansion of RW: Proposed that learning is influenced by the surprisingness of both the US and the CS, based on whether they are "primed" in short-term memory (due to recent presentation or retrieval from long-term memory).
55
Wagner's SOP Model (Standard Operating Procedure)
Core Idea: CS and US representations in long-term memory can be activated to different levels (A1 - focal awareness, A2 - peripheral awareness). Associations form only when both are in A1 simultaneously. Accounts for timing effects in conditioning.
56
AESOP Model
Extension of SOP: Proposed that USs have both "sensory" and "emotive" nodes, leading to different kinds of responses with different decay rates.
57
Configural Effects (in conditioning theory)
Definition: Situations where organisms respond differently to a CS when it is presented alone versus when it is presented in a compound with other stimuli. Some extensions of SOP attempt to explain these.
58
Overall Significance of Conditioning Theories
They provide frameworks for understanding the principles of learning and can account for a wide range of experimental findings in Pavlovian conditioning. SOP (with extensions) is highlighted as a particularly powerful theory.
59
Jimmy has severe mental health issues (because of all the craziness we put him through in earlier lessons). He goes to a therapist who tries to help him. Try to think through all the following scenarios with RW in mind. The therapist first wants Jimmy to be calm when in the office. So, Jimmy is allowed to go into the office and spend time there with nothing happening – to learn that it is a safe space. Later, when scary memories are brought up in there, Jimmy will be less scared than he would have been elsewhere. What kind of experiment is this? Would RW predict this effect or not?
This is a latent inhibition experiment. The office is a CS which is first presented with no US (nothing scary). Later, when it is paired with scary things, the association forms more slowly. RW would not predict this effect. In the beginning, there is no reason for Jimmy to be scared of the office (it is an NS), and so RW predicts he will learn nothing.
60
The therapist gives Jimmy toys to hold onto when reliving scary experiences. Sometimes he is given a teddy bear. Other times, the teddy bear is not available, and Jimmy is instead given both a doll and a stress ball. According to RW, which of these three things will Jimmy associate most strongly with negative memories? Why?
This is an overshadowing experiment. RW says that the teddy bear is associated with an aversive US alone, so it will gain lots of associative strength. The doll and the stress ball are always presented together, so they will compete with each other for strength, and will overshadow each other. So, Jimmy will learn less about both of them than about the bear.