Chapter 5: Operant Conditioning Flashcards

1
Q

outcomes are ________ of an animal’s behaviour

A

consequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is instrumental behaviour emitted?

A

Because it is effective in producing a particular consequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Operant conditioning?

A

a response defined by the effect it produces in the environment (goal- directed)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

E. L. Thorndike

A

Puzzle box
– Hungry animal placed in box
– Food visible to animal
– Measured how long it took to escape the box on successive trials

Thorndike measured latency to get out of box on successive trials

Law of effect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Law of effect

A

Responses in presence of stimulus followed by satisfying events
will strengthen the association between the S and R;
If response followed by annoying event, association is weakened

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Standard tasks for examining instrumental behaviour: Discrete trial procedure

A

– Instrumental response produced once per trial
– Each training trial ends with removal of the animal from the apparatus

ex. Operant Devices
Mazes
Inspired by observing animals in nature (burrows of rats)
measure: Running speed, Latency, Correct choices

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Standard tasks for examining instrumental behaviour: Free-operant procedure

A

– Animals remain in apparatus and can make many responses
– No intervention by the experimenter
– Developed by BF Skinner
– Need a unit to measure behaviour:
•Operant response: defined by the effect that the response produces on the environment
– Response rates are often the behavioural measure

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Standard tasks for examining instrumental behaviour: Magazine training

A

– How you establish the operant response
– Animal must be shown how to “use” the experimental set up
– You need a series of training steps
– Involves classical conditioning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Examining instrumental behaviour

• Shaping

A
– Sequence of training steps
1) Light always occurs with
food delivery 
2) Light and tone occur
with food delivery 
3) Tone occurs with food
delivery
• Light does not occur
anymore

shaping takes advantage of response variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Burrhus Frederick Skinner

A

shaping a pigeon to turn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Response Variability

A

Behaviour starts off highly variable
Successful variations are maintained;
unsuccessful variations are not

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

appetitive stimulus

A

pleasant outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

aversive stimulus

A

unpleasant outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

instrumental responses can

A

result in a stimulus

turn off a stimulus

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Reinforcer

A

Reinforcer : An event that follows behaviour,

and that behaviour increases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Punisher:

A

Punisher: An event that follows behaviour,

and that behaviour decreases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Reinforcement:

A

increases responding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Punishment

A

decreases responding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

positive

A

add something (appetitive or aversive)

20
Q

negative

A

remove something (appetitive or aversive)

21
Q

positive punishment

A

response-outcome contingency: response produces an aversive stimulus

result: decrease in response rate

22
Q

positive reinforcement

A

response-outcome contingency: response produces an appetitive stimulus
Result: incerase in response rate

23
Q

Negative reinforcement

A

response-outcome contingency: eliminates or prevents the occurrence of an aversive stimulus
result: increase in response rate

24
Q

Negatove punishment

A

omission training

response-outcome contingency: response eliminates or prevents the occurrence of an appetitive stimulus

result: decrease in response rate

25
Omission training (DRO treatment) for Self-Injurious Behaviour
Reinforced with attention when not performing self-injurious behaviour
26
Reinforcer vs. Punisher is all based on the __________
Reinforcer vs. Punisher is all based on the | target behaviour
27
Instrumental Conditioning: The Players
1. Instrumental response 2. Outcome of response (reinforcer) 3. Response-reinforcer relation
28
Instrumental response
• Thorndike “stamping in” of S-R association • Skinner “reinforcement” or strengthening of behaviour • Language suggests that instrumental conditioning results in necessarily stereotyped behaviours • Led to studies of Response Variability
29
Response Variability | Method:
– Pigeons in operant chamber – Peck two response keys a total of 8 times – No restriction on distribution of pecks between the two keys – BUT, pattern of left/right pecks on a given trial had to be different that on the previous 50 trials – Only “novel” patterns reinforced Summary: • Response variability can be increased by reinforcement • In absence of reinforcing variability in responses, responding becomes more stereotyped
30
Relevance or Belongingness
Relevance or Belongingness • Remember bright-noisy/water Classical Conditioning experiment with rats? – (taste with sickness, audiovisual with pain) • Belongingness - Proposed by Thorndike – Evolutionary history makes certain responses belong with certain reinforcers – Pawing at door vs. yawning
31
The outcome or reinforcer
• (1) Quality and quantity • (2) Previous experience with other reinforcers (shift in quality or quantity) – Similar to R-W • Larger than expected reinforcer (or US) supports excitatory conditioning • Smaller than expected reinforcer (or US) supports inhibitory conditioning
32
Manipulation of Quality vs. Quantity
• Independent groups of rats given flavoured water reinforcement • More responses when reward is greater in quantity and/or quality
33
Response-reinforcer relation
Temporal relation | Causal relation
34
Temporal relation
Time between response and outcome – Reinforcement is most effective if it happens immediately after the target response
35
Causal relation
The extent to which the response is necessary and sufficient for the occurrence of the reinforcer – Reinforcement is most effective if it only happens afterthe target response
36
Effects of Temporal Contiguity
* Immediate reinforcement preferred * Delays over 0.5 s can hinder performance * Recent research suggests: can use delays up to 30 s
37
Effects of Delayed Reinforcement
Delay between response and food reinforcement in rats | No learning in 64-sec delay condition
38
How to overcome delay in reinforcement?
1. Conditioned (secondary) reinforcer | 2. Marking procedure
39
Conditioned (secondary) reinforcer
A stimulus that becomes an effective reinforcer because of its association with a primary or unconditioned reinforcer (e.g., food) helps overcome delay in reinforcement
40
Marking procedure
Instrumental response is immediately followed by a distinctive event that makes the instrumental response more memorable helps overcome delay in reinforcement
41
Response–reinforcer contingency:
The relation of a response to a reinforcer defined in terms of the probability of getting reinforced for making the response as compared to the probability of getting reinforced in the absence of the response
42
Skinner’s Superstition Experiment
Pigeons trained with temporal-contingent | reinforcement exhibit “superstitious” behaviours
43
Behavioural Momentum
More than just response rates, reinforcers strengthen the tendency to persist in behaviour
44
Reinforcers: primary vs. secondary
Primary: US Secondary: CS (i.e. money)
45
Reinforcers: intrinsic vs. extrinsic
inside | outside (i.e money)