PSYC2050 - Wk4 Operant Conditioning Flashcards Preview

PSYC2050 > PSYC2050 - Wk4 Operant Conditioning > Flashcards

Flashcards in PSYC2050 - Wk4 Operant Conditioning Deck (54)
Loading flashcards...
1
Q

What is the difference between classical and operant conditioning?

A

Pavlovian: reflexive associations between stimuli result in involuntary responses
vs
Operant: consequences of past actions influence future voluntary behaviour

2
Q

do behaviours increase or decrease as a result of operant conditioning?

A

Both, depending on whether the past consequences of that behaviour reinforced or punished it

3
Q

What is the basic principle of operant conditioning?

A

Consequences

4
Q

When would a behaviour tend to be repeated or become more frequent in operant conditioning?

A

When it results in rewards

5
Q

What happens to behaviours that result in punishment? 2

A

They become less frequent or are avoided

6
Q

what process lead the cats to escape from Thorndike’s puzzle box?

A

Trial and error learning. Random behaviours had an effect at some point

7
Q

What did a cat have to do to escape from thorndike’s puzzle box? 3

A

Pull a string, step on a platform, and turn a latch on the door

8
Q

What is the law of effect?

A

The tendency to perform an action is increased if reward, weakened if it is not

9
Q

How does ‘shaping’ teach a new behaviour to animals?

A

The tendency to perform an action is increased if rewarded, weakened if it is not.

10
Q

How is Operant conditioning at play in the real world, without an experimenter/trainer?

A

Animals adapt behaviourally to environmental feedback (eg foraging)

11
Q

What happens if you randomly reward pigeons every 15 seconds? And why do they do this?

A

They show superstitious behaviour, which is self-perpetuating through reinforcement. The behaviour is just whatever the bird was doing before the reward.

12
Q

What is it called when random reinforcement shapes behaviour?

A

Superstitious behaviour

13
Q

What are some examples of superstitious behaviour in humans?

A

Athlete warm up rituals
Lucky clothes
Lucky charms
Pedestrian crossing buttons

14
Q

Why do people engage in superstitious behaviour?

A

we try to find links between behaviour and an outcome, even if there is no true association

15
Q

What are two things that happen in shaping?

A

Scan - observing and waiting for behaviour

Capture - reinforce behaviour resembling target behaviour

16
Q

How does baiting work? And involve Pavlovian conditioning?

A

Removing primary reward and associating it with another kind of stimulus/indicator

17
Q

What are the ways we can teach a new behaviour?

A
Shaping (scan and capture)
 Baiting
 Mimicing
 Sculpting
 Instruction (language)
18
Q

What is backward chaining? And why does it work?

A

Acquiring a new behaviour in small pieces from last one to the first. It’s easier when done in bits.

19
Q

What are types of reinforcers and punishers, both negative and positive for each?

A

R+ ice cream
R- less chores
P+ shock
P- Tv privileges

20
Q

how are reinforcers and punishers different?

A

Reinforcer: increases behaviour

Punisher: decreases behaviour

21
Q

What is the difference between positive and negative reinforcers and punishers for the animal?

A

Positive: the animal receives something

Negative: something is taken away from the animal (or environment)

22
Q

Is punishment necessarily irritating?

A

No

23
Q

Is reinforcement necessarily rewarding?

A

No

24
Q

What positive reinforcement do?

A

Adds something to increase behaviour

25
Q

What is an example of positive punishment?

A

Anti-barking collars or getting told off

26
Q

If you lose your licence, what is this in terms of operant conditioning?

A

Negative reinforcement - remove something to decrease behaviour (eg time out)

27
Q

What is bridging? And how does it help with learning using positive reinforcement?

A

A conditioned reinforcer: A useful association between an instant stimuli and a subsequent reward. This stimulus signals the reward is coming.

It bridges time between behaviour and primary reinforcement when there needs to be no time delay.

28
Q

How are horses trained? 2

A

Through shaping and negative reinforcement (removing reign pressure)

29
Q

Why is continuous reinforcement not always possible?

A

We cant always be around to deliver it

30
Q

What types of partial reinforcement can we use? 4

A

Fixed ratio - every nth
Variable ratio - on average every nth
Fixed interval - first behaviour after n seconds
Variable interval - on average, first behaviour after n seconds

31
Q

What schedule of reinforcement does gambling fall under?

A

Variable ratio - the reward will come but you dont know when exactly

32
Q

Why is variable ratio PRF very efficient?

A

It teaches and engravings persistence

33
Q

Which schedule of reinforcement is the most resistant to extinction?

A

Variable ratio

34
Q

Why are responses to fixed reinforcement schedules not linear, but variable ones are?

A

Fixed schedules have a post-reinforcement pause, where the animal learns its pointless to do anything straight away

35
Q

What are differences in response rates to various reinforcement schedules (in rats)? 4

A

VR is the fastest learning; followed by FR. VI and FI take longer to learn responses. FI takes the longest

36
Q

What is the post-reinforcement pause? And what types of schedules does it appear ?

A
The animal has a break after reinforcement
 Fixed schedules (FI & FR)
37
Q

What is more effective? Continuous or partial scheduling?

A

Continuous, but its not always possible

38
Q

Why is reinforcement more effective?

A

It strengthens the correct behaviour in the animal’s repertoire of behaviour, whereas punishment doesn’t actually tell the animal what the right thing to do is.

39
Q

What are the problems with punishment? 2

A
  • Less permanent (extinguishes faster)

- Reduces trust and increases aggression

40
Q

How do you punish effectively? 8

A
  1. No escape
  2. As intense as possible (within limits)
  3. Continuous schedule
  4. No delay
  5. Over a short period of time
  6. No subsequent reinforcement
  7. Reinforce an incompatible, appropriate behaviour concurrently
  8. Watch for side effects (aggression, fear, modelling violence, learned helplessness, change to other behaviour)
41
Q

Why cant you do bridging to reduce the delay in punishment?

A

It leads to escape behaviour, because you have signaled that punishment is coming

42
Q

What are reward variables in Operant Conditioning? 3

A

Drive, size, and delay

43
Q

What is Drive?

A

How much the organism wants the reinforcer (eg dogs that want to sniff things out/persist are good drug sniffers)

44
Q

how could drive affect studies in animal behaviour?

A

Hungry vs sated organisms will respond differently to food rewards

45
Q

What is the trade off for size of the reward?

A

Diminishing returns

46
Q

How does the size of the reward affect acquisition and extinction of behaviours?

A

Makes them happen faster

47
Q

How does delay produce problems for reinforcement and punishment?

A

Short term reinforcement are more motivating than long term punishments (eg eat the snack now vs be fat later)

48
Q

What is the three term contingency?

A
  1. Discriminative stimulus (occasion)
  2. Operant response (behaviour)
  3. Outcome (consequence)
49
Q

What does a discriminative stimulus do?

A

Signals the occasion when a particular behaviour will be punished/reinforced

50
Q

What is the key condition to operant conditioning? And when do stimuli become “signals”?

A

Learning to discriminate stimulus

When stimuli are predictive of a consequence

51
Q

What is stimulus generalisation?

A

When a response is reinforced in the presence of stimulus there is a tendency to reproduce for similar or associated stimuli

52
Q

What is stimulus discrimination?

A

Degree to which different stimuli set the occasion for particular responses. A precise degree of stimulus control

53
Q

How is stimulus discrimination taught?

A

Behaviour happens when stimulus is present, abates when absent

54
Q

Is stimulus control pervasive?

A

Much of our everyday behaviour is under stimulus control (eg traffic light signals)