Instrumental Learning Flashcards

1
Q

What are the instrumental conditioning procedures?

A

Positive reinforcement = R -> appetitive (more R)

Punishment = R -> aversive (less R)

Negative reinforcement = R -> no aversive (more R)

Omission training = R -> no appetitive (less R)
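
The four procedures form a two-by-two table: the type of outcome (appetitive or aversive) crossed with whether the response produces or prevents it. A minimal sketch of that table as a lookup follows; the function and argument names are illustrative, not standard terminology.

```python
from typing import Tuple


def classify_procedure(outcome: str, response_produces_outcome: bool) -> Tuple[str, str]:
    """Return (procedure name, expected change in responding R).

    outcome: "appetitive" or "aversive" (hypothetical labels for this sketch).
    response_produces_outcome: True if R -> outcome, False if R -> no outcome.
    """
    table = {
        ("appetitive", True): ("positive reinforcement", "more R"),
        ("aversive", True): ("punishment", "less R"),
        ("aversive", False): ("negative reinforcement", "more R"),
        ("appetitive", False): ("omission training", "less R"),
    }
    return table[(outcome, response_produces_outcome)]
```

For example, `classify_procedure("aversive", False)` describes a response that prevents an aversive event, which is negative reinforcement.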

2
Q

What is Thorndike’s law of effect?

A

Animals repeat actions that lead to a satisfying state of affairs. This strengthening of the action by its consequence is called reinforcement.

3
Q

What did Hull believe reinforcement is due to?

A

Reinforcement is due to drive reduction: the animal will work for food if it is hungry, for water if it is thirsty, and so on.

4
Q

What are the simple schedules of reinforcement and their effects?

A
  • Continuous reinforcement (CRF): reinforce every response.
  • Fixed ratio (FR): reinforce every nth response. Produces a pause after each reinforcement, followed by fast responding.
    • This schedule is used because, after a while on continuous reinforcement, the animal would be full.
  • Variable ratio (VR): reinforce every nth response on average. Produces continuous fast responding.
  • Fixed interval (FI): reinforce the first response after time t has elapsed since the last reinforcer. Produces a pause after each reinforcement, followed by a gradually increasing response rate.
    • Example with t = 30 seconds: once 30 seconds have passed, the first response earns a pellet.
    • As the 30-second mark gets closer, the animals get excited and respond faster.
  • Variable interval (VI): same as FI but with a variable time period. Produces a continuous, moderate response rate.
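
The rules that decide whether a given response earns a reinforcer can be sketched in code. This is a minimal illustration, not a model of the animal's behaviour: the class and method names are hypothetical, VR is approximated as a random ratio (each response pays off with probability 1/n), and VI intervals are assumed to be drawn from an exponential distribution around the mean.

```python
import random


class FixedRatio:
    """FR n: reinforce every nth response (FR 1 is continuous reinforcement)."""

    def __init__(self, n):
        self.n = n
        self.count = 0

    def respond(self, t):
        # t (time of the response) is ignored by ratio schedules.
        self.count += 1
        if self.count == self.n:
            self.count = 0
            return True
        return False


class VariableRatio:
    """VR n, approximated here as a random ratio with payoff probability 1/n."""

    def __init__(self, n, rng):
        self.n = n
        self.rng = rng

    def respond(self, t):
        return self.rng.random() < 1.0 / self.n


class FixedInterval:
    """FI t: reinforce the first response made once `interval` seconds
    have elapsed since the last reinforcer."""

    def __init__(self, interval):
        self.interval = interval
        self.last_reinforcer = 0.0
        self.required = self._draw()

    def _draw(self):
        return self.interval

    def respond(self, t):
        if t - self.last_reinforcer >= self.required:
            self.last_reinforcer = t
            self.required = self._draw()
            return True
        return False


class VariableInterval(FixedInterval):
    """VI t: as FI, but each required wait is drawn at random with mean t
    (exponential distribution assumed for this sketch)."""

    def __init__(self, mean_interval, rng):
        self.rng = rng
        super().__init__(mean_interval)

    def _draw(self):
        return self.rng.expovariate(1.0 / self.interval)


fr3 = FixedRatio(3)
print([fr3.respond(t) for t in range(6)])
# → [False, False, True, False, False, True]
```

Note that on the interval schedules extra responses before the interval has elapsed earn nothing, whereas on the ratio schedules every response counts towards the next reinforcer.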
5
Q

How do we know an animal learns an action rather than an association?

A

Training the hamster to go the other way would show that it has learnt the action and not just the association.

With this bidirectional control, the hamster can change direction, so it has learnt the action. This is instrumental learning.

6
Q

What happened when animals were overtrained?

A

Here, the devalued group press the lever a bit more than the non-devalued group.
They press the lever but would leave the reward. This is a habitual, automatic reflex.

The overtrained animals are exhibiting what Adams and Dickinson called a habit, something that an S->R account would expect, where the current outcome value has no impact on the probability of making a response in the presence of the discriminative stimulus. But what about the other group of animals?
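
The difference between a habit and an outcome-sensitive action can be put as a toy calculation. This is only a sketch under stated assumptions: the function name, the `strength` parameter, and the idea of scaling by a numeric outcome value between 0 and 1 are illustrative and do not come from Adams and Dickinson.

```python
def response_probability(controller, outcome_value, strength=0.8):
    """Toy probability of responding in the presence of the stimulus.

    controller: "habit" (S->R) or "action" (outcome-sensitive); hypothetical labels.
    outcome_value: current value of the outcome, 1.0 = valued, 0.0 = devalued.
    strength: assumed strength of the learned response.
    """
    if controller == "habit":
        # S->R: the stimulus triggers the response regardless of outcome value.
        return strength
    if controller == "action":
        # Goal-directed: responding scales with the current outcome value.
        return strength * outcome_value
    raise ValueError("controller must be 'habit' or 'action'")
```

Under these assumptions, devaluing the outcome leaves the habit's response probability unchanged but drives the action's response probability to zero.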

7
Q

What did Colwill & Rescorla (1990) do?

A
  • Put a rat in a box
  • It can pull a chain or press a lever
  • Pulling the chain gives food; pressing the lever gives sugar water
  • Which outcome it gets depends on the stimulus presented (light or tone)
  • The sugar water is then paired with lithium chloride
  • When tested in extinction, the rats won't press the lever
  • Their action depends on whether the outcome is one they currently desire
8
Q

Tony Dickinson suggested two kinds of instrumental learning. What are they?

A

Habits and actions

9
Q

What is the castaway dilemma?

A

Someone cast away on a desert island is hungry but manages to find and eat coconuts. Then they become thirsty and there is no water available. What do they do?
The answer is pretty obvious: they drink the coconut milk. But would an animal be able to learn this?

10
Q

What did Dawson and Dickinson find originally?

A

They found no difference in performance of the two actions.

In other words, the animals showed no sign of solving the castaway dilemma.

11
Q

What did Dickinson eventually find?

A

If the animal doesn’t know that the sugar water is good, it won’t solve the problem. Once it finds out that sugar water is the better option, it can solve the dilemma.
If it has no such experience, there is no effect.

12
Q

What is the model of instrumental performance?

A
  • On the basis of these results, Tony Dickinson has argued for a model of instrumental performance that requires inference. The animal is postulated to reason that:
    1. I’m thirsty
    2. If I pull the chain I get sugar water
    3. Sugar water is good when I’m thirsty
    4. I’ll pull the chain then.
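
The chain of reasoning above can be sketched as a small lookup, which also makes clear why the animal fails without prior experience of the outcome's value. The function name, the data structures, and the lever/food pairing are hypothetical choices for illustration, not part of Dickinson's account.

```python
def choose_action(drive, action_outcomes, incentive_knowledge):
    """Pick an action whose outcome is known to be good under the current drive.

    drive: the current motivational state, e.g. "thirsty".
    action_outcomes: learned action -> outcome contingencies.
    incentive_knowledge: set of (outcome, drive) pairs the animal has
        experienced as good; empty if it has no such experience.
    Returns the chosen action, or None if no inference can be made.
    """
    for action, outcome in action_outcomes.items():
        # Step 3 of the inference: is this outcome good in my current state?
        if (outcome, drive) in incentive_knowledge:
            return action
    return None


# Hypothetical contingencies for the example.
contingencies = {"press lever": "food pellets", "pull chain": "sugar water"}
experienced = {("sugar water", "thirsty")}

print(choose_action("thirsty", contingencies, experienced))  # → pull chain
print(choose_action("thirsty", contingencies, set()))        # → None
```

The second call corresponds to an animal that has never experienced sugar water while thirsty: it knows the contingencies but cannot complete the inference.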