Biological 5: Goal-directed and habit-based behaviour Flashcards
(47 cards)
What is general vs specific PIT?
General PIT = stimulus paired with rewarding thing
Specific PIT = stimuli both paired with one specific reward
What is a positive and negative prediction error?
Positive prediction error - increase connection of stimulus response
Negative prediction error - association between cue and outcome is weakened
What does the Kamin blocking effect show?
Shows that prediction error is more important than things occurring close in time
What two things can behaviour be?
1) Purposeful, ‘goal-directed’ actions
2) Automatic, ‘habitual’ responses
What is the goal-directed system of behaviour vs the habit system of behaviour?
Goal-directed:
Stimulus -> outcome -> response
Habit:
Stimulus -> response
Do animals learn my problem solving or trial and error? (Thorndike’s puzzle box)
Trial and error
Thorndike’s puzzle shows this:
When a cat figured out how to escape from puzzle box, time it took to escape gradually decreased as trials went on
If it had solved by problem solving, there should have been a sharp drop - a lightbulb moment
The shape of animal learning curve suggests that learning is an incremental process, reflecting the strength of association between stimulus and response
What is Thorndike’s law of effect?
‘If in the presence of a stimulus a response is followed by a satisfying state of affairs, the connection between the stimulus and the response will be strengthened. If the response if followed by an annoying state of affairs, the connection between the stimulus and response will be weakened.’
Stimulus response association is mediated by outcome
Does outcome drive conditioned behaviour or habitual responses?
Conditioned behaviour
Habitual responses not based on outcome
What does a stimulus-response association result in?
Results in responding without any expectation that the response leads to the outcome (habit based responding)
What does a response-outcome association result in?
Results in responding because there is the expectation that the response will result in the outcome (goal-directed responding)
How can you tell goal-directed behaviour from habitual behaviour using the outcome devaluation test?
If behaviour persists despite outcome being lower in value = habitual
If it does not = goal directed
Mouse press lever to get cheese
Then devalue cheese - huge block of cheese - too full - cheese is less appealing
If behaviour is goal directed, then they do not press lever, because they are aware that pressing the lever leads to cheese, and right now they do not want the cheese
If behaviour is habitual, then they do press the lever because they do not know/are not concerned with the outcome
What is Baxter and Murray’s (2002) outcome devaluation test on monkeys?
Training task 1: monkeys trained so that they get cherry for choosing correct object, and nothing for choosing incorrect object
Training task 2: monkeys trained on two different objects - gets peanuts for choosing correct object and nothing for choosing incorrect object.
Prior to the test the monkey is allowed to consume one of the rewards (cherries or peanuts) until sated. In test monkey is allowed to choose one of the two correct objects.
Which one will the monkey choose?
If they avoid the one that they are full of - behaviour is goal-directed
If an animal is trained that lever 1 results in sucrose, and lever 2 results in pellets, and then sucrose is paired with LiCl (makes them ill), in the test phase, which lever should they press in behaviour is goal-directed and not habit based?
Lever 2
Same would happen vice versa if you devalued the pellets
Instrumental learning is generally the result of an outcome - goal directed
If an animal is given extended training that lever 1 results in sucrose, and lever 2 results in pellets, and then sucrose is paired with LiCl (makes them ill), in the test phase, which lever should they press?
Levers 1 and 2 equally - behaviour is habit based not goal directed
Small number of training sessions - goal directed
Extended training - habits
When is instrumental behaviour sensitive to outcome devaluation?
If it is still in early phases of training
- Instrumental behaviour is initially sensitive to outcome devaluation (i.e., it is goal-directed, R-O)
- If the instrumental behaviour is overtrained it is no longer sensitive to outcome devaluation (i.e. it is habitual, R-R)
In what mental health condition do patients show an insensitivity to outcome devaluation?
OCD - more habit responding behaviour - not based on outcome
Does stress result in less sensitivity to outcome devaluation?
Yes
What substance can make someone less sensitive to outcome devaluation?
Alcohol
Actions learnt under the influence of alcohol are less sensitive to outcome devaluation
Alcohol seeking behaviour becomes insensitive to devaluation if over-trained
What is the dorsomedial striatum involved in?
Goal-directed behaviour
What is the basolateral amygdala involved in?
Incentive value
What is the medial PFC involved in? Infralimbic vs prelimbic
Infralimbic - Encoding associations
Prelimbic - SR learning
What are the four main parts of the dorsomedial striatum?
Caudate
Putamen
Nucleus accumbens
Amygdala
Yin et al. (2005) conducted a devaluation test on rats who had lesions to the posterior dorsomedial striatum.
(left lever = sucrose, right lever = pellets, they eat one to satiety, which lever will they push?)
If DMS involved in habitual learning, there will be goal-directed behaviour
If DMS involved in goal-directed learning, there will be habitual behaviour
What did they do?
The sham rats pressed the lever much more when the outcome was NOT devalued
However the DMS lesioned rats showed no difference in lever pressing irrespective of whether the outcome had been devalued.
This is habit-based behaviour.
This shows that the DMS is important for goal-directed learning
What is an issue with devaluation tests?
If an animal is impaired on the extinction test (choice after one outcome is devalued) it may be because they cannot discriminate between the rewards
Expectation of reward drives goal-directed behaviour - need a test that is more specific in showing response to expectation of reward