learning and cond lect 2 operant cond Flashcards

1
Q

learning curve (Thorndike)

A

-how long it takes cats to escape puzzle box to access food on outside
-inside box was lever that would open box
-same cat in box, cat learns lever opens box to access food
-time for cat to open the box was plotted on a graph into a learning curve

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

what does thorndikes learning curve show us

A

-responses ending in satisfactory outcome are more likely to be repeated
-discomforting responses are less likely to be repeated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what is thorndikes law of effect

A

-first formal statement of importance of consequences in learning
-established the beginning of operant conditioning even if not named at this stage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

3 term contingency

A

ABCs of behaviour
-A= antecedent
-B = behaviour (what people say/ do, not attitudes or feelings)
-C = consequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what are behaviour characteristics

A

they can have 1 or more dimensions, can be observed/described/recorded, have an impact on envrionment, may be overt or covert

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

effect of consequences on behaviour

A

e.g you get a higher mark on your essay for critical evaluations so you do this in your next essay
-reinforcement: after behaviour increases probability of future similar behaviours
-punishment: decreases probability of future similar behaviours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what are the 4 basic consequences

A

pos reinforcement: something added to increase likelihood of future behaviour

neg reinforcement: something removed to increase likelihood of future behaviour

pos punishment: something added to reduce likelihood of future behaviour

neg punishment: something removed to decrease likelihood of future behaviour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

SD =

A

discriminative stimulus (antecedent)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

SR + =

A

pos reinforcement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

role of motivation in reinforcement

A

-motivation is an establishing operation (EO)
-we expect a behaviour when we have an EO
e.g
EO (thirsty) - SD (tap marked with C for cold water) - response (turn tap on with C) - SR+ (cold water presented)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

superstitious behaviour

A

skinner: you can accidently cond superstitious behaviour in animals
-when reinforcement e.g food pellet accidently follows a behaviour that did not produce the reinforcement, animals can be cond for superstitious behaviours
e.g sports players sometimes wear lucky socks as they won a game wearing these once or twice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

skinner research

A

-pigeons provided with reinforcement at regular intervals with no reference to birds behaviour
-if bird happens to be executing a particular response at same time, they tend to repeat this behaviour
-if interval is not too long that the pigeon would have forgotten, it does the same thing again and by coincidence food pellet is given again (leads to accidental reinforcement of superstitious behaviour)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

escape contingency

A

-in escape contingency a response terminates a stimulus that is present e.g sun in eyes so move hand to cover sun, glare of sun escaped

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

avoidance contingency

A

-response prevents or postpones presentation of stimulus e.g hear a friend you dislike coming so look down so disliked friend is avoided

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

similarity between pos and neg reinforcement

A

increase in response via stimulus change

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

diff between pos and neg reinforcement

A

-pos produces previously absent stimulus
-neg removes stimulus present prior to behaviour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

factors effecting reinforcement

A

-timing between stim and response (best if stim given straight after behaviour)
-reinforcement is regular after behaviour occurs
-reinforcement needs to be specific to desired behaviour
-for humans a verbal description for which behaviour is being reinforced is helpful

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

conditioning and awareness

A

bradshaw and reed
-pos reinforcement, schedules of behaviour and awareness
-over 10 experiments
-computer task: press button to earn points, told to find best way to earn points
-ratio schedule: points earned by pressing quickly
-interval schedule: most efficient way was to wait 10 secs then press button
-pp given questionnaire asking what was best way to score points

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

findings of bradshaw and reed cond and awareness study

A

-those who scored most points were aware of how to score these points
-clear relationship with contingency awareness
-performance on schedules of reinforcement sig related to awareness of performance which produced reinforcement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

applying pos reinforcement

A

-tell learner about programme at outset
-describe desired behaviour
-use lots of praise and physical contact
-can gradually fade reinforcers once consistent e.g praise every 2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

nature of reinforcement

A

it is NOT the case that some behaviours are more susceptable to reinforcement than others

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

using reinforcement effectively

A
  1. use high quality reinforcement
  2. set easily achievable initial criterion
  3. explain contingency and provide prompts
    4.deliver reinforcer immediately after behaviour
  4. initially reinforce every occurence of behaviour
  5. gradually increase response to reinforcement delay
  6. use varied reinforcers
  7. shift from contrived to naturally occuring reinforcers
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

strengths of pos/neg reinforcement

A

+ abundance of evidence for effectiveness (skinner)
+ relatively ethical compared to punishment
+ can help indiv and society to develop good pos practices
+ can be effective in LT

23
Q

weaknesses of pos/neg reinforcement

A
  • occasional chance of reinforcing non target behaviour
  • can be used unethically
  • can be lengthy to implement
  • important to get the reinforcer right
  • not useful to prevent harmful behaviours rapidly
24
Q

what is extinction

A

-reinforcement stops occurring and behaviour ceases
williams 1959
-extinction of night time tantrums
-not giving child any attention/reinforcement
-tantrum reduced/ length of crying reduced after a few nights of no attention

25
Q

lovaas and simmons

A

-extinction of self injurious behaviour
-children hitting head against wall/ with hand etc
-extinction used on 2 children, punishment used on 1
-1 childs behaviour who experienced extinction improved, the other didnt
-punishment may not prevent behaviour as it is giving child more attention

26
Q

side effects of extinction

A

-behaviour may increase in frequency, duration or intensity initially (extinction burst)/ resist extinction
-novel behaviours may occur, diff behaviours tried
-emotional response/ agg behaviour may occur

27
Q

misconception about extinction

A

-doesnt mean simply ignoring behaviour, unless attention is the reinforcer

28
Q

using extinction to reduce behaviour

A

-you have to know what the reinforcer is or to no longer remove a good stimulus after the behaviour
-must be consistent to work effectively
-using schedule of reinforcement and begin process of extinction (can take lots of time)
-can work better if another target behaviour is reinforced at same time as replacement

29
Q

strength of extinction

A

+far gentler and more ethical than punishment

30
Q

weaknesses of extinction

A

-can take time if behaviour is deeply entrenched
-if you cant follow through completely you risk behaviour returning stronger than ever
-not always apparent what the reinforcer is and how to remove it

31
Q

punishment

A

-removes/reduces behaviour
-doesnt need to be aversive

32
Q

ST contingency

A

antecedent - behaviour - consequence
-uncond punisher is stimulus whose presentation functions as punishment without being paired with any other punishers e.g pain, certain odours, tastes

33
Q

Saj Waj et al research

A

decreased life threatening rumination in 6 month old infant
-contingent delivery of small drop of lemon juice every time baby ruminates (pos punishment)

34
Q

Luce et al

A

-decreased agg in child by telling them to stand up and sit down 10 times every time they hit someone
-hitting reduced

35
Q

what is overcorrection (pos punishment)

A

client required to engage in effortful behaviour directly linked to problem behaviour in extended period (foxx and azrin)

36
Q

what is restitutional (pos punishment)

A

client must correct environmental effects of problem behaviour and restore natural environment

37
Q

what is positive practice (pos punishment)

A

client must engage in correct forms of relevant behaviour

38
Q

what is time out (neg punishment)

A

-remove person from situation

39
Q

what is response cost (neg punishment)

A

removal of specified amount of a reinforcer contingent on occurrence of a problem behaviour e.g loss of play time in mins

40
Q

what are the factors influencing the power of a stimulus to act as a punisher/reinforcer

A

-contingency (consistency)
-immediacy
-magnitude

41
Q

what are the risks with punishment

A

-emotional reactions
-escape and avoidance of punishment/punisher
-neg reinforcing punishers behaviour
-neg modelling e.g children learning the wrong behaviours
-fails to teach an appropriate replacement behaviour

42
Q

punishment in everyday life

A

in schools, law, friends, family, apps, everyday services e.g gym

43
Q

strengths of punishment

A

+stops behaviour quickly
+easy
+sometimes necessary to prevent harmful behaviours

44
Q

weaknesses of punishment

A

-potential for unethical use
-shouldnt be used as an easy option if extinction is an option
-doesnt indicate to subject which behaviours are desirable, only those undesirable
-punisher may neg reinforce their behaviour

45
Q

what is continuous reinforcement (SCHEDULES OF REINFORCEMENT)

A

CRF
-provides reinforcement for every occurrence of a behaviour
-advantageous for gaining a new skill

46
Q

intermittent schedules of reinforcement (INT)

A

-based on rule which specifies when a reinforcer is delivered
-used to strengthen established behaviours
-usually necessary for the progression to naturally occurring reinforcement

47
Q

example of CRF and INT

A

CRF - vending machine
INT - gambling machine

48
Q

intermittent pos reinforcement schedules

A

-skinner: schedules of reinforcement can be used in place of continuous reinforcement with reliable effects on patterns of behav

49
Q

what are ratio schedules (intermittent pos reinforcement schedules)

A

reinforcement delivered based on no. of responses emitted

50
Q

what are interval schedules (intermittent pos reinforcement schedules)

A

reinforcement delivered based on first response emitted after specified amount of time

51
Q

how can both ratio and interval schedules be subdivided

A

into fixed (exact values) or variable (changing value) schedules

52
Q

what are the 4 main schedules for reinforcement

A

1.fixed ratio
2.variable ratio
3.fixed interval
4.variable interval

53
Q

fixed schedules

A

-response ratio or time requirement remains constant
-fixed ratio: reinforcement after specified no of responses e.g fixed ratio 4 = after every 4th correct response
-fixed interval: reinforcement after 1st response followed by time elapse e.g reinforcement 2 mins after 1st response

54
Q

variable schedules

A

-response ratio or time requirement can change from 1 reinforced response to another
-variable ratio: reinforcement after an average no of responses across schedule e.g variable ratio 4 = reinforcement after an average of every 4th occurrence
-variable interval: reinforcement following first reponse after an average amount of time has elapsed e.g variable interval 2 mins = reinforcing first occurrence after an averaged elapsed time of 2 mins