Topic 7: Differential reinforcement Flashcards
Differential reinforcement
-Operant training procedure in which some behaviors are systematically reinforced and others are not
-Reinforcement + Extinction
-Through practice extinction can be punishment instead
4 different types of differential reinforcement
-Low rates of responding (DRL)
-High rates of responding (DRH)
-Other behavior (DRO)
-Alternative behavior (DRA)
DRL
-Reinforcer is delivered contingent upon a low rate of responding during certain period of time
-Behavior is reinforced only if it occurs no more than a set maximum
-Results in less of the behavior but does not completely stop behavior
-A behavior needs to first be learned before reinforcement can occur
Examples of DRL
-Reinforce rat pressing lever only if it has been at least 10 seconds since last lever press
-Reset clock with each response (reinforced or not)
-On road trip, child will get treat in end if they don’t ask “Are we there yet” more than once an hour
-If they exceed the limit, no reinforcement
3 different types of DRL
1) Full-session DRL
2) Spaced-responding DRL
3) Interval DRL
1) Full-session DRL
-Reinforcement delivered if fewer than a specific number of responses occur in a period of time
-Rate within session does not matter
-eg) kid can ask all three questions right away, or they can space them out
2) Spaced-Responding DRL
-Reinforcement delivered if a set amount of time passes between responses
-Wait at least “X” times between responses
-Child only raises their hand after 15 minutes passes
-Low rates come from spacing out responses, therefore slowing down occurrances
-Timing more controlled
3) Interval DRL (least discusses)
-Session divided into time intervals and reinforcement delivered if behavior does not occur more than once per interval
-eg) Child raises hand only once per 15-minute interval throughout classes
-Mix between full-session and spaced-responding
Steps for DRL
1) Identify target behavior
2) Evaluate if DRL is best procedure
3) Determine acceptable rate of the behavior
4) Full session or spaced responding DRL?
5) Once plan is set, inform client
6) Implement treatment and provide feedback
DRl use
-Useful for reducing, but not eliminating, rates of undesired behaviors
-Longer intervals and lower thresholds produce lower rates
-Can start with less strict criteria and change over time
-Concerns with superstitious behavior
-Reinforcement delivered but not contingent on the behavior happening, but on the behavior NOT happening
DRH
-Reinforcer is delivered contingent upon a high rate of responding during a certain period of time
-Eg) The behavior is reinforced only if it occurs AT LEAST a set minimum number of times in a set time period
-Results in more of the behavior, increases desired behavior
Examples of DRH
-Reinforce rat pressing lever only if it has been pressed at least ten times within 10 seconds
-Keep track of number of responses within set time period and deliver reinforcement once criteria are met
-Child gets ice-cream at the end of the day if they answer at least two questions
DRO
-AKA differential reinforcement of ZERO responding
-Reinforcement is delivered contingent upon the absence of the behavior during certain period of time
-Decreased responding to ZERO
-Useful if extinction is not possible
-If we can’t remove reinforcer, we give a more appealing reinforcer for stopping the behavior
Examples of DRO
-Reinforce a rat if it does not press the lever in the box ten seconds
-Reset clock if response made
-On road trip, child will get ice-cream at the end if they don’t ask “Are we there yet” at all
-If they ask, then no response will be made
“Other behavior”
-Extinction of undesired behavior
-Only works if we are able to withhold reinforcer
-Not reinforcing an “Other behavior”
-Reinforces absence of behavior
-However, will engage in other behaviors naturally if avoiding undesired behavior
Short VS long DRO interval
-Use short DRO intervals for frequent problem behavior
-Use long DRO intervals for infrequent problem behavior
-Determined based on baseline
Steps for DRO
1) Identify target behavior
2) Evaluate if DRO is best procedure (DRL?, Extinction?)
3) Determine effective reinforcer
4) Choose initial DRO interval
5) Inform client, this helps eliminate chance of superstitious behavior to occur
6) Implement treatment and provide feedback
2 types of DRO
1) Whole interval DRO
2) Momentary DRO
1) Whole interval DRO
-Problem must be absent for the whole interval for reinforcement
eg) No talking for entirety of class period
2) Momentary DRO
-Problem must be absent at the end of the interval for reinforcement
eg) At specific times you wander through the class to ensure everyone is quiet
DRA
-Reinforcer is delivered contingent upon the occurrence of the desirable behavior
-Occurrence of undesired behavior is put on extinction
-Reinforcement of wanted behavior and extinction of unwanted behavior
Steps for DRA
1) Identify target behaviors to increase and decrease
2) Evaluate if DRA is best procedure
3) Determine effective reinforcer
4) Determine method of extinction or devaluation
5) Inform client
6) Implement treatment and provide feedback
Is DRA appropriate?
-We need to be sure that we have a desirable behavior to increase to use DRA
-That desirable behavior needs to happen at least sometimes for it to be reinforced
-Can the undesired behavior to extinguished?
-Can we withhold the reinforcer used and if not, can we devalue/overpower it?
Desirable behaviors
-Desirable behaviors should require less effort than the undesired behavior
-Especially important if we can’t use extinction on undesired behavior
eg) Healthy food easily accessible whereas junkfood is hard to reach