Differential Reinforcement Flashcards
Differential reinforcement
an operant training procedure in which some behaviors are systemically reinforced, and others are not
Differential Reinforcement of Low Rate (DRL)
- behaviour is reinforced only if it occurs no more than a specified number of times in a given period
- results in low rates of responding
- example: reinforce pigeons peck only after 5 seconds have elapsed
- each peck occurring before 5 seconds, resets the interval and does not provide food
- Longer intervals produce even lower rates
- Useful for reducing the rate of problem behaviours
-Differential Reinforcement of High Rate (DRH)
- behavior is reinforced only if it occurs at least a specified number of times in each period
- Results in very high rates of responding
- reinforce pigeon peck only when it has pecked at least 5 times within 10 seconds
- less than 5 responses received nothing and the clock resets - Useful when the goal is to increase rates of behavior
-Differential Reinforcement of Other Behavior (DRO)
- reinforcement is contingent on complete absence of behaviour for a period of time
- reinforce pigeon only after 10 seconds have elapsed with no pecking
- reinforcement is only provided if the behaviour does not occur
- pecking resets the clock
- Not the same as DRL
- DRL reduces rates of behaviour
- DRO eliminates rate of behaviour
- Especially useful when extinction is not an option and reinforcers are intrinsic to the behaviour
- Also called Differential Reinforcement of Zero Responding
-Differential Reinforcement of Alternative Behavior (DRA)
- a desired (replacement) behaviour while an undesired behaviour is extinguished
- procedure used to to increase frequency of desirable behaviour and to decrease undesirable behaviour, reinforced each time it occurs or not reinforced depending
- combines reinforcement for desired behaviour and extinction of undesirable
Examples of DRA
- a child’s good table manners are reinforced with praise and by passing the requested food. The child’s bad table manners are extinguished by ignoring and not passing food
- Punching the correct code numbers in the ATM is reinforced with cash. Punching the wrong numbers in the ATM is extinguished by getting no cash
- typing your correct password gets you into your computer, typing the wrong password does not
- reinforce children’s playing or sharing (with continued play time) to replace fighting (terminate play time)
- Reinforce lying quietly in the dentist chair with brief break to replace disruptive behavior
Variations of DRA
-Differential Reinforcement of Incompatible Behaviour (DRI)
- behaviour that is incompatible with the unwanted behaviour is reinforced
- Increasing the rate of desired behaviour also decreases the rate of undesired behaviour because the two cannot occur simultaneously
Variations of DRA
-Differential Reinforcement of Communication (DRC)
- a communication response is reinforced to replace the problematic behavior
- the communication responses deliver the reinforcer more rapidly than the problem behavior
- Also called Functional Communication Training
Resurgence
- the reappearance during extinction of a previously reinforced behavior
- e.g.
- Train rat to press lever for food
- Extinguish lever pressing
- Train the rat to nose-poke a key for food
- Extinguish nose-poking
- result: rat will begin to press the lever again
Resurgence of DRA
-resurgence predicts the re-emergence of the problematic behavior if the alternative behavior goes onto extinction or cannot be performed
DRA is Sometimes Too Effective?
- if alternative behavior is rapidly learned then it might replace the problem behavior before it has been adequately extinguished
- thus. The problem behavior is likely to return in absence of alternative behavior being performed
Solution for resurgence of DRA
- shape the behavior to larger intermittent reinforcement schedules
- provide additional sessions of extinction where the Alternative Behavior cannot be performed, and the problem behavior is still on extinction
- May be necessary to bring behavior into stimulus control (i.e., create a S^delta)
- provide additional sessions of extinction where the Alternative Behavior cannot be performed, and the problem behavior is still on extinction
Possible solutions if DRA is too effective
- do not begin reinforcing the alternative behavior immediately
- may not be practical or ethical- Provide training sessions where the Alternative behavior cannot be performed, and the problem behavior is on extinction
- may be necessary to bring the alternative behavior into stimulus control (i.e., create S^delta)
Identifying putative behaviors
- Use reinforcer maintaining undesirable behavior
- Observe activities that are enjoyable and occur with high probability
a. Playing video games, tacking tv - Ask questions
a. what do they like?
b. What do they enjoy doing? - Conduct Preference Assessment
- Make the reinforcer contingent on a response
- Note: the reinforcer you ultimately select is only a “putative” reinforcer for the desired behavior
-Single Stimulus Assessment
- potential reinforcer presented individually multiple times in random orderings
- Percentage of approaches are calculated
-Paired Stimulus Assessment
also called forced choice or paired choice procedure
- potential reinforcers are presented in pairs
- each stimulus is presented with every other stimulus multiple times
- Percentage of approaches/selections are calculated
Multiple Stimulus Assessment
- Potential reinforcers are presented in a full array
- Items are removed as they are chosen
- Process is repeated with varied item orderings
- Items chosen first are likely to more reinforcing
How to use DRA
- Define the target behavior to increase and decrease
- Identify the reinforcer for the problem behavior
- Choose a reinforcer for the desirable behavior
- Reinforce desirable behavior immediately and consistently
a. Prompt the behavior id necessary
b. Prompt before important EO and SD are present
c. Desirable behavior should require less effort than the undesirable behavior - Extinguish or devalue the reinforcer for the undesirable behavior
a. Consider using DRL, DRO procedures if not possible - Begin to incorporate intermittent schedules of reinforcement for the desirable behavior
What makes a DRA easier
-easiest if the desirable behavior occurs occasionally or can be prompted
How can DRA promote creativity ?
- differentially reinforce responses that have not recently been used
- creates response variability rather than repetition
Ratio Strain
-disruptive of the pattern of responding due to stretching the ratio of reinforcement too abruptly or too far
What influences the limits of ratio strain
Reinforcer (magnitude, establishing operations quality, delay)
- type of schedule
- how it was stretched
Self-modification is susceptible to
Ratio strain because you control access to the reinforcer
-make contingencies very easy at first and slowly stretch ratio requirements
Antecedents
Aka controlling stimuli