Unit 2 The matching law, operant extinction and differential reinforcement procedures Flashcards

(82 cards)

1
Q

A compound a schedule that consists of two or more schedules operating simultaneously but independently of each other, each for a different response.

A

Concurrent schedule

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q
  1. Two or more reinforcement contingencies
  2. Contingencies operate independently of each other
  3. Contingencies in effect simultaneously
  4. One or more different response topographies for each contingency
A

Critical attributes of concurrent schedules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Doing one’s homework versus playing a video game

Screening verses asking nicely

A

Examples of concurrent schedules

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Hernstein (1961):

The DISTRIBUTION of behavior on CONCURRENT schedules of positive reinforcement.
-Pigeons “MATCHED” relative rates of behavior to relative rates of reinforcement” Cleary 2004

Deals with Choice. Choice = Concurrent schedules

Response Allocation (Choice) is predicted by this.

This OR That

A

Matching Law

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

B1/B1 + B2 = R1/R1+R2

B = rate of responses for each alternative

R = rate of reinforcement for each alternative

A

Matching law equation

See also picture

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Concurrent schedules and matching law examples

A

See PowerPoint slides for illustrations and video

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Relevance of the matching law to treating problem behavior

Even if problem behavior is reinforced, the greater the DIFFERENCE In Proportion of Reinforcement for a problem behavior versus reinforcement for a REPLACEMENT behavior, the greater the difference in____

A

…RATE between the two.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q
  • Reinforce replacement “Behavior “ On a DENSE schedule and minimize reinforcement for problem behavior
  • We Can live with some failures of procedural INTEGRITY when using Extinction
  • May not need to use extinction at all, provided that the reinforcement schedule for appropriate behavior is dense enough and takes into account other factors.
  • Time-Contingent schedules may be used
A

Matching law and treatment of problem behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

There is more research on concurrent interval Schedules than concurrent ratio Schedules.

However, when concurrent ratio schedules are used, the alternative that produces a higher rate of reinforcement is Chosen almost exclusively

A

**Concurrent Interval verses Concurrent Ratio Schedules*

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The applied literature indicates that when problem behavior is maintained on an interval schedule, one should use interval schedules to reinforce appropriate behavior
Eg, Myerson and Hale

However, one my explore using a Dense RATIO schedule to reinforce appropriate behavior, as there is evidence that the problem behavior may decrease to a lower RATE
Vollmer 2012

A

Matching law, interval versus ratio schedules and treatment••

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q
  1. Response effort
  2. Reinforcement delay
  3. Quality of reinforcement
  4. Magnitude/duration of reinforcement
  5. Concurrent PUNISHMENT contingencies
A

Other factors influencing response allocation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q
  1. Response effort: Select appropriate behavior that is lower in effort than problem behavior
  2. Reinforcement delay: Provide consequence immediately after the appropriate behavior before gradually introducing delay procedures
  3. Quality of reinforcement:High quality and magnitude reinforcers
  4. Magnitude/duration of reinforcement
    5: Consider if there are concurrent punishment contingencies
A

Considerations When applying Factors Influencing Response Allocation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The process by which a previously in forced behavior is weakened by withholding reinforcement

The contingency between the response and consequence is broken. That is, where as the behavior was previously followed by reinforcing environmental change, in this, The reinforcing environmental change does not occur when the response is emitted

In some cases, nothing happens

The (Operant) behavior no longer has any fact on the environment.

The frequency of the behavior decreases in the future it usually too late similar to the baseline level

A

Operant Extinction. (Definition of Extinction )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

“Behavior” that has an EFFECT on the environment.

A

Operant Behavior

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Discontinuation of the contingency is the essential feature

However, the behavior place on extinction has to be able to occur.

Preventing her response topography from occurring is NOT ______

A

How Extinction works

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Another way to break a “contingency” is…

Deliver the maintaining reinforcer independently of the problem behavior by using a…

A

TIME -contingent schedule

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

During extinction of behavior that is maintained by positive reinforcement, the reinforcer is withheld. I.e.., Not presented

A

Extinction procedure for behavior maintain through SR +

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Do not react to the problem behavior. Withhold all attention

Do not look at or talk to person while problem behavior is occurring

A

Extinction procedure for problem behavior maintained by attention

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Deny access to preferred materials or activity while person is engaged in problem behavior

Provide no substitute activity while the child is engaged in problem behavior

Do not re-direct to a different preferred activity

A

Extinction procedure for behavior maintain by access to materials or activities

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Sensory blocking

Masking the sensory consequences

Neurotransmitter block or antagonist

A.k.a., SENSORY EXTINCTION

A

Extinction procedure for behavior it maintained by sensory and neurotransmitter effects

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

If the scratching one’s arm behavior is maintained through automatic reinforcement, padding the arm might attenuate the sensation produced by scratching. Care must be taken!

A

Example of Sensory Extinction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Remember, in ______the response topography hast to be able to occur

If the response is prevented it is not extension

A

Extinction

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Continue instruction during occurrences of problem behavior until the task is completed

Maintain the demand for task completion if the person has disengaged from the task

Requires completion of the task before beginning the next scheduled activity

Present tasks using a prompt hierarchy that culminates in guided compliance (includes a positive punishment operation)

“Waiting out” variation
•Instead of prompting compliance, block access to other activities/reinforcers until task completion.

A

Extinction procedure for behavior maintained by escape/avoidance of Task demands

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Continue plan sequence of events without interruption

A

Extinction procedure for behavior maintained by escape/ avoidance of warning stimuli.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Extinction burst Extinction induced aggression Behavioral contrast Spontaneous recovery
Unwanted Effects of extinction
26
Characterized by initial increases in response: Rate, direction, variability, aptitude. When behavior is no longer reinforced, the behavior, immediately and temporarily, will increase in frequency, duration, and intensity before it decreases. Behavior may also change in topography and emotional responses may occur
The Extinction Burst
27
Violent acts directed at the previous source of reinforcement • Manipulandum • individual is correlated with reinforced for deliveries • Locations correlated with reinforcers Violent act directed against others not correlated with reinforced deliveries Levels of aggression generally highest during early stages of Extinction
Extinction induced aggression
28
Following introduction of extinction in one setting, there can be an increase in responding in settings uncorrelated with extinction Effects most pronounced went time between components is short A significant clinical consideration
Behavioral contrast
29
Spontaneous recurrence of previously reinforced behavior Can initially be up to 50% of the pre-extinction rate A short lived phenomenon assuming the behavior is NOT reinforced
Spontaneous Recovery
30
Extinction burst reported twice as often with escape extinction then with… Extinction induced aggression was equally as likely with the two operations
Extinction following positive reinforcement
31
The recommendation to switch to a CRF is based on what has been termed of the…
Partial Reinforcement Extinction Effect | PREE
32
• Exposure to intermittent or partial schedule of reinforcement can increase resistance to extinction (cf., Lewis, 1960, Macintosh) Some studies suggest a reversed Partial Reinforcement Extinction Effect (PREE), is likely to occur when the CRF schedule produces a. Higher rate of reinforcement than the PRF (partial reinforcement) schedule. Lerman Found this to be the case in two of three participants in there applied research on problem
***Partial reinforcement extinction effect*++
33
cautionary note: Thus, even though most textbooks recommend switching to a CRF schedule prior to using extinction, this may not always be best and should be done only after a careful analysis of the actual…
Rate of reinforcement delivered on the real life intermittent schedule. (Lerman)
34
Inform caregivers of possible unwanted effects Provide caregivers with specific procedures to deal with the unwanted “Effects” Eg., Protective padding In event of spontaneous recovery, continue “Extinction” procedure
Mitigating the unwanted effects of Extinction
35
Failing to identify the maintaining variables Assuming extinction and ignoring are always the same Talking to the client when extinction consists of ignoring Using extinction when contraindicated Intermittently reinforcing the problem behavior Differentially reinforcing worst behavior. I.e.., Shaping up
Common mistakes made when using extinction
36
Three Other factors to consider when using extinction
1. Length of reinforcement history 2, Baseline schedule of reinforcement 3. Baseline parameters of reinforcement
37
Four Additional practical considerations when using extinction
1. Combine Reinforcement such as DRA + extinction 2. Use only as part of a treatment package 3. Use Rules, but be careful, Otherwise signal extinction., 4. Establishing Operations and extinction
38
In the experimental literature,, Consists of two operations: reinforcement and extinction
Differential Reinforcement
39
1. Differentiation | 2. Discrimination
Two types of Effects of Differential Reinforcement
40
Occurs when differential reinforcement consist of reinforcing some responses and not reinforcing other responses R-S Contingency
Differentiation – differential reinforcement procedure #1
41
One technique To minimize the unwanted effects of Extinction Combine with these other procedures...
* DRA * Time-contingent schedules * , Stimulus Fading
42
3 Techniques to minimize the unwanted Effects of extinction
1. Combine with other procedures; * DRA * Time-contingent schedules * , Stimulus Fading 2. Identify response class hierarchy and reinforce appropriate responses within the class 3. Switch to a CRF schedule for problem behavior PRIOR to extinction
43
Two or more response topographied .• One topography results in reinforcement • Another topography results in extinction R1 - SR R2 - Ext Leads to differentiation
Descriptions of general differential reinforcement procedure number ones
44
Raising hand versus hey teacher Talking softly verses talking loudly Pushing weigh and versus asking Two different types of questions
Examples of differentiation
45
Occurs when differential reinforcement consists of reinforcing a response when certain stimuli are present and not reinforcing the same response when those stimuli are not present This is an S - R - S contingency One or more response topography and two or more stimuli S1 - R1 - SR S2 - R1 - Ext Leads to discrimination Examples: vacant verses occupied The green versus the red card in extinction signaling procedure
Differential reinforcement procedure #2
46
Consists of reinforcing want more responses well either minimizing reinforcement for other responses, using extinction on those other responses, or using punishment for those other responses. Even FCT is seen as This type of procedure by many
Differential reinforcement in the applied a literature
47
A schedule of reinforcement in which reinforcement is Delivered whenever a specified amount of time has elapsed in which a specified a behavior does not occur ANY response may be occurring at the END of time interval Except the target behavior decreases rate of problem behavior Also at times, has the effect of INCREASING the FREQUENCY of COMPETING behaviors One behavior is selected that is NEVER Reinforced ANY other BEHAVIOR that occurs within the situational context is reinforced A Dead person can meet the DRO criterion. Good being quiet, good being still, good being dead Cooper, 2007 recommended that the INITIAL interval be set at or slightly BELOW the average BASELINE IRT Others recommend that the initial interval be set at 50% to 80% of the average baseline IRT
DRO- Differential reinforcement of Other behavior, Also known as DR0 for differential reinforcement of zero rates of responding or omission training
48
1. Given average baseline rate, estimate the IRT 2. Multiply estimated IRT times 0.5 and at times 0.8 3. Select the initial interval 4. Increase the interval depending on changes in IRT but consider increasing amount or magnitude
Setting the DRO interval – procedure- four steps
49
If the problem behavior occurs 120 times per hour do the following; * 3600 seconds/120 (Count) = 30 seconds (Estimated baseline IRIT) * Criterion between 15 20 seconds
Setting the DRO interval – example
50
1. Interval DRO procedure 2. Momentary DRO 3 DRO – P or progressive DRO
3 general types of DRO schedules
51
Reinforcement is delivered at the END of the “INTERVAL” contingent on zero occurrences of the target Problem behavior during the ENTIRE “interval” If the target behavior occurs at any time during that interval, the time interval is RESET Can use : Fixed interval, FI– DRO OR Variable interval, VI - DRO
Interval DRO
52
Reinforcement is delivered at the end of the interval contingent on the absence of the target behavior at that moment in time Fixed FM- Variable, VM
Momentary DRO
53
Same as an interval DRO except progressively greater amounts or magnitude of reinforcement for success at intervals meeting the criterion. Drops back to initial reinforcer amount contingent upon occurrence of target behavior
Progressive DRO, DRO – P
54
Some say that DRO is not really a reinforcement a base procedure, but a…
Negative punishment procedure that
55
1. Using DR0 for a problem behavior that occurs at a low to moderate rate 2. Setting the DR0 interval above the average IRT. 3. Failing to reset the timer when the problem behavior occurs, (interval DRO) 4. Inconsistent criterion for target problem behavior and adding extra criteria 5. The other side of #4: in advertently reinforcing other problem behavior 6. Reinforcing dead Person’s behavior. DRO fails Ogden Lindsley’s dead mans test. Be quiet, be still, be Docile 7. Filling to select a reinforcer that successfully competes with the maintaining consequences 8. Failing to use extinction when the problem behavior occurs. Example: withholding the plan and reinforcer when the behavior occurs during the DRO interval but providing attention for behavior maintain by attention when the problem behavior occurs ( That’s Inappropriate. I am resetting the timer. You will have to do better next time) 9. Using the same reinforcer each time the criterion is Mac to the point Satiation
DRO: 9 mistakes
56
Solutions for mistakes: 1. Using DRO for a problem behavior that occurs at a low to moderate rate 2. Setting DRO interval above the average IRT. 3. Failing To reset the timer when the problem behavior occurs – interval DRO 4. Inconsistent criterion for target problem behavior and adding extra criteria
1. Used for a higher rate behavior (several times permit or per hour) and when appropriate behavior occurs at a low rate. Switch to DRA after DRO has been sent from seconds to minutes. 2. Set the DRO interval at 50% to 80% of the average IRT 3. Reset the timer 4. Stick to the criterion or consider switching to a different procedure
57
Solutions to mistakes: Care should be taken when using DRO such that a new problem behavior is not strengthened. Consider danger that another inappropriate behavior will be reinforced. Reconsider the criterion for reinforcement. Consider switching to a different procedure.
5. the other side of #4: inadvertently reinforcing the problem behavior
58
Solutions to mistakes when using DRO: 6. Reinforcing dead person’s behavior 7. Failing to select a reinforcer that successfully competes with the maintaining consequences.
6. When using DRO take care not to just reinforced docile behavior consider using DRO in combination with DRA or DRI 7. Conduct a formal stimulus preference assessment: repeat assessment on a regular basis. Use abolishing operations and/or eliminate the EO for problem behavior
59
Solutions to DRO mistakes: 9. Using the same reinforcer each time the criterion is mad to the point of satiation
9. Vary the reinforcers
60
do not Directly contact •...The behavior to be reduced nor the function of relations likely to be operating Therefore, this procedure should be used with caution and typically in Combination with other, more direct procedures are overused and Misused Give careful consideration before selecting them as procedure of choice Combine, when possible, witH DRA or DRI
•. DRO schedules
61
A schedule of reinforcement in which reinforcement is delivered whenever a response occurs that is physically incompatible with the target behavior. The response cannot occur concurrently With a specified target behavior. This schedule Decrease the target inappropriate behavior while concurrently increasing incompatible behavior. The target inappropriate behavior is not reinforced
DRI – differential reinforcement of incompatible behavior Examples: a student cannot at the same time set properly and run around the classroom. A student cannot at the same time have her arms to her side and pull the teachers hair.
62
1. selecting the wrong incompatible behavior. Solution: consider in DRI: is the incomparable behavior in the repertoire? Can the incomparable behavior a car concurrently with the problem behavior? Will the incomparable behavior effectively compete with the problem behavior? 2. Failing to select a reinforcer that successfully competes with the maintaining consequences. Solution: conduct a formal stimulus preference assessment; repeat assessment on a regular basis. Use abolishing operations and/or eliminate the need for EO for the problem behavior
DRI mistakes 1,2
63
3. Selecting the wrong reinforcement schedule for the DRI procedure. Solution: initially, reinforce the incompatible behavior every time it occurs. Then gradually thing in the schedule to usually a variable interval schedule 4. Failing to place the target behavior on extinction. Solution: remember that differential reinforcement consist of two operations.
DRI mistakes; 3, 4
64
5. in advertently reinforcing a chain of problem behavior before incompatible behavior Possible solution: at times consider using a chained DRO – DRI 6. Reinforcing incompatible behavior but failing to reinforce alternative behavior Possible solutions: consider using DRI in combination with DRA and/or using a chained– DRI – DRA
DRI mistakes: 5, 6
65
A Schedule of reinforcement in which we reinforcement is delivered whenever a response occurs that may compete with the target behavior The target inappropriate behavior is either not reinforced, reinforced with a lower magnitude reinforcer and/or on a leaner schedule or sometimes punished The alternative behavior is reinforced are usually functionally equivalent to the target inappropriate behavior Decreases the target inappropriate behavior while concurrently increasing alternative behavior
Differential reinforcement of alternative behavior, DRA
66
1. Selecting the wrong alternative behavior. Solutions: select an alternative behavior that is already in the repertoire. Ensure the alternative. Behavior will be effective as a functionally equivalent behavior. Select alternative behavior that is topographically to similar. Select a response that is low in effort. DRA and FCT 2. Failing to select a reinforcer that successfully competes against the maintaining consequences. Solution: conduct a formal stimulus preference assessment; repeat assessment on a regular basis. Use abolishing operations and/or eliminate the EO for the problem behavior. Select a functionally equivalent behavior
DRA mistakes 1, 2
67
Selecting The wrong reinforcement schedule for procedure. Solution: initially, reinforced alternative behavior every time it occurs. Then it gradually thin schedule usually do a variable interval Inadvertently reinforcing a chain of problem behavior before alternative behavior. Solution: at times, consider using a chain DRO – DRA. A chain DRI – DR a, or even a chain DRO - DRI – DRA
DRA mistakes 3, 4
68
Schedules that require a maximum number of “RESPONSES” occur in a given TIME PERIOD. (Only so many allowed) The criterion for reinforcement is set BETWEEN the AVERAGE baseline rate and the LOWEST rate during baseline; but usually, just slightly BELOW the average baseline rate. Except in the DRL – T
DRL - Differential reinforcement of low rates of “RESPONDING
69
Example: A student receives a special treat if he gets out of his assigned desk NO MORE THAN four times during a one hour class. That is, at the end of the hour the teacher delivers a reward only if the student got up four or FEWER times during that hour. However, if by the end of the hour the student had gotten up MORE THAN four times the teacher WITHHOLDS the REWARD
Example of DRL
70
Full session Interval Spaced responding DRL aka DRL-T
Types of DRL schedules Spaced responding DRL or DRL – T
71
Reinforcement is delivered at the END of the “SESSION “ provided that the target response occurred AT or BELOW the criterion
Full SESSION DRL
72
The session is BROKEN into a series of SMALLER , EQUAL time “intervals” and reinforcement is provided at the END of each interval provided that the target response occurred AT or BELOW the criterion in THAT Interval
Interval DRL
73
Reinforcement is delivered provided that some MINIMUM amount of TIME has elapsed since the last “response” Setting the criterion: Responses are reinforced only if they are emitted AFTER a specified amount of time has elapsed since the last response. The criterion for reinforcement is set at a time ABOVE the AVERAGE IRT
Spaced responding DRL or DRL – T
74
Some say that DRL – T is not really a reinforcement based procedure but a
Negative punishment procedure
75
1. Using DRL for a serious behavior Solution: used only for minor problem behavior 2. Using full session or interval with naïve learners Solution: Consider using DRL – T. Consider using a contingency contract instead. Don’t call it what it is.
DRL mistakes and solutions
76
A progressive DRL schedule with a CHANGING criterion.
DRD: Differential reinforcement of DIMINISHING RATES of responding
77
Example: the goal was to decrease inappropriate language. At the end of each day the mean hourly rate was tabulated compared to the previous days rate. If John was under the previous days average hourly rate, he was rewarded with Mountain Dew at that time. John was primed at the beginning of each day with his previous threshold hourly rate.
Differential reinforcement of DIMINISHING Rates of Responding- DRD
78
Combines a DRL with a DRH component Consists of reinforcing behavior if it occurs at NO MORE THAN and NO LESS THAN a specified RATE or, alternatively, at NO Lower than and NO higher than a specified IRT That is, behavior is reinforced if it occurs BETWEEN a specified MINIMUM and MAXIMUM rate or IRT The reinforcement criterion is set at a specific RATE BETWEEN the MINIMUM and the MAXIMUM rate Or UPPER and lLOWER limits on reinforced IRT’s are set. A.k.a., DRIL – T with limited hold
DRP- Differential reinforcement of PACED responses (AKA, DR of Pacing)
79
A Schedule that is used for minor behaviors
Differential reinforcement of low Responses
80
Use full session or interval with higher learners
DRL
81
Consider using the schedule with naïve learners
DRL – T Contingency contract
82
Use with behaviors that occur at higher rates.
DRO