Unit 2 The matching law, operant extinction and differential reinforcement procedures Flashcards by Kathleen Dugan

A compound a schedule that consists of two or more schedules operating simultaneously but independently of each other, each for a different response.

Concurrent schedule

How well did you know this?

Not at all

Perfectly

Two or more reinforcement contingencies
Contingencies operate independently of each other
Contingencies in effect simultaneously
One or more different response topographies for each contingency

Critical attributes of concurrent schedules

How well did you know this?

Not at all

Perfectly

Doing one’s homework versus playing a video game

Screening verses asking nicely

Examples of concurrent schedules

How well did you know this?

Not at all

Perfectly

Hernstein (1961):

The DISTRIBUTION of behavior on CONCURRENT schedules of positive reinforcement.
-Pigeons “MATCHED” relative rates of behavior to relative rates of reinforcement” Cleary 2004

Deals with Choice. Choice = Concurrent schedules

Response Allocation (Choice) is predicted by this.

This OR That

Matching Law

How well did you know this?

Not at all

Perfectly

B1/B1 + B2 = R1/R1+R2

B = rate of responses for each alternative

R = rate of reinforcement for each alternative

Matching law equation

See also picture

How well did you know this?

Not at all

Perfectly

Concurrent schedules and matching law examples

See PowerPoint slides for illustrations and video

How well did you know this?

Not at all

Perfectly

Relevance of the matching law to treating problem behavior

Even if problem behavior is reinforced, the greater the DIFFERENCE In Proportion of Reinforcement for a problem behavior versus reinforcement for a REPLACEMENT behavior, the greater the difference in____

…RATE between the two.

How well did you know this?

Not at all

Perfectly

Reinforce replacement “Behavior “ On a DENSE schedule and minimize reinforcement for problem behavior
We Can live with some failures of procedural INTEGRITY when using Extinction
May not need to use extinction at all, provided that the reinforcement schedule for appropriate behavior is dense enough and takes into account other factors.
Time-Contingent schedules may be used

Matching law and treatment of problem behavior

How well did you know this?

Not at all

Perfectly

There is more research on concurrent interval Schedules than concurrent ratio Schedules.

However, when concurrent ratio schedules are used, the alternative that produces a higher rate of reinforcement is Chosen almost exclusively

**Concurrent Interval verses Concurrent Ratio Schedules*

How well did you know this?

Not at all

Perfectly

The applied literature indicates that when problem behavior is maintained on an interval schedule, one should use interval schedules to reinforce appropriate behavior
Eg, Myerson and Hale

However, one my explore using a Dense RATIO schedule to reinforce appropriate behavior, as there is evidence that the problem behavior may decrease to a lower RATE
Vollmer 2012

Matching law, interval versus ratio schedules and treatment••

How well did you know this?

Not at all

Perfectly

Response effort
Reinforcement delay
Quality of reinforcement
Magnitude/duration of reinforcement
Concurrent PUNISHMENT contingencies

Other factors influencing response allocation.

How well did you know this?

Not at all

Perfectly

Response effort: Select appropriate behavior that is lower in effort than problem behavior
Reinforcement delay: Provide consequence immediately after the appropriate behavior before gradually introducing delay procedures
Quality of reinforcement:High quality and magnitude reinforcers
Magnitude/duration of reinforcement
5: Consider if there are concurrent punishment contingencies

Considerations When applying Factors Influencing Response Allocation

How well did you know this?

Not at all

Perfectly

The process by which a previously in forced behavior is weakened by withholding reinforcement

The contingency between the response and consequence is broken. That is, where as the behavior was previously followed by reinforcing environmental change, in this, The reinforcing environmental change does not occur when the response is emitted

In some cases, nothing happens

The (Operant) behavior no longer has any fact on the environment.

The frequency of the behavior decreases in the future it usually too late similar to the baseline level

Operant Extinction. (Definition of Extinction )

How well did you know this?

Not at all

Perfectly

“Behavior” that has an EFFECT on the environment.

Operant Behavior

How well did you know this?

Not at all

Perfectly

Discontinuation of the contingency is the essential feature

However, the behavior place on extinction has to be able to occur.

Preventing her response topography from occurring is NOT ______

How Extinction works

How well did you know this?

Not at all

Perfectly

Another way to break a “contingency” is…

Deliver the maintaining reinforcer independently of the problem behavior by using a…

TIME -contingent schedule

How well did you know this?

Not at all

Perfectly

During extinction of behavior that is maintained by positive reinforcement, the reinforcer is withheld. I.e.., Not presented

Extinction procedure for behavior maintain through SR +

How well did you know this?

Not at all

Perfectly

Do not react to the problem behavior. Withhold all attention

Do not look at or talk to person while problem behavior is occurring

Extinction procedure for problem behavior maintained by attention

How well did you know this?

Not at all

Perfectly

Deny access to preferred materials or activity while person is engaged in problem behavior

Provide no substitute activity while the child is engaged in problem behavior

Do not re-direct to a different preferred activity

Extinction procedure for behavior maintain by access to materials or activities

How well did you know this?

Not at all

Perfectly

Sensory blocking

Masking the sensory consequences

Neurotransmitter block or antagonist

A.k.a., SENSORY EXTINCTION

Extinction procedure for behavior it maintained by sensory and neurotransmitter effects

How well did you know this?

Not at all

Perfectly

If the scratching one’s arm behavior is maintained through automatic reinforcement, padding the arm might attenuate the sensation produced by scratching. Care must be taken!

Example of Sensory Extinction

How well did you know this?

Not at all

Perfectly

Remember, in ______the response topography hast to be able to occur

If the response is prevented it is not extension

Extinction

How well did you know this?

Not at all

Perfectly

Continue instruction during occurrences of problem behavior until the task is completed

Maintain the demand for task completion if the person has disengaged from the task

Requires completion of the task before beginning the next scheduled activity

Present tasks using a prompt hierarchy that culminates in guided compliance (includes a positive punishment operation)

“Waiting out” variation
•Instead of prompting compliance, block access to other activities/reinforcers until task completion.

Extinction procedure for behavior maintained by escape/avoidance of Task demands

How well did you know this?

Not at all

Perfectly

Continue plan sequence of events without interruption

Extinction procedure for behavior maintained by escape/ avoidance of warning stimuli.

How well did you know this?

Not at all

Perfectly

Extinction burst Extinction induced aggression Behavioral contrast Spontaneous recovery

Unwanted Effects of extinction

Characterized by initial increases in response: Rate, direction, variability, aptitude. When behavior is no longer reinforced, the behavior, immediately and temporarily, will increase in frequency, duration, and intensity before it decreases. Behavior may also change in topography and emotional responses may occur

The Extinction Burst

Violent acts directed at the previous source of reinforcement • Manipulandum • individual is correlated with reinforced for deliveries • Locations correlated with reinforcers Violent act directed against others not correlated with reinforced deliveries Levels of aggression generally highest during early stages of Extinction

Extinction induced aggression

Following introduction of extinction in one setting, there can be an increase in responding in settings uncorrelated with extinction Effects most pronounced went time between components is short A significant clinical consideration

Behavioral contrast

Spontaneous recurrence of previously reinforced behavior Can initially be up to 50% of the pre-extinction rate A short lived phenomenon assuming the behavior is NOT reinforced

Spontaneous Recovery

Extinction burst reported twice as often with escape extinction then with… Extinction induced aggression was equally as likely with the two operations

Extinction following positive reinforcement

The recommendation to switch to a CRF is based on what has been termed of the…

Partial Reinforcement Extinction Effect | PREE

• Exposure to intermittent or partial schedule of reinforcement can increase resistance to extinction (cf., Lewis, 1960, Macintosh) Some studies suggest a reversed Partial Reinforcement Extinction Effect (PREE), is likely to occur when the CRF schedule produces a. Higher rate of reinforcement than the PRF (partial reinforcement) schedule. Lerman Found this to be the case in two of three participants in there applied research on problem

***Partial reinforcement extinction effect*++

cautionary note: Thus, even though most textbooks recommend switching to a CRF schedule prior to using extinction, this may not always be best and should be done only after a careful analysis of the actual…

Rate of reinforcement delivered on the real life intermittent schedule. (Lerman)

Inform caregivers of possible unwanted effects Provide caregivers with specific procedures to deal with the unwanted “Effects” Eg., Protective padding In event of spontaneous recovery, continue “Extinction” procedure

Mitigating the unwanted effects of Extinction

Failing to identify the maintaining variables Assuming extinction and ignoring are always the same Talking to the client when extinction consists of ignoring Using extinction when contraindicated Intermittently reinforcing the problem behavior Differentially reinforcing worst behavior. I.e.., Shaping up

Common mistakes made when using extinction

Three Other factors to consider when using extinction

1. Length of reinforcement history 2, Baseline schedule of reinforcement 3. Baseline parameters of reinforcement

Four Additional practical considerations when using extinction

1. Combine Reinforcement such as DRA + extinction 2. Use only as part of a treatment package 3. Use Rules, but be careful, Otherwise signal extinction., 4. Establishing Operations and extinction

In the experimental literature,, Consists of two operations: reinforcement and extinction

Differential Reinforcement

1. Differentiation | 2. Discrimination

Two types of Effects of Differential Reinforcement

Occurs when differential reinforcement consist of reinforcing some responses and not reinforcing other responses R-S Contingency

Differentiation – differential reinforcement procedure #1

One technique To minimize the unwanted effects of Extinction Combine with these other procedures...

* DRA * Time-contingent schedules * , Stimulus Fading

3 Techniques to minimize the unwanted Effects of extinction

1. Combine with other procedures; * DRA * Time-contingent schedules * , Stimulus Fading 2. Identify response class hierarchy and reinforce appropriate responses within the class 3. Switch to a CRF schedule for problem behavior PRIOR to extinction

Two or more response topographied .• One topography results in reinforcement • Another topography results in extinction R1 - SR R2 - Ext Leads to differentiation

Descriptions of general differential reinforcement procedure number ones

Raising hand versus hey teacher Talking softly verses talking loudly Pushing weigh and versus asking Two different types of questions

Examples of differentiation

Occurs when differential reinforcement consists of reinforcing a response when certain stimuli are present and not reinforcing the same response when those stimuli are not present This is an S - R - S contingency One or more response topography and two or more stimuli S1 - R1 - SR S2 - R1 - Ext Leads to discrimination Examples: vacant verses occupied The green versus the red card in extinction signaling procedure

Differential reinforcement procedure #2

Consists of reinforcing want more responses well either minimizing reinforcement for other responses, using extinction on those other responses, or using punishment for those other responses. Even FCT is seen as This type of procedure by many

Differential reinforcement in the applied a literature

A schedule of reinforcement in which reinforcement is Delivered whenever a specified amount of time has elapsed in which a specified a behavior does not occur ANY response may be occurring at the END of time interval Except the target behavior decreases rate of problem behavior Also at times, has the effect of INCREASING the FREQUENCY of COMPETING behaviors One behavior is selected that is NEVER Reinforced ANY other BEHAVIOR that occurs within the situational context is reinforced A Dead person can meet the DRO criterion. Good being quiet, good being still, good being dead Cooper, 2007 recommended that the INITIAL interval be set at or slightly BELOW the average BASELINE IRT Others recommend that the initial interval be set at 50% to 80% of the average baseline IRT

DRO- Differential reinforcement of Other behavior, Also known as DR0 for differential reinforcement of zero rates of responding or omission training

1. Given average baseline rate, estimate the IRT 2. Multiply estimated IRT times 0.5 and at times 0.8 3. Select the initial interval 4. Increase the interval depending on changes in IRT but consider increasing amount or magnitude

Setting the DRO interval – procedure- four steps

If the problem behavior occurs 120 times per hour do the following; * 3600 seconds/120 (Count) = 30 seconds (Estimated baseline IRIT) * Criterion between 15 20 seconds

Setting the DRO interval – example

1. Interval DRO procedure 2. Momentary DRO 3 DRO – P or progressive DRO

3 general types of DRO schedules

Reinforcement is delivered at the END of the “INTERVAL” contingent on zero occurrences of the target Problem behavior during the ENTIRE “interval” If the target behavior occurs at any time during that interval, the time interval is RESET Can use : Fixed interval, FI– DRO OR Variable interval, VI - DRO

Interval DRO

Reinforcement is delivered at the end of the interval contingent on the absence of the target behavior at that moment in time Fixed FM- Variable, VM

Momentary DRO

Same as an interval DRO except progressively greater amounts or magnitude of reinforcement for success at intervals meeting the criterion. Drops back to initial reinforcer amount contingent upon occurrence of target behavior

Progressive DRO, DRO – P

Some say that DRO is not really a reinforcement a base procedure, but a…

Negative punishment procedure that

1. Using DR0 for a problem behavior that occurs at a low to moderate rate 2. Setting the DR0 interval above the average IRT. 3. Failing to reset the timer when the problem behavior occurs, (interval DRO) 4. Inconsistent criterion for target problem behavior and adding extra criteria 5. The other side of #4: in advertently reinforcing other problem behavior 6. Reinforcing dead Person’s behavior. DRO fails Ogden Lindsley’s dead mans test. Be quiet, be still, be Docile 7. Filling to select a reinforcer that successfully competes with the maintaining consequences 8. Failing to use extinction when the problem behavior occurs. Example: withholding the plan and reinforcer when the behavior occurs during the DRO interval but providing attention for behavior maintain by attention when the problem behavior occurs ( That’s Inappropriate. I am resetting the timer. You will have to do better next time) 9. Using the same reinforcer each time the criterion is Mac to the point Satiation

DRO: 9 mistakes

Solutions for mistakes: 1. Using DRO for a problem behavior that occurs at a low to moderate rate 2. Setting DRO interval above the average IRT. 3. Failing To reset the timer when the problem behavior occurs – interval DRO 4. Inconsistent criterion for target problem behavior and adding extra criteria

1. Used for a higher rate behavior (several times permit or per hour) and when appropriate behavior occurs at a low rate. Switch to DRA after DRO has been sent from seconds to minutes. 2. Set the DRO interval at 50% to 80% of the average IRT 3. Reset the timer 4. Stick to the criterion or consider switching to a different procedure

Solutions to mistakes: Care should be taken when using DRO such that a new problem behavior is not strengthened. Consider danger that another inappropriate behavior will be reinforced. Reconsider the criterion for reinforcement. Consider switching to a different procedure.

5. the other side of #4: inadvertently reinforcing the problem behavior

Solutions to mistakes when using DRO: 6. Reinforcing dead person’s behavior 7. Failing to select a reinforcer that successfully competes with the maintaining consequences.

6. When using DRO take care not to just reinforced docile behavior consider using DRO in combination with DRA or DRI 7. Conduct a formal stimulus preference assessment: repeat assessment on a regular basis. Use abolishing operations and/or eliminate the EO for problem behavior

Solutions to DRO mistakes: 9. Using the same reinforcer each time the criterion is mad to the point of satiation

9. Vary the reinforcers

do not Directly contact •...The behavior to be reduced nor the function of relations likely to be operating Therefore, this procedure should be used with caution and typically in Combination with other, more direct procedures are overused and Misused Give careful consideration before selecting them as procedure of choice Combine, when possible, witH DRA or DRI

•. DRO schedules

A schedule of reinforcement in which reinforcement is delivered whenever a response occurs that is physically incompatible with the target behavior. The response cannot occur concurrently With a specified target behavior. This schedule Decrease the target inappropriate behavior while concurrently increasing incompatible behavior. The target inappropriate behavior is not reinforced

DRI – differential reinforcement of incompatible behavior Examples: a student cannot at the same time set properly and run around the classroom. A student cannot at the same time have her arms to her side and pull the teachers hair.

1. selecting the wrong incompatible behavior. Solution: consider in DRI: is the incomparable behavior in the repertoire? Can the incomparable behavior a car concurrently with the problem behavior? Will the incomparable behavior effectively compete with the problem behavior? 2. Failing to select a reinforcer that successfully competes with the maintaining consequences. Solution: conduct a formal stimulus preference assessment; repeat assessment on a regular basis. Use abolishing operations and/or eliminate the need for EO for the problem behavior

DRI mistakes 1,2

3. Selecting the wrong reinforcement schedule for the DRI procedure. Solution: initially, reinforce the incompatible behavior every time it occurs. Then gradually thing in the schedule to usually a variable interval schedule 4. Failing to place the target behavior on extinction. Solution: remember that differential reinforcement consist of two operations.

DRI mistakes; 3, 4

5. in advertently reinforcing a chain of problem behavior before incompatible behavior Possible solution: at times consider using a chained DRO – DRI 6. Reinforcing incompatible behavior but failing to reinforce alternative behavior Possible solutions: consider using DRI in combination with DRA and/or using a chained– DRI – DRA

DRI mistakes: 5, 6

A Schedule of reinforcement in which we reinforcement is delivered whenever a response occurs that may compete with the target behavior The target inappropriate behavior is either not reinforced, reinforced with a lower magnitude reinforcer and/or on a leaner schedule or sometimes punished The alternative behavior is reinforced are usually functionally equivalent to the target inappropriate behavior Decreases the target inappropriate behavior while concurrently increasing alternative behavior

Differential reinforcement of alternative behavior, DRA

1. Selecting the wrong alternative behavior. Solutions: select an alternative behavior that is already in the repertoire. Ensure the alternative. Behavior will be effective as a functionally equivalent behavior. Select alternative behavior that is topographically to similar. Select a response that is low in effort. DRA and FCT 2. Failing to select a reinforcer that successfully competes against the maintaining consequences. Solution: conduct a formal stimulus preference assessment; repeat assessment on a regular basis. Use abolishing operations and/or eliminate the EO for the problem behavior. Select a functionally equivalent behavior

DRA mistakes 1, 2

Selecting The wrong reinforcement schedule for procedure. Solution: initially, reinforced alternative behavior every time it occurs. Then it gradually thin schedule usually do a variable interval Inadvertently reinforcing a chain of problem behavior before alternative behavior. Solution: at times, consider using a chain DRO – DRA. A chain DRI – DR a, or even a chain DRO - DRI – DRA

DRA mistakes 3, 4

Schedules that require a maximum number of “RESPONSES” occur in a given TIME PERIOD. (Only so many allowed) The criterion for reinforcement is set BETWEEN the AVERAGE baseline rate and the LOWEST rate during baseline; but usually, just slightly BELOW the average baseline rate. Except in the DRL – T

DRL - Differential reinforcement of low rates of “RESPONDING

Example: A student receives a special treat if he gets out of his assigned desk NO MORE THAN four times during a one hour class. That is, at the end of the hour the teacher delivers a reward only if the student got up four or FEWER times during that hour. However, if by the end of the hour the student had gotten up MORE THAN four times the teacher WITHHOLDS the REWARD

Example of DRL

Full session Interval Spaced responding DRL aka DRL-T

Types of DRL schedules Spaced responding DRL or DRL – T

Reinforcement is delivered at the END of the “SESSION “ provided that the target response occurred AT or BELOW the criterion

Full SESSION DRL

The session is BROKEN into a series of SMALLER , EQUAL time “intervals” and reinforcement is provided at the END of each interval provided that the target response occurred AT or BELOW the criterion in THAT Interval

Interval DRL

Reinforcement is delivered provided that some MINIMUM amount of TIME has elapsed since the last “response” Setting the criterion: Responses are reinforced only if they are emitted AFTER a specified amount of time has elapsed since the last response. The criterion for reinforcement is set at a time ABOVE the AVERAGE IRT

Spaced responding DRL or DRL – T

Some say that DRL – T is not really a reinforcement based procedure but a

Negative punishment procedure

1. Using DRL for a serious behavior Solution: used only for minor problem behavior 2. Using full session or interval with naïve learners Solution: Consider using DRL – T. Consider using a contingency contract instead. Don’t call it what it is.

DRL mistakes and solutions

A progressive DRL schedule with a CHANGING criterion.

DRD: Differential reinforcement of DIMINISHING RATES of responding

Example: the goal was to decrease inappropriate language. At the end of each day the mean hourly rate was tabulated compared to the previous days rate. If John was under the previous days average hourly rate, he was rewarded with Mountain Dew at that time. John was primed at the beginning of each day with his previous threshold hourly rate.

Differential reinforcement of DIMINISHING Rates of Responding- DRD

Combines a DRL with a DRH component Consists of reinforcing behavior if it occurs at NO MORE THAN and NO LESS THAN a specified RATE or, alternatively, at NO Lower than and NO higher than a specified IRT That is, behavior is reinforced if it occurs BETWEEN a specified MINIMUM and MAXIMUM rate or IRT The reinforcement criterion is set at a specific RATE BETWEEN the MINIMUM and the MAXIMUM rate Or UPPER and lLOWER limits on reinforced IRT’s are set. A.k.a., DRIL – T with limited hold

DRP- Differential reinforcement of PACED responses (AKA, DR of Pacing)

A Schedule that is used for minor behaviors

Differential reinforcement of low Responses

Use full session or interval with higher learners

DRL

Consider using the schedule with naïve learners

DRL – T Contingency contract

Use with behaviors that occur at higher rates.

DRO

Unit 2 The matching law, operant extinction and differential reinforcement procedures Flashcards

(82 cards)