Ch 13 Schedules of Reinforcement Flashcards

1
Q

type of intermittent reinforcement schedule where reinforcement is delivered after an average/unpredictable number of responses

A

variable ratio (VR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

type of intermittent reinforcement schedule where reinforcement is delivered for the first response after a varying amount of time passed

A

variable interval (VI)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

type of compound schedule of reinforcement in which two or more basic schedules (e.g. FR, VI) occur in sequence, and reinforcement is delivered only after all components have been completed- without any clear signals indicating when one component ends and next begins

-involves 2 or more schedules in a fixed sequence with no cues signaling the transition
-reinforcement is given only at end of entire sequence
-used to teach complex behavior chains to increase persistence and independent responding

e.g. Tandem FR-3 FI-2min;
learner must emit 3 responses (FR3) then wait 2 min (FI2min). Only then is reinforcement delivered. No cues indicate that the schedule changed from FR to FI.

A

tandem schedule (tand)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

process of gradually increasing response requirement or time between reinforcers after a behavior has been acquired, in order to make the reinforcemement more naturalistic, efficient, and sustainable over time

-involves reducing rate or frequency of reinforcement
-helps prevent reinforcer dependence
-promotes maintenance and generalization of skills

e.g. FR1->FR3->FR5

FI 1min-> FI 5min

increasing token requirement from 5 to 10

A

schedule thinning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

refers to how and when reinforcement is delivered for a behavior. defines conditions under which a response will be reinforced, and plays a key role in acquisition, maintenance, and strength of behavior

-must be selected and adjusted based on individual needs and goals

A

schedule of reinforcement

2 main categories;
1. continuous reinforcement (CRF)- every instance of target behavior is reinforced; best for teaching new behaviors

  1. intermittent reinforcement (INT)- some but not all target responses are reinforced; builds resistance to extinction and helps with maintenance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

when reinforcement is thinned too quickly- especially in ratio schedules (FR or VR)- causing a breakdown in responding. can lead to avoidance, aggression, or sudden drop in target behavior

-happens when response requirement becomes tooo high

A

ratio strain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

reinforcement schedule in which response requirement increases systematically until the individual stops responding. Commonly used to assess strength or value of a reinforcer.

-number of required responses increases after each reinforcer
-point at which learner stops responding is the breaking point
-used for reinforcer assessment

e.g. trial 1-1 response
trial 2- 3 responses
trial 3- 5 responses

A

progressive-ratio (PR) schedule of reinforcement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

schedule in which response requirement systematically increases- either after each reinforcement or after a set number of sessions- to evaluate or build persistence, endurance, or reinforcer strength.

-demand increases gradually
-used to test or build motivation, endurance, or behavioral persistence
-often used in reinforcer assessments or strengthening existing behaviors
-can be arithmetic (e.g. +2 each time) or geometrice (e.g. x2 each time)

A

progressive schedule of reinforcement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

temporary pause in responding that occurs immediately after reinforcement is delivered, before the individual begins responding again. aka the learner takes a brief rest after reinforcement

-more likely in FR and FI
-the pause happens right after reinforcement
-the longer the requirement, the longer the pause tends to be

e.g.on an FI 5-min schedule, learner responds, gets reinforced, then waits several minutes before responding again

-this occurs because the learner predicts the reinforcement won’t be available right away

A

postreinforcement pause

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

type of compound schedule in which two or more reinforcement schedules are presented alternately, each associated with a distinct SD. The behavior may be the same across conditions, but the schedule of reinforcement changes, depending on the SD that is present.

-two or more schedules alternate (e.g. FR, VI, EXT)
-each schedule has its own SD (cue or signal)
-only one schedule is in effect at a time
-used to study or teach stimulus control of behavior

e.g. student is reinforced on FR3 schedule when a blue card is on the desk , and on a VI 2-min schedule when a yellow card is present

A

multiple schedule (mult)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

compound schedule in which two or more basic schedules are alternated, without any discriminative stimuli (SDs) signaling which schedule is currently in effect. The individual must respond without knowing which schedule is active at any given time

A

mixed schedule of reinforcement (mix)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

describes how organisms allocate their responses across two or more concurrent reinforcement schedules based on the rate of reinforcement available for each option. Individuals tend to match the proportion of their behavior to the proportion of reinforcement they receive for those behaviors.

-the relative rate of responding will match the relative rate of reinforcement
-explains choice behavior when multiple options are available
-can be used to shift behavior to more appropriate options by adjusting reinforcement rate

e.g. if a student earns 70% of reinforcement from handraising and 30% from calling out, it’s predicted they will allocate about 70% of their responses to handraising and 30% to calling out

aka behavior goes where reinforcement flows

A

matching law

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

feature that can be added to interval schedules of reinforcement, where the reinforcer is available for only a limited period of time after the interval has elapsed. If the individual does not respond within that window, the opportunity for reinforcement is lost, and the timer resets.

-applies to interval schedules (FI, VI)
-a deadline is set for making a response after the interval ends
-reinforcement only delivered if the response occurs within the window
-encourages quick responding once reinforcement becomes available

e.g. a student can earn a token if they begin work within 30s of teacher’s instruction

A

limited hold (LH)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

type of differential reinforcement schedule where reinforcement is delivered only if the individual’s response is different from a set number of previous responses. used to increase variability and promote creativity or flexibility in behavior.

-reinforcement depends on the novelty or variability of a response
-response must differ from the last “n” responses
-encourages diverse responding
-can be applied to verbal, motor, academic, or play behaviors

A

lag schedule

e.g. Lag1 = different from the immediately previous response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

schedule in which only some occurrences of a behavior are reinforced

A

intermittent schedule of reinforcement (INT)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

type of intermittent reinforcement in which reinforcement is delivered after a set number of correct responses

response pattern- pause after r+ followed by rapid burst of responses

A

fixed ratio (FR)

17
Q

type of intermittent reinforcement where the first correct response is reinforced after a fixed amount of time has passed since the last reinforcement. responses before the interval ends do not produce reinforcement.

-reinforcement is based on time, not number of responses
-the interval length doesn’t change
-only the first response after the interval is reinforced
-produces a scalloped response pattern (slow responding, then increases as interval ends)

e.g. ___2min= first correct response after 2 min is reinforced. if learner responds before 2 min are up, those responses are not reinforced.

e.g. teacher checls on student work every 10 min and provides praise if the student is working

A

fixed interval (FI)

18
Q

reinforce behavior only when it occurs at or below a specified rate. goal is to reduce behavior (but not eliminate) to an appropriate level.

-reinforcement is withheld if behavior occurs too frequently

e.g.student is praised only if they ask fewer than 3 questions during class.

A

DRL (differential reinforcement of low rates)

19
Q

reinforcement is delivered only when a behavior occurs at or above a specified high rate, encouraging the learner to increase the frequency of a desirable behavior.

-used to increase rate of already appropriate behaviors
-strengthens fluency, engagement, productivity

e.g. student earns reward only if they complete at least 10 math problems in 20 min

A

DRH (differential reinforcement of high rates)

20
Q

reinforcement is delivered if the behavior occurs fewer times than a gradually criterion over successive sessions or time periods. Used to systematically reduce the rate of a behavoir that is acceptable in small amounts but problematic when too frequent.

-similar to DRL, but criterion decreases over time
-reinforces behavior only if it occurs below a set, reducing limit
-goal is to gradually lower behavior frequency
-for behaviors that are appropriate in moderation and when a gradual reduction is better than abrupt extinction

e.g. student asks 10 off-topic questions per class.
day 1 reinforce if student asks 9 or fewer questions
day 2 reinforce if student asks 8 or fewer questions
continue until behavior reaches a more acceptable level

A

DRD (differential reinforcement of diminishing rates)

21
Q

schedule of reinforcement in which every occurrence of target behavior is reinforced

A

continuous reinforcement (CRF)

22
Q

type of compound reinforcement schedule in which two or more basic schedules (e.g. ratio and interval) are in effect at the same time, and both conditions must be met before reinforcement is delivered.

-often combines ratio + interval
-all components of the schedule must be satisfied for reinforcement
-produces steady, moderate response rates
-encourages both consistent responding and appropriate timing

e.g. ___ FR10 FI 2-min-> individual must emit at least 10 responses and the 2-min interval must have passed for reinforcement to be delivered

A

conjunctive schedule (conj)

23
Q

two or more reinforcement schedules available at the same time, each associated with a different response option. learner is free to choose between options, and behavior tends to be allocated based on the relative rates of reinforcement for each.

-two or more schedules operate simultaneously
-each schedule is tied to a specific response
-individual can switch freely between them
-odten used to study choice behavior or reinforcer preference

e.g.
schedule A- VR5 for touching red card
schedule B- VR10 for touching blue card

A

concurrent schedule (conc)

24
Q

combines two or more basic reinforcement schedules (e.g. FR, FI VR, VI, EXT) that are presented either together or in sequence. These schedules can apply to one behavior and may or may not involve discriminative stimuli (SDs) to signal which schedule is in effect.

-involves two or more basic schedules
-may include SDs
-may occur sequentially or simultaneously
-all apply to the same target behavior

A

compound schedule of reinforcement

25
type of compound reinforcement schedule in which two or more basic schedules are presented in a specific sequence, and each component has a discriminative stimulus. Reinforcement is delivered only after the final component of the chain is completed. -multiple schedules occur in a fixed order -aach component has a distinct SD -reinforcement is only given at end of entire chain -each SD acts as a conditioned reinforcer for the previous link e.g.morning routine- 1. brush teeth (FR=complete the task) 2. get dressed 3. eat breakfast
chained schedule (chain)
26
occurs when a change in reinforcement or punishment in one context leads to change in behavior in the opposite direction in another context, even though the second context remains unchanged. -seen in multiple schedule or multi-setting arrangements -one component changes (e.g. reinforcement is increased or reduced) -behavior increases or decreased in the unchanged component -demonstrates how behavior is influenced by relative reinforcement across environements
behavioral contrast types 1. positive contrast 2. negative contrast
27
type of compound reinforcement schedule in which reinforcement is delivered when either of two or more schedule requirements is met- whichever happens first. typically involves a combination of ratio and interval schedules, both applied to the same behavior. -two or more basic schedules run at the same time -reinforcement occurs when any one of the schedule is completed -all schedules apply to the same behavior -encourages flexibility in how reinforcement is accessed e.g. __FR10 FI 2-min; reinforcement given when 10 responses are completed OR 2 minutes pass and at least one response occurs. whichever response comes first triggers reinforcement student is reinforced for completing 5 questions (FR5) or staying on-task for 3 minutes (FI 3-min). if they do either one first, they get the reinforcer.
alternative schedule (alt)
28
behavior that occurs as a side effect of certain schedules of reinforcement, especially time-based schedules (FI, VI) or time-based delivery (FT, VT). these behaviors are not directly reinforced, but tend to emerge predictably during time periods between reinforcers. -occurs between scheduled reinforcers -not directly related to the target behavior -often excessive, repetitive, or irrelevant -most commonly observed with interval or time-based schedules e.g. student begins doodling or rocking back and forth while waiting for reinforcement student on a VT schedule repeats irrelevant phrases or engages in stereotypy between reinforcers
adjunctive behaviors
29
behavior increases in the unchanged setting after reinforcement is decreased elsewhere
positive contrast
30
behavior decreases in the unchanged setting after reinforcement is increased elsewhere
negative contrast
31
compare & contrast the 7 compound schedules of reinforcement chained schedule tandem schedule multiple schedule mixed schedule alternative schedule conjunctive schedule concurrent schedule
all the compound schedules involve 2 or more basic schedules of reinforcement (FR, VR, FI, VI, EXT) chained schedule- 2 or more schedules presented in specific sequence, each component has SD. R+ after final component is completed. tandem schedule- 2+ schedules in fixed sequence with no SD. R+ at end of sequence. multiple schedule- 2+ schedules alternate, each with its own SD signaling which schedule is in effect mixed schedule- 2+ schedules alternate, without SDs (you don’t know which is in effect) alternative schedule- 2+ schedules occur at same time, R+ when any one of the schedule requirements is met (not the learner’s choice. one behavior occurs.) conjunctive schedule- 2+ schedules occur at the same time, both conditions must be met for R+ concurrent schedule- 2+ schedules available at same time, each associated with a different response option. learner is free to choose which one (multiple behavior options, learner’s choice. the only schedule with more than one behavior option.)
32
which schedules are sequential, which are alternating, and which are simultaneously occurring?
Sequential- chained, tandem Alternating- multiple, mixed Simultaneous- alternative, conjunctive, concurrent