Reinforcement and Schedules of Reinforcement Flashcards
(34 cards)
What is operant conditioning?
A type of learning in which the future probability of a behaviour is affected by its consequences
What are the three main components of operant conditioning?
antecedent cues, behaviour, and consequences.
What is the order that ABC occurs?
antecedent cues (A) come before a behaviour (B); the consequences (C) occur after the behaviour.
what does the arrow between the B and C demonstrate?
The arrow between B and C indicates that the behaviour causes the consequences.
What does the colon between A and B demonstrate?
The colon between A and B indicates that antecedent cues do not cause behaviour, but they merely set the occasion for behaviour.
During operant conditioning, what do the consequences of behaviour influence?
(1) Frequency of the behaviour in the future
(2) Ability of future antecedent cues to set the occasion for the behaviour.
Means that C influences A and B
What is the prime movers of operant conditioning?
The consequences following an operant behaviour
What are reinforcers?
consequences used to strengthen a behaviour
How do we make a behaviour reoccur in the future?
To cause a behaviour (B) to be repeated in the future, a reinforcer must be immediately contingent (dependent) on the execution of the behaviour.
What are the four types of contingency?
Positive punishment
Positive reinforcement
Negative punishment
Negative reinforcement
What is reinforcement?
Reinforcement is defined as the procedure by which certain consequences strengthen a behaviour.
What is the difference between a reinforcer and a reinforcement?
The term reinforcer refers to the actual consequence
of a behaviour; the term reinforcement refers to the
process or procedure of strengthening a behaviour by
instituting this consequence.
What is positive reinforcement?
a behaviour strengthening procedure in which the occurrence of a behaviour is followed by the presentation of a stimulus that is usually considered pleasant or rewarding.
e.g.The use of food to increase the
strength of lever pressing in rats.
What is Negative reinforcement?
Behaviour strengthening procedure in which a stimulus
that is usually considered unpleasant or aversive, is
removed or omitted if the behaviour occurs.
If fastening one’s seatbelt terminates the loud buzz that
occurs in some cars when the ignition is turned on, this
behaviour will be strengthened.
What is a reinforcement schedule?
a rule that states under what conditions a reinforcer will be delivered.
What is a continuous reinforcer?
Where every occurrence of the operant response is followed by a reinforcer. This schedule is called continuous reinforcement (CRF).
What is a partial or intermittent reinforcement?
Where only some of the occurrences of the response are reinforced
According to a continuous reinforcement, every time a rat hits the bar what should happen?
A food pellet should be delivered
Under intermittent (or partial) reinforcement schedules, the rat:
May be required to hit the bar 50 times to get the pellet
2.
Is reinforced only once every five minutes
What is the cumulative recorder?
A classic device that provides an easily read, graphic depiction of changes in the organisms rate of response over time…
What are the four simple reinforcement schedules?
Fixed ratio, variable ratio, fixed interval, variable interval
What is fixed ratio?
A reinforcer is delivered after every n responses, where n is the size of the ratio. Responding exhibits a ‘stop-and-go’ pattern (a high rate of responding along with a short pause following the attainment of each reinforcer… postreinforcement pause).
Eventually this pause gives way to an abrupt continuation of responding.
e.g. is the “piecework” method used to pay factory workers in some companies.
What is variable ratio?
The number of required responses is not constant from reinforcer to reinforcer. On average, a subject will receive one reinforcer for every n responses, but the exact number of reponses required at any moment may vary widely. The pattern of responding might be described as rapid and fairly steady.
With VR schedules of modest length, regular post-reinforcement pauses are found. Yet pauses on VR schedules are several times smaller than those found on FR schedules.
Why are the pauses on VR schedules much smaller than FR schedules?
Many forms of gambling are examples of VR schedules:
(1) A person’s chances of winning are directly proportional to the number of times the person plays
(2) the number of responses required for the next reinforcer is uncertain