Throndike’s law of effect
Thorndike’s concept that a response followed by satisfying consequences will become more likely to occur, whereas a response followed by unsatisfying consequences will become less likely to occur
Skinner’s law of effect (reinforcement)
Skinner’s concept that any consequences of a response that increases the probability of that response, is a reinforcer. (Unlike Throndike, Skinner avoids reference to mental states)
Skinner’s law of effect (punishment)
Skinner’s concept that any consequences of a response that decreases the probability of that response, is a punisher.
operant conditioning
a type of learning in which behaviour is modified by its consequences, such as by reinforcement, punishment, and extinction
-For example, we learn that smiling at others is followed by a friendly greeting.
Reinforcement (operant conditioning)
the strengthening of a response by an outcome that follows it
punishment (operant conditioning)
a response is weakened by an outcome that follows it
ABCs of Operant Conditioning (contingencies)
Identify two key differences between classical and operant conditioning
discriminative stimulus
an antecedent stimulus that signals the likelihood of certain consequences if a response is made
Eg. Going on red light is punished by ticket/car crash
positive reinforcement
a response is strengthened by the subsequent presentation (appearance) of a (positive) stimulus
negative reinforcement
a response is strengthened by the subsequent removal of a (noxious) stimulus
operant extinction
the weakening and eventual disappearance of a response because it is no longer reinforced
positive punishment
occurs when a response is weakened by the subsequent presentation (appearance) of a (noxious) stimulus
Describe some disadvantages of using positive punishment to control behaviour
negative punishment
the removal of a (positive) stimulus following an undesired response to weaken it
(e.g., TV privileges are taken away from a misbehaving child who wants attention)
Describe some advantages of using negative punishment to control behaviour
primary reinforcers
positive reinforcers that satisfy biological needs, such as food and water
secondary (conditioned) reinforcer
a stimulus that acquires reinforcing qualities by being associated with primary reinforcers (eg. Money)
delay of gratification
the ability to forgo immediate rewards for delayed but more satisfying outcomes
shaping
an operant conditioning procedure in which reinforcement begins with a behaviour that the organism can already perform, and then is made contingent (dependent) on behaviours that increasingly approximate the final desired behaviour
chaining
an operant conditioning procedure used to develop a sequence (chain) of responses by reinforcing each response with the opportunity to perform the next response
operant generalization
an operant response occurs to a new antecedent stimulus that is similar to the original antecedent stimulus
operant discrimination
an operant response that occurs when a particular antecedent stimulus is present, but not when another antecedent stimulus is present
Ratio schedules
- Leads to high rates of response if ratio is not too big