Quiz 4 Flashcards
(41 cards)
How did Edward Thorndike’s cats get out of the puzzle?
Instrumental learning, behavior is more liekly when rewards come out.
- trial and error
Where does Operant Conditioning come from
B.F Skinner expanded on Thorndikes work. He said that learning was like natural selection for behaviour, beneficial behaviour “survives” and increases, harmful behaviour gets weeded out by selection
What is the Skinner Box?
box that held rats, and they had a lever to press. They were rewarded when the lever was pressed. This was recorded by a cumulative recorder (shows the number/timing of lever presses)
ABCs of learning and person
B.F Skinner
Antecedent
Behaviour
Consequence
What is antecedent
the stimuli present before behaviour (context or setting)
what is consequence
reinforcement-> something that increases behaviour
punishment-> something that decreases a behaviour
What is a discriminative stimulus
it is an antecedent stimulus that signals certain consequences, if a response is made
What is reinforcement
behaviour is more likely
- positive= something added to the person
- negative= something is taken away from the person
What is punishment
behaviour is less likely
- positive= something being added to the person
- negative= something being taken away from the person
Primary consequence
holding biological importance
- food, water, pain
Secondary consequence
only important because of the relationship to primary consequence.
- money (to buy food)
Animals and consequence
they learn immediate consequences faster
Humans and consequence
delayed gratification (waiting for something to get a better reward/ payout) - children who do not understand this concept have a hard time with frustration and stress later in life
Operant extinction
- much like classical conditioning extinction
- consequence is no longer paired with the response- you unlearn the connection
- some behaviours are more resistant to extinction
Operant generalization
much like generalization in classical conditioning - organisms BEHAVIOUR generalizes to similar ANTECEDENTS
(dogs generalizes the word sit to everyone, it will listen to every pitch say it)
Operant discrimination
- much like classical conditioning
- organisms learn to discriminate between ANTECEDENTS
(kids learns to not swear in front of adults, but its ok to in front of other kids)
What is shaping behaviour
- reinforcing behaviours that are closer and closer to the desired one.
(shaping the professor to teach only on the left side of the classroom)
What is chaining in training
- developing a sequence of responses by successively reinforcing the next behaviour in line
(circus, amazing dog trainers)
What is a schedule of reinforcement
patterns/frequencies of reinforcement
Continuous reinforcement
- every response or act is reinforced
- fastest to learn
easy to tell when reinforcement has stopped
Partial reinforcement
- only some responses or acts are reinforced
- aka intermittent reinforcement
- 4 types
What are the 4 types of partial reinforcement and meaning
- fixed ratio
- variable ratio
- fixed interval
- variable interval
ratio- certain proportion of responses (50% of beh is reinforced)
interval- certain amount of time must pass (get reinforced every 5 mins)
fixed- always the same amount
variable- varies around an average
Fixed ratio (partial reinforcement)
- reinforcement always after a fixed number of responses
- produces high rate of responding
- usually brief pause in responding after each reinforcement
Variable ratio (partial reinforcement)
- reinforcement given after a variable number of responses
- produces highest rate of responding
- most resistant to extinction