Learning theory Flashcards
Ivan Pavlov
Russian- classical conditioning (Pavlovian conditioning). Salivation in dogs
John Watson
-Father of behaviorism
-American
-CC
-classical conditioning can explain all aspects of behavior
- baby Albert - pairing rat with loud noise
Edward thorndike
-basis of learning relies on the association between stimulus and a response (action taken by animal)- called a connection
-most learning stems from trial and error
- problem box
- law of effect- strengthening or weakening of a connection, depending on the consequences
- good experiences increase likelihood of behavior
-bad experiences reduce likelihood
-foundation for the concept of operant conditioning
Burrhus Frederick Skinner
- father of operant conditioning
-Learning happens as the result of the causes of an action and it’s consequences
-operant conditioning - action and consequence
-introduces the term reinforcement
David premack
-premack principle; animals more likely to do behaviors they enjoy than not enjoy.
-animal will perform lower probability behavior in order to perform the higher probability behavior
Classical conditioning
Learning through association.
Three stages of classical conditioning
- Before conditioning- unconditioned stimulus (US) (food) and unconditioned response (UR or UCR) (natural response) - no learning has taken place yet
- During conditioning - pairing neutral stimulus (NS) and Conditioned stimulus (CS)
- After conditioning- UR becoming conditioned response (CR) - dog salivates at sound of bell
- Second order conditioning- squeaky hinge to food cabinet predicts food bag rustle, predicts meal time
Classical conditioning learning modes
- Acquisition- timing is important
- Extinction
- Spontaneous recovery
Extinction
Decrease in the conditioned response when CS is presented WITHOUT the US
Spontaneous recovery
After extinction, if there is a period of rest, you can see old association come back
Classical counterconditioning
-Change response to previously learned stimuli
-new conditioned emotional response
Desensitization
-typically used with CC
-gradual exposure to stimulus
- dog stays under threshold
Operant conditioning
-learning associations between behaviors and consequences
- thorndikes law of effect
-ABCs of learning
Operant
Any behavior that operates on the environment to generate consequences
Reinforcement
Increases behavior
Punishment
Decreases behavior
Positive
Stimulus is added
Negative
Stimulus is removed
Primary reinforcer
Anything that is biologically important to the survival of an animal- food, water, sleep, touch, pleasure, access to mates, going to the bathroom.
- also called unconditioned reinforcers
Secondary reinforcer
Sound of a clicker
Continuous reinforcement schedule (CRF)
For every correct response, a reward is given (most often used during acquisition)
Intermittent reinforcement schedule
Not every correct response receives a reward
Fixed interval (FI)
fixed amount of time between rewards. Leash productive and most likely to lead to extinction
Variable interval (VI)
Changing, unfixed interval of time between reinforcement. Dog is rewarded for sit stay after random amount of time