Chapter 4: Learning Flashcards
Learning
- A relatively permanent change in behaviour that occurs as a result of experience
- Can occur intentionally or unintentionally
Behaviourist approach to learning
- Learning via interactions with stimuli in the environment
- E.g. classical and operant conditioning
Classical conditioning
Ivan Pavlov
- Three-phase learning process (before, during and after)
- Involves associating neutral and unconditioned stimuli to produce a conditioned response
- Occurs through repeated association of different stimuli
NOTE: Classical conditioning is sometimes called respondent conditioning.
Stimulus vs response
-
Stimulus – object / event that elicits a response
- E.g. the sound of a gun shot, the smell of food
-
Response – reaction by an organism to a stimulus
- E.g. wincing, salivating, blinking, vomiting
NS, UCS, CS, UCR and CR
Classical conditioning
-
NS – does not normally produce a predictable response
- Not learned (e.g. bell causes no natural response)
-
UCS – produces a naturally occurring, automatic response
- Not learned (e.g. food causes salivation)
-
CS – initially N but becomes associated w/ UCS & elicits CR
- Learned (e.g. bell becomes associated with food and causes salivation)
-
UCR – occurs automatically when the UCS is presented
- Not learned (e.g. salivating due to food)
-
CR – learned / acquired response to the CS
- Learned (e.g. salivating due to bell)
Factors that influence classical conditioning
-
Nature of the response
- UCR must be an automatic / involuntary response
- Not necessarily ‘thoughtless’ (responses can be ancipatory or due to an expected event)
-
Association or linking of stimuli
- No association between stimuli = no conditioning
- Association is due to contiguity (linking stimuli that occur close together in time and/or space)
-
Timing of the NS and UCS pairing
- Must present NS before UCS in close succession (not more than half a second before)
Three stages of classical conditioning
-
Phase 1: Before conditioning
- NS elicits no relevant response (bell → no salivation)
- UCS elicits UCR (food → salivation)
-
Phase 2: During conditioning
- NS repeatedly paired with UCS to produce UCR
- Pairing occurs many times in close succession
- NS must come before UCS
- E.g. bell + food → salivation
-
Phase 3: After conditioning
- NS becomes a CS
- CS produces CR (bell alone → salivation)
Must specify WHAT causes the response (e.g. salivation to food OR bell).
NOTE: To gain full marks, answers must include these three stages with reference to the specific details in the question.
Why are negative stimuli (e.g. an electric shock) effective in establishing avoidance responses to formerly neutral stimuli?
- Produce a reflexive fear response & initiates emotional learning via amygdala (creates strong association w NS)
- Learning occurs fast if UCS relates to survival (e.g. pain)
Operant conditioning
B. F. Skinner
- Consequences of voluntary behaviour determine how likely it is to be repeated in the future
- I.e. an organism will tend to repeat a behaviour that has desirable consequences and not repeat a behaviour that has undesirable consequences
Operant
- Any response (or set of responses) that acts on the environment to produce some kind of consequence
Three stages of operant conditioning
-
Antecedent (A) – stim that occurs before the behaviour
- Allows us to predict likely consequence of behaviour
- E.g. the word ‘Men’ on a toilet door
-
Behaviour (B) – behaviour that occurs due to antecedent
- E.g. entering if male
-
Consequence (C) – what happens after the behaviour
- E.g. emptying a full bladder
NOTE: Antecedents (aka discriminative stimuli) tend to be general and do not directly produce the response.
Reinforcement
- A process whereby a stimulus increases the likelihood of repeating a desirable behaviour in the future
- Only used after the desired or correct response is made
- Can be positive or negative
Positive vs negative reinforcement
-
Positive – addition of a pleasant stimulus
- E.g. praising a student (+) for contributing in class
-
Negative – removal of an unpleasant stimulus
- E.g. driving within the speed limit to avoid fines (–)
NOTE: Both positive and negative reinforcement increase the likelihood of repeating a desired (good) behaviour in the future.
Punishment
- A process whereby a stimulus decreases the likelihood of repeating an undesired behaviour in the future
Positive vs negative punishment
-
Positive – addition of an unpleasant stimulus
- E.g. being given extra chores (+) / time-out (+) due to poor behaviour
-
Negative – removal of a pleasant stimulus
- E.g. loss of money (–) to a fine due to speeding
NOTE: Both positive and negative punishment decrease the likelihood of repeating an undesired (bad) behaviour in the future.
Factors that influence the effectiveness of reinforcement and punishment (OTA)
-
Order of presentation
- Reinforcement / punishment occurs after behaviour
- Allow us to associate a consequence w/ a behaviour
-
Timing
- Most effective when reinforcement / punishment occurs immediately after response has occurred
- Delayed conseq to behaviour = slow / no learning
-
Appropriateness
- Reinforcers must be pleasant for specific recipient
- Punishers must be unpleasant for specific recipient
Stimulus generalisation and discrimination
Operant conditioning
-
Generalisation – behaviour applied to a different stimulus, hoping to receive the same consequence
- E.g. doing well on a Biology exam after studying hard, so doing the same for other subjects
-
Discrimination – behaviour only occurs in response to the original stimulus
- E.g. working hard in one subject but not others
Classical vs operant conditioning
-
Role of the learner
- C – passive (no control over presentation of stimuli)
- O – active (control over behaviour)
-
Timing of the stimulus and response
- C – response depends on stimulus occurring first
- O – reinforcement/punishment depends on the behaviour occurring first
-
Nature of the response
- C – involuntary (usually reflexive/automatic)
- O – voluntary (consciously initiated)
Similarities between classical and operant conditioning
-
Three-phase model
- Before, during and after conditioning & ABC
-
Not necessarily permanent
- I.e. extinction can occur
- Require the association between different stimuli
Extinction
- Learned responses ≠ permanent (can be lost)
- True for both classical and operant conditioning
NOTE: Details about extinction are not examinable.
Spontaneous recovery
- The reappearance of the CR at the presentation of the CS after extinction and a rest period
- Can occur multiple times, but never as strong as the og CR
Positive reinforcers vs rewards
- Similarity – both have satisfying / desirable consequence
- Difference – rewards are not reinforcers unless they promote behaviour, whereas reinforcers are both rewarding and promote behaviour
Social-cognitive approach to learning
Albert Bandura
- Emphasises the social context & cognitive processes that influence the person and the learning process
- Demonstrates that learning can involve being conditioned by observing someone else’s conditioning
Observational learning
Social-cognitive learning
- Acquisition of info, skills or behaviour through watching the performance of others (directly or indirectly)