learning and behaviour Flashcards

Question

Outline the Law of Effect

Answer 1

Proposed by Thorndike and is the core of instrumental learning Responses that produce a satisfying effect in a particular situation become more likely to occur again in that situation, and responses that produce a discomforting effect become less likely Satisfying outcomes “stamp in” the connection between the stimuli and the response. Instrumental behaviours are “controlled” by stimuli with which they are associated– The “discriminative stimulus

Answer 2

Stimulus-Response (S-R) theories focus on the direct association between stimuli and responses, emphasizing that behavior is a result of external stimuli. This theory posits that behavior can be predicted and controlled by managing the external stimuli Instrumental behaviour only becomes controlled by the situational cues if and only if these cues signal whether the response is going to be reinforced or not

Answer 3

Generalization occurs when an organism responds in a similar way to different but related stimuli Generalisation gradient: the greater the difference between testing and training cues, the worse performance becomes/the broader the generalisation becomes. If it's too broad, this is not an effective response anymore.

Answer 4

Ability to differentiate between stimuli and respond differently based on which stimulus is present. e.g training an animal to respond only to a specific tone for food, but not to other similar tones through successive discrimination Discrimination Training in behavioural psychology involves teaching an individual to respond differently to two or more similar stimuli. This is achieved by reinforcing a response to one stimulus (the S+) and not reinforcing the same response to another stimulus (the S-).

Answer 5

The peak shift effect occurs when, after discrimination training, the highest rate of response occurs in response to a stimulus that is further removed from the non-reinforced stimulus than the original reinforced stimulus. This effect shows a shift in the peak of the response rate away from the non-reinforced stimulus.

Answer 6

Discrimination Training: The dog learns to differentiate between the green and red lights and understands that sitting is rewarded in the presence of the green light but not the red light. Generalisation: After discrimination training, the dog also starts to generalise its response to other stimuli similar to the green light. The Peak Shift Effect: animal not only responds to the original green light but responds even more strongly to a new stimulus that is more distinct from the non-reinforced stimulus (the red light) than the original green light. This indicates that the animal is not just learning about the green light but is also learning about the red light, the animal is not just learning what to do but also what to avoid, and any stimulus further from the "avoid" stimulus is seen as even better or safer.

Answer 7

we can make a behaviour more or less likely to occur by controlling the cues that trigger it For example, insomnnia treatment involves restoring the association between bed and sleep - reduce all non-pleasureable non-sleep activity away from the bedroom

Answer 8

to reduce generalisaiton: in discriminatory training, discriminatory stimuli should be as similar as possible for accurate category formation/expertise: the more exemplars are used in training, the more accuracy in categorising things in new contexts

Answer 9

The study shows discrimination training because pigeons learned to tell apart Monet's and Picasso's paintings through reinforcement (rewarded for correct choices). It demonstrates generalisation because pigeons applied what they learned to new, unseen paintings by the same artists and even grouped similar styles (e.g., other Impressionists or Cubists), showing they responded to broader visual features rather than memorised images.

Answer 10

you don't need drive reduction to reinforce behaviour - e.g stimulating arousal is a potent reward, even if there is no ‘consummation' you don't need a reinforcer for learning to occur e.g latent learning

Answer 11

When the same S-R pair is repeated over time, the response to the stimulus can become automatic or habitual. This means that the behaviour is performed with little to no conscious thought and becomes resistant to change.

Answer 12

Contingency learning is when we learn that our actions cause certain outcomes. key features: Predictability: The learner comes to predict outcomes based on their actions. Contingency: There is a direct link, or contingency, between the action performed and the outcome experienced. High contingency situations are where a specific action reliably leads to a particular outcome.

Answer 13

The Delta-P rule posits that the strength of a learned behaviour is related to the difference in the probability of an outcome occurring in the presence of a stimulus versus its absence. Mathematically, this is expressed as: [ \Delta P = P(O \mid S) - P(O \mid \neg S) ] Where: ( P(O \mid S) ) is the probability of the outcome when the stimulus is present. ( P(O \mid \neg S) ) is the probability of the outcome when the stimulus is not present. Implications of the Delta-P Rule: This rule suggests that the greater the difference between these two probabilities, the stronger the associative learning or contingency. A positive Delta-P indicates that the presence of the stimulus increases the likelihood of the outcome, thereby strengthening the behavior. Conversely, a negative or zero Delta-P might imply no learning or a weak association.

Answer 14

Outcome devaluation occurs when the desirability or value of a reinforcer (outcome) is reduced after an association between a behavior and the outcome has been learned. This can happen through satiation (where a previously desirable outcome becomes less appealing after having too much of it) or by providing information that changes the perceived value of the outcome. Implications of Outcome Devaluation: The effect of outcome devaluation is critical for understanding the flexibility of behavior. It reveals whether a behavior is goal-directed (sensitive to changes in the outcome value) or has become habitual (insensitive to outcome changes). In goal-directed behavior, devaluing the outcome leads to a decrease in the associated behavior, as the behavior is performed with the expectation of obtaining the now devalued outcome. In habitual behavior, the performance of the behavior persists despite the devaluation of the outcome, indicating that the behavior has become automatic and no longer depends on the current value of the outcome.

Answer 15

Goal-directed behaviors are actions that are performed with an awareness of the relationship between the behavior and its outcome. These behaviors are: Flexible: They adjust based on the current value of their outcomes or goals. Sensitive to Outcome Devaluation: If the value of the outcome changes (as in outcome devaluation), the behavior will change accordingly. Driven by Expectations: The individual has expectations about the consequences of their actions, which guides their behavior. Habitual Behavior: Habitual behaviors, on the other hand, are actions that are performed automatically and repetitively without conscious thought about the outcome. These behaviors are: Rigid: They tend not to change even when the associated outcomes are no longer desirable or relevant. Insensitive to Outcome Devaluation: The behavior persists even if the outcome is devalued, indicating a disconnection between the action and the goal. Driven by Cues: The behavior is triggered by specific cues or contexts rather than by outcomes.

Answer 16

Amphetamines & Habitual Behaviour Study (Lever-Press Task) 🧪 Setup: Rats trained to: Press Lever 1 (R1) → Reward 1 (O1) Press Lever 2 (R2) → Reward 2 (O2) 💊 Groups: Amphetamine group: 7 days of amphetamine exposure Control group: received vehicle (no drug) Devaluation Phase: Before test: rats given free access to one reward → causes stimulus-specific satiety (i.e. they no longer value that reward) → this devalues that outcome Test Phase: Free choice between both levers No rewards given (extinction test) Rats must recall R-O (response-outcome) links to guide behaviour Results: Control rats (vehicle): Showed sensitivity to outcome devaluation Avoided pressing the lever linked to the devalued reward → Goal-directed behaviour Amphetamine rats: Continued pressing both levers equally Ignored the devaluation → Showed habit-like behaviour Conclusion: Amphetamine exposure increases habit formation Makes behaviour less flexible and less sensitive to changes in reward value

Answer 17

response/behaviour stops something bad response increases

Answer 18

Escape: learning to perform a behaviour to terminate an unpleasant stim Rat is usually on the one side of the box. The rat will see a 'warning stimulus' (WS) which indicates that the aversive stim (US) is coming. The rat will learn to jump onto the other side of the box to escape the aversive stimulus. This is called escape. Avoidance: learning behaviours to prevent the occurrence of an unpleasant stimulus before it starts (anticipation) Rats get more used to the experiment, when they see the WS, the rats will jump over to the other side of cage before the aversive stim has even happened. The WS is reinforcing this behaviour because it accurately predicts the aversive stimulus. This is why the rats respond to it consistently. These studies have found that avoidance behaviours are particularly resistant to extinciton.

Answer 19

the removal or non-presentation of an aversive stim following a particular behaviour (e.g leaping away) actually reinforces that behaviour --> something that doesn't happen strengthens the behaviour meant to avoid it highlights how behaviours are maintained when the expected punishment or unpleasant stimulus is never actually experienced e.g example, why would an organism continue to perform an avoidance behaviour if they never actually encounter the negative consequence that they are supposedly avoiding (anxiety applications)

Answer 20

Explains that avoidance behaviours are developed and maintained are maintained by fear. It explains that fear induced by CS (classical conditioning) drives avoidance behaviour, which is then maintained by negative reinforcement (operant conditioning) step 1: classical conditioning NS becomes CS through association of US. The CS then becomes signal for impending shock and now induces fear step 2: operant conditioning behaviour (jumping away) is reinforced by the cessation or avoidance of the shock after the CS is presented. behaviour is negatively reinforced because aversive stim is avoided

Answer 21

Definition and Function Definition: Safety signals are cues that predict the non-occurrence of an aversive or threatening event. Function: They function to reduce fear or anxiety by signaling that the environment is safe, thus inhibiting the fear response typically activated by a perceived threat. Formation Classical Conditioning: Safety signals are formed through classical conditioning, where a neutral stimulus is consistently paired with the non-occurrence of an expected aversive stimulus. Over time, this neutral stimulus becomes a conditioned signal of safety. Learning Process: The organism learns to associate the safety signal with the absence of threat, leading to a conditioned response of reduced fear or anxiety when the signal is present. Psychological Impact Reduced Anxiety: The presence of a safety signal can significantly reduce anxiety and stress by providing psychological assurance that no harm will occur. Neurobiological Modulation: Safety signals can modulate activity in brain areas involved in fear and anxiety, such as the amygdala, reducing physiological and emotional responses to fear. Difference from Superstitious Avoidance Basis of Association: Safety signals are based on a true predictive relationship between the signal and the non-occurrence of harm, whereas superstitious avoidance is based on incorrect or coincidental associations that do not actually influence outcomes. Behavioral Reinforcement: Safety signals are reinforced by the true absence of an expected negative outcome, leading to genuine reductions in fear. In contrast, superstititious avoidance behaviors are maintained by the erroneous belief that the behavior itself is preventing a negative outcome.

Answer 22

Systematic Desensitization Definition: Systematic Desensitization is a gradual, step-by-step process used to help individuals cope with fears and anxieties. Developed by Joseph Wolpe, it combines the principles of classical conditioning and relaxation techniques. Function: The primary function is to replace the fear response associated with a feared object or situation with a relaxation response. Process: Relaxation Training: The person is taught relaxation techniques, such as deep breathing or progressive muscle relaxation. Creation of Anxiety Hierarchy: The therapist and client develop a list of feared situations, ranked from least to most anxiety-provoking. Gradual Exposure: Starting with the least fearful situation, the person is exposed to these scenarios while practicing relaxation techniques. This exposure is gradual and only moves to more fearful situations once the individual can handle lower levels without significant anxiety. Flooding Definition: Flooding, or 'implosion therapy', involves exposing the person to their most feared stimuli for a prolonged period, without any gradual buildup. Function: The aim is to help the person face their fears directly and learn that the anxiety or fear will naturally decrease over time without any adverse consequences. Process: Full Exposure: Unlike systematic desensitization, flooding involves immediate and sustained exposure to the feared stimulus at its highest intensity. No Escape: The person is not permitted to escape or avoid the fear-inducing stimulus, which helps in extinguishing the fear response. Natural Decline of Fear: The therapy relies on the natural psychological phenomenon of 'habituation', where the fear response diminishes as the person learns that no harmful consequences are following the feared event.

Answer 23

A psychological model explaining how interpreting pain as threatening can trigger a cycle of fear, avoidance, and increased disability. Although avoidance reduces fear short-term, it worsens pain and functioning over time. Key Components: Pain Experience – Initial pain (e.g., injury) Catastrophizing – Exaggerated fear about pain → leads to anxiety Avoidance – Withdrawal from physical/social activity to prevent pain Disability – Physical deconditioning, reduced function, social isolation Pain Persistence – Less activity → increased sensitivity and chronic pain Why is avoidance reinforced? Avoidance provides short-term relief from fear/discomfort → becomes negatively reinforced, increasing the likelihood it will continue. Psychological Effects: ↑ Anxiety ↑ Pain sensitivity (pain feels worse) ↑ Hypervigilance (focus on pain)

Answer 24

immediacy consistency contingency intensity

Answer 25

increased anxiety: avoidance behaviour: modelling of aggressive responses:

Answer 26

S-R (Stimulus-Response) theories are a specific form of associative learning where a stimulus automatically triggers a response through repetition, without involving awareness of consequences—like flinching at a loud noise. In contrast, broader associative learning includes both S-R learning and operant conditioning, where behavior is influenced by its outcomes. For example, studying to get a good grade is operant conditioning because the behavior is shaped by reinforcement.

Answer 27

Associative learning is the broad process by which connections are formed between stimuli or between behaviors and their consequences. It includes classical and operant conditioning.

Answer 28

S-R (Stimulus-Response) theories are a subset of associative learning that focus on direct connections between a stimulus and a response, often seen as automatic or reflex-like behaviors strengthened through repetition and reinforcement.

Answer 29

Thorndike’s Law of Effect states that behaviors followed by satisfying outcomes strengthen the connection between stimulus and response, making the behavior more likely to occur again.

Answer 30

Hull’s theory suggests that behaviors are reinforced when they reduce biological drives (like hunger or thirst). This means the stimulus (drive) triggers a response that reduces the drive, reinforcing the behavior through the S-R connection.

Answer 31

Both explain how S-R connections are strengthened by consequences—Thorndike through satisfying or unpleasant outcomes, and Hull through the reduction of biological drives—highlighting mechanisms that reinforce behavior within associative learning.

Answer 32

Hull: feeling thirsty (stimulus/drive) leads to drinking water (response), which reduces thirst (drive reduction), reinforcing the behavior. Thorndike: studying hard (response) leads to good grades (positive outcome), reinforcing studying per the Law of Effect.

Answer 33

Definition: Learning that occurs without reinforcement and isn’t shown until there's a reason to demonstrate it. Not immediately visible — appears later when motivation is introduced. Tolman's research: What: Rats in mazes — explored without rewards Findings: Rats who explored without rewards still learned the maze. When food was introduced, these rats completed the maze faster than others. Conclusion: Rats had developed a "cognitive map" — proof of latent learning.

Answer 34

Assumptions: Classical conditioning (fear) must occur before avoidance learning. Fear motivates and maintains avoidance behaviour. There is a strong link between fear and avoidance. Challenges to the theory: Avoidance can occur without a warning signal (WS) E.g., Sidman (1953) avoidance procedure – no explicit WS present. Avoidance persists even when fear to the WS is low or absent Fear may decline, but avoidance continues. Kamin et al. (1963) Study: Investigated the relationship between fear and avoidance. Used a conditioned suppression test (WS suppression of other behaviour = fear index). Rats tested after 1, 3, 9, or 27 successful avoidance trials. Findings: Well-trained animals continued avoidance. Fear to WS decreased with more training (low suppression). Suggests avoidance can persist even when fear is minimal, though some fear remains.

learning and behaviour Flashcards

weeks 1-3 (64 cards)