Unit 4: Operant Conditioning Flashcards

1
Q

What’s the difference between instrumental and classical conditioning?

A

classical: presence or absence of stimulus causes response
operant: behaviour causes presence or absence of stimulus (consequence)

2
Q

Who did the study of instrumental conditioning start with and what was he interested in?

A

Edward Thorndike
interested in animal intelligence

3
Q

How were Thorndike’s experiments generally structured?

A

hungry animals placed in puzzle boxes
food outside of boxes but in view
-> animals had to learn how to escape the box to obtain food

4
Q

How did the animal’s behaviours change in the puzzle box?

A

initially unable to escape
slow to make the right response
with continued practice, escape latencies became shorter

5
Q

How did the animals learn how to solve the puzzle box?

A

trial and error to discover behaviour required to escape
successful behaviours retained
useless behaviours eliminated

6
Q

How did Thorndike label the animals’ ability to learn how to escape a puzzle box?

A

animal intelligence

7
Q

Why is animal intelligence not an accurate term?

A

many behaviours seem unintelligent
the animal initially shows various responses typical of a confined animal, some of which happen to lead to a desirable result
those consequences reinforce the action
-> the cat doesn’t understand how the lever works; it presses it because that response has been strengthened by its consequences

8
Q

Law of Effect

A

If a response (R) in the presence of a stimulus (S) is followed by a satisfying event -> the S-R association is strengthened
If it is not followed by a satisfying event (or is followed by an annoying one) -> the S-R association is weakened

9
Q

What do we measure in discrete-trial procedures?

A

rat runs down a runway/maze to get a reward
measures response latency (time it takes the rat to leave the start box) and running speed (how fast it reaches the goal box)

10
Q

What’s a T maze trial?

A

type of discrete-trial procedure
allows us to measure percentage of correct choices

11
Q

What are trials?

A

specific periods of time during which the animal can show instrumental responses
set by the experimenter

12
Q

Why didn’t Skinner use discrete-trial procedures?

A

behaviour is continuous (one response leads to the next)
-> behaviour is observed more naturally if the animal isn’t removed from the apparatus between trials
behaviour can be broken down into measurable units: operants

13
Q

What is magazine training?

A

the sound of the food dispenser (CS) is paired with food delivery (US) via classical conditioning
the sound comes to elicit a sign-tracking response (approaching the food cup)

14
Q

How does response shaping work? (example: rat in Skinner box)

A

after magazine training, rat can learn operant response
1. food given if rat goes on hind legs anywhere in chamber
2. food given if rat leans over lever
3. food given if rat goes up on hind legs and presses lever
=> sequence called shaping/ reinforcement of successive approximations

15
Q

What are operant responses in free-operant procedures measured as?

A

rates

16
Q

Response rate

A

frequency of instrumental behaviours occurring
high: high probability of behaviour occurring
low: low probability of behaviour occurring

17
Q

How can we differentiate outcomes?

A

appetitive vs aversive
positive vs negative

18
Q

Which components do all instrumental conditioning procedures involve?

A

instrumental response
outcome (reinforcement, punishment)
stimulus
association between response and outcome

19
Q

positive reinforcement

A

behaviour produces (adds) appetitive outcome

20
Q

negative reinforcement

A

behaviour removes or prevents (subtracts) an aversive outcome

21
Q

positive punishment

A

behaviour produces aversive outcome

22
Q

negative punishment

A

behaviour removes or withholds (subtracts) an appetitive outcome

23
Q

Can a behaviour always be reinforced (by anything)?

A

no, only if behaviour is naturally linked to reinforcement
e.g. you can’t reinforce yawning in cats with release from a box, because yawning isn’t naturally linked with release from confinement

24
Q

What does the presence of a stimulus activate?

A

behaviour system related to that stimulus
e.g. hunger (S) causes hamsters to start digging and scrabbling (behaviours in the system linked to hunger), while stopping self-care behaviour (a behaviour that doesn’t address hunger)

25
Q

What does instrumental conditioning depend on with regards to the reinforcement?

A

quality and quantity of reinforcement
nature of reinforcement
previous reinforcements for same instrumental behaviour

26
Q

Behavioural contrast effect

A

big reward perceived as especially good after small reward and vice versa

27
Q

Which types of relationships between response and reinforcement are there?

A

temporal relationship: contiguity
causal relationship: contingency

28
Q

Are temporal and causal factors dependent on each other?

A

no

29
Q

What can we say about temporal relations?

A

immediate reinforcement is preferable to delayed reinforcement

30
Q

credit assignment

A

the problem of linking a specific behaviour to the reinforcer it earned
-> if too much time passes between response and reinforcement, we can’t tell which behaviour deserves the credit for the reward

31
Q

What’s more important to create associations, contingency or contiguity?

A

contiguity

32
Q

The fact that a behaviour occurred just before the reinforcement was more important than whether it caused the reinforcement. What is this kind of reinforcement called?

A

adventitious/ accidental reinforcement

33
Q

learned helplessness

A

after repeatedly experiencing an aversive state one cannot control
-> feeling of being incapable of changing the situation

34
Q

Why does the reinforcer not occur after every response in instrumental conditioning procedures?

A

reflects nature of real world

35
Q

What’s a schedule of reinforcement?

A

rule that determines how and when a reinforcer follows a response

36
Q

Ratio schedules

A

reinforcer occurs after a certain number of responses

37
Q

continuous reinforcement schedules

A

reinforcement delivered after every response
commonly used in drug abuse treatments

38
Q

Is continuous reinforcement common in real life?

A

no

39
Q

Token economies

A

reward system where tokens can be exchanged for bigger rewards
used to reduce disruptive behaviours

40
Q

Partial/ Intermittent reinforcement

A

reinforcement only occurs sometimes

41
Q

fixed-ratio schedules

A

the ratio of responses to reinforcers is fixed (e.g. FR 10: every 10th response is reinforced)

42
Q

Are continuous reinforcement schedules a type of fixed-ratio reinforcements?

A

yes (continuous reinforcement = FR 1)

43
Q

How strong is the rate of responding generated by continuous reinforcement?

A

steady and moderate

44
Q

What is steady responding PRECEDED by? (in fixed-ratio schedules)

A

a brief pause (the post-reinforcement pause)

45
Q

How/ where do we see rates of responding?

A

in cumulative records
-> show total number of responses during a period of time

46
Q

How do cumulative recordings work?

A

paper moves horizontally at a constant speed while a pen rests on it
the pen moves up one step after each response
-> horizontal distance: time between responses
-> slope: rate of responding (number of responses per unit of time); see the sketch after this card
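
A minimal Python sketch of the idea above (the response timestamps and time windows are made-up illustration values, not data from this deck): the cumulative record gives the running total of responses, and its slope over a window is the rate of responding.

# Minimal sketch: build a cumulative record from response timestamps (seconds).
# The timestamps are made-up illustration values, not real data.
response_times = [2.0, 3.1, 3.9, 4.5, 9.8, 10.2, 10.9, 11.5, 12.0]

# Cumulative record: after each response, the total number of responses so far.
cumulative = [(t, i + 1) for i, t in enumerate(response_times)]

# Local rate of responding (= slope of the record) over a time window.
def rate(times, start, end):
    n = sum(start <= t < end for t in times)
    return n / (end - start)              # responses per second

print(cumulative)                          # (time, total responses so far)
print(rate(response_times, 0, 5))          # 0.8 resp/s: steep slope, high rate
print(rate(response_times, 5, 10))         # 0.2 resp/s: shallow slope, a pause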

47
Q

What does an increase in fixed-ratio requirements tend to cause?

A

increase in post-reinforcement pause despite no change in rate of responding during ratio run

48
Q

What’s ratio strain and what causes it?

A

caused by dramatic increases in fixed-ratio requirements
-> periodic pauses during ratio run
-> in extreme cases responses stop completely

49
Q

Variable-ratio schedules

A

number of responses required to achieve reinforcement varies
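
A minimal Python sketch (illustrative, not part of the original deck) contrasting the fixed-ratio and variable-ratio rules from the surrounding cards; the ratio value 10 and the way the variable requirement is drawn are assumptions for illustration.

import random

# Fixed-ratio (FR): every n-th response is reinforced.
def fixed_ratio(n):
    count = 0
    def respond():
        nonlocal count
        count += 1
        if count >= n:                    # requirement met
            count = 0
            return True                   # deliver reinforcer
        return False
    return respond

# Variable-ratio (VR): the requirement varies unpredictably, averaging ~n.
def variable_ratio(n):
    count = 0
    requirement = random.randint(1, 2 * n - 1)
    def respond():
        nonlocal count, requirement
        count += 1
        if count >= requirement:
            count = 0
            requirement = random.randint(1, 2 * n - 1)
            return True
        return False
    return respond

fr10 = fixed_ratio(10)      # predictable: reinforcer after every 10th response
vr10 = variable_ratio(10)   # unpredictable: reinforcer after ~10 responses on average
print(sum(fr10() for _ in range(100)))   # exactly 10 reinforcers per 100 responses
print(sum(vr10() for _ in range(100)))   # roughly 10, varies from run to run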

50
Q

Do variable-ratio schedules cause post-reinforcement pauses?

A

no, they are less likely to cause pauses

51
Q

Why don’t variable-ratio schedules cause post-reinforcement pauses?

A

subject doesn’t know how many responses are required for reinforcement
-> maintains stable rate of responding

52
Q

Is fixed or variable-ratio reinforcement more effective for long term effects?

A

variable-ratio

53
Q

What’s the difference between variable reinforcement and intermittent reinforcement?

A

Intermittent reinforcement: the broad category; includes variable and fixed ratio/interval schedules; rewards are only given sometimes
variable reinforcement: reinforcement is given after an unpredictable number of responses or amount of time; it can’t be a fixed number of responses or a fixed time

54
Q

What can be said about the effects (rate of responding and resistence) of variable and intermittent reinforcement?

A

variable: higher rate of responding, less resistant to extinction
intermittent: lower rate of responding, more resistant to extinction

55
Q

What’s an interval schedule?

A

responses are only reinforced if the response occurs after a certain amount of time

56
Q

Fixed-interval schedules

A

amount of time that has to pass before response is reinforced is constant

57
Q

Do fixed interval schedules always lead to the reinforcement being delivered after a certain amount of time?

A

no, the interval only determines when a reward is available, not when it’s delivered
-> response still has to occur
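
A minimal Python sketch of the point in this card (illustrative, not from the deck): under a fixed-interval schedule the elapsed interval only makes the reinforcer available; it is delivered only when a response then occurs. The 60-second interval and the timestamps are assumed example values.

# Fixed-interval (FI) sketch: the reinforcer becomes AVAILABLE once the interval
# has elapsed, but it is DELIVERED only when the next response occurs.
INTERVAL = 60.0                            # assumed example: FI 60 s
last_reinforcer_time = 0.0

def respond(t):
    """Return True if a response at time t (seconds) is reinforced."""
    global last_reinforcer_time
    if t - last_reinforcer_time >= INTERVAL:   # reward is available
        last_reinforcer_time = t               # a new interval starts at delivery
        return True
    return False                               # too early: response goes unreinforced

print(respond(30.0))    # False - responding early earns nothing
print(respond(75.0))    # True  - first response after the interval is reinforced
print(respond(90.0))    # False - the new interval has only just begun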

58
Q

Variable interval schedule

A

amount of time that passes between a reference point and response that produces a reward varies

59
Q

Do fixed interval schedules also lead to post-reinforcement pauses (and a higher response rate before the delivery of the next reinforcement)?

A

yes

60
Q

Do variable-ratio and interval schedules cause the same effects?

A

no, variable-ratio leads to higher response rate while variable-interval leads to more steady response rate

61
Q

Why are the different effects of ratio and interval schedules relevant for us?

A

implications for human behavior (motivation)

62
Q

What happens in concurrent schedule trials?

A

different responses can be associated with different reinforcement on different schedules of reinforcement
(your responses in a situation are rewarded at different rates depending on the behavior, so you have to choose which behavior to carry out)

63
Q

How do we measure choice behaviour?

A

assessing how responses are distributed across response alternatives

64
Q

How do you mathematically measure response rates of concurrent schedules?

A

responses on A (e.g. total left responses) ÷ total responses on A and B (left + right)

65
Q

How do you mathematically measure the rate of reinforcement (concurrent schedules)?

A

Total reinforcements for option A / Total reinforcements (A&B)

66
Q

What does the matching law describe?

A

rate of responding on the alternatives matches the rate of reinforcement (if 40% of all reinforcers come from A -> the animal responds on A 40% of the time; see the worked example below)
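
A short worked check of the matching law; the response and reinforcer counts below are made-up illustration values.

# Matching law check with made-up counts (illustration only).
responses_A, responses_B = 200, 300              # e.g. left vs right lever presses
reinforcers_A, reinforcers_B = 40, 60            # reinforcers earned on each option

relative_responding    = responses_A / (responses_A + responses_B)        # 0.4
relative_reinforcement = reinforcers_A / (reinforcers_A + reinforcers_B)  # 0.4

# Matching law: the two proportions are (approximately) equal.
print(relative_responding, relative_reinforcement)   # 0.4 0.4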

67
Q

What are the two perspectives, with which you can answer the question as to what motivates instrumental behavior?

A

Associative structure of instrumental conditioning (molecular perspective): how stimuli, responses and outcomes are related
Response-allocation approach (molar perspective): how instrumental behavior relates to long-term goals

68
Q

Does instrumental responding only involve the response and a reinforcement? If no, what’s missing?

A

No
the environment

69
Q

What are the three events of instrumental responding?

A

Contextual Stimuli
instrumental response
response outcome (weakens or strengthens the response)

70
Q

What are habits and how much of our behavior do they constitute?

A

things we do automatically
45% (estimate)

71
Q

Two-process theory

A

two types of learning: Pavlovian and instrumental
S-O association established through classical conditioning activates reward expectancy/ emotional state
-> depending on nature of emotional state (appetitive/ aversive): motivates instrumental behavior

72
Q

What does the associative analysis of instrumental conditioning view behavior as?

A

result of associations between stimuli, responses and outcomes

73
Q

What is instrumental behavior motivated by? (context: organism’s behavior)

A

restricting the organism’s natural behavior (only one response leads to the reward; the rest don’t)

74
Q

Consummatory-response theory

A

it isn’t the reinforcer stimulus itself that is reinforcing, but the consummatory behavior it elicits (e.g. eating, drinking)

75
Q

What does the Premack principle state?

A

high-probability behaviors can be effective reinforcements for low-probability behaviors

76
Q

How is a high probability of reinforcing behavior maintained in instrumental conditioning procedures?

A

restricting access to reinforcement

77
Q

Response deprivation hypothesis

A

restricted access to reinforcement is critical for motivating instrumental responding

78
Q

Instrumental contingency & behavioral bliss point

A

the instrumental contingency ties access to the reward to performance of the instrumental response (e.g. time on Facebook must equal time spent studying)
-> behavioral bliss point: the preferred, unconstrained distribution of activities, which the contingency pushes behavior away from (see the sketch after this card)
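
A small numeric sketch of the bliss-point idea, assuming the common minimum-deviation reading (behavior settles at the point allowed by the contingency that lies closest to the bliss point); the minute values are made up for illustration.

# Bliss point (illustrative numbers): left unconstrained, the subject would spend
# 60 min on Facebook and 15 min studying.
bliss_facebook, bliss_study = 60.0, 15.0

# The contingency forces Facebook time to equal study time, so only points (x, x)
# are allowed; behavior settles at the allowed point closest to the bliss point.
def squared_deviation(x):
    return (x - bliss_facebook) ** 2 + (x - bliss_study) ** 2

candidates = [i * 0.5 for i in range(0, 151)]     # x from 0 to 75 min in 0.5-min steps
best = min(candidates, key=squared_deviation)
print(best)   # 37.5 min each: a compromise between the preferred 60 and 15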

79
Q

What does behavioral economics study?

A

which alternative reinforcers are available
how the alternatives are related to the reinforcer in question
the costs of obtaining the alternatives
-> to understand what motivates instrumental responding

80
Q

demand curve

A

how consumption of products is influenced by the price

81
Q

elasticity of demand

A

how much consumption varies in relation to increasing costs
high elasticity = consumption is strongly affected by price increases (see the worked example below)
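
A small worked example of elasticity of demand; the price and consumption figures are made up for illustration.

# Elasticity of demand with made-up numbers (illustration only).
price_old, price_new = 1.0, 1.5            # cost per reinforcer rises by 50%
consumed_old, consumed_new = 100, 25       # consumption drops by 75%

pct_change_consumption = (consumed_new - consumed_old) / consumed_old    # -0.75
pct_change_price       = (price_new - price_old) / price_old             # +0.50

elasticity = pct_change_consumption / pct_change_price
print(elasticity)   # -1.5: consumption is highly sensitive to price (elastic demand)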

82
Q

Does elasticity of demand depend on the availability of other products?

A

yes
more options = higher elasticity

83
Q

What else influences how much an increase in price affects consumption? (not availability of alternatives)

A

income
instrumental contingencies