Chapter 5: Operant Conditioning Learning the Outcomes of Behaviours Flashcards

1
Q

Definition of operant conditioning?

A

the process whereby organisms learn to make or to refrain from making certain responses in order to obtain or avoid certain outcomes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What did Edward Thorndike study?

A

how animals learn new behaviours, specifically, how cats learned to escape from puzzle boxes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is another name for operant conditioning?

A

instrumental conditioning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What did Edward Thorndike conclude from his study of cats behaviours?

A

when an animal’s response was followed by a satisfying outcome, then the probability of that response occurring again in the future would increase

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Who first discovered operant conditioning?

A

B. F. Skinner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Edward Thorndike’s theory of law and effect?

A

The probability of a particular behavioural response increased or decreased depending on the consequences that followed; an animal has a range of behaviors: behaviors that lead to positive consequences for the animal tend to persist; those that do not tend to die out

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Definition of discriminative stimulus?

A

a stimulus that signals whether a particular response will lead to a particular outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Definition of reinforcement?

A

the process of providing outcomes for a behaviour that increase the probability of that behaviour occurring again in the future

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why is it called discriminative stimulus?

A

is called a “discriminative stimulus” to emphasis that it helps the organism “discriminate” the conditions under which R(esponse) will lead to O(utcome).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

The process of providing an outcome for a behaviour that increases the probability of that behaviour is called?

A

reinforcement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Who is Edward Tolman?

A

Argued that rats are like humans in that they are intrinsically motivated to learn the general layout of mazes by forming what he called a cognitive map

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Why does Edward Tolman argue that the S(D) –> R framework was too limiting?

A

he believed that animals make responses because they (in some sense) understand that response R leads to a specific, predicted outcome O.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the S(D) –> framework?

A

In the presence of a particular stimulus, called the discriminative stimulus, or S(D), a particular response (R) may lead to a particular outcome (O).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

In the Edward Thorndikes puzzle box, what is the S(D), R & O?

A

S(D) : the box
R : sequence of movements needed to open the door
O : the escape

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When is the S(D) –> R association strengthened?

A

when Response is followed by a desirable Outcome

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Operant conditioning can be formulated as a three-part association, what is it?

A

Discriminative Stimulus –> S(D) Response R –> Outcome O

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Definition of Classical Conditioning?

A

Two stimuli are linked together to produce a new learned response in a person or animal. Unconscious Learning

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Difference between classical conditioning and operant conditioning?

A

In classical conditioning, organisms experience an outcome (the US) whether or not they perform the conditioned response (CR).

In operant conditioning, by contrast, the outcome O depends on whether the organism performs the response R.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the learning curve that both operant and classical share?

A

negatively accelerated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Whenever you have to decide whether a paradigm is operant or classical, focus on the outcome. How can this be done to identify whether it is operant or classical?

A

If the outcome occurs regardless of responding, then the paradigm is classical; if it is contingent on a response, then the paradigm is operant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

In operant conditioning, the outcome (O) only follows the discriminative stimulus (SD) if?

A

a particular response (R)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Since retiring, Jim spends a lot of time sitting on his back porch, watching the birds and whistling. One day, he scatters crumbs, and birds come and eat them. The next day, he sits and whistles and strews crumbs, and the birds return. After a few days, as soon as Jim sits outside and starts whistling, the birds arrive. Is this classical or operant?

A

operant

Stimulus: whistling
Response: birds arrive
Outcome: getting crumbs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Shevonne’s dog Snoopy is afraid of thunder. Snoopy has learned that lightning always precedes thunder, so whenever Snoopy sees lightning, he runs and hides under the bed. Is this classical or operant?

A

Classical

US: scared of thunder
UR: Run & Hide
CS: Lightning
CR: Run & hide

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Michael takes a new job close to home, and now he can walk to work. On the first morning, there are clouds in the sky. It starts to rain while Michael is walking to work, and he gets very wet. On the next morning, there are again clouds in the sky. Michael brings his umbrella along, just in case. When it rains, he stays dry. After that, Michael carries his umbrella to work anytime the sky looks cloudy. Is this classical or operant?

A

operant

Stimulus: clouds
Response: carry umbrella
Outcome: stay dry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

In Carlos’s apartment building, whenever someone flushes the toilet, the shower water becomes scalding hot, causing him to flinch. Now, whenever he’s in the shower and hears the noise of flushing, he automatically flinches, knowing he’s about to feel the hot water. Is this classical or operant?

A

Operant

Stimulus: noise of flushing toilet while in the shower
Response: flinching
Outcome: not being burnt by hot water

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

Who is the psychologist known as the “radical behaviourist”?

A

B. F. Skinner

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Definition of discrete trials?

A

Experimenter defined the beginning and end of each trial.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

What were Thorndikes procedures categorised by?

A

Discrete trials

29
Q

What is the definition fo free-operant paradigm?

A

The animal could operate the apparatus free whenever it chooses.

30
Q

difference between free-operant paradigm and discrete trials paradigm?

A

in free-operant the animal could operate the apparatus freely with no need of the experimenter bu in discrete, the experimenter must control it themselves.

31
Q

What is the Skinner box?

A

A conditioning chamber in which reinforcement or punishment is delivered automatically whenever an animal makes a particular response, in this case pressing the level.

32
Q

What did Skinner create the be able to measure behaviour more directly?

A

A cage, now known as the Skinner box, with a wall were food is delivered automatically. It contained a lever when pressed dropped food. Eventually the animal is learns to press the lever to relieve food.

33
Q

It the Skinner box operant or classical conditioning?

A

operant

34
Q

What is the discriminative stimulus added to the Skinner box?

A

light. When the lever is pressed, food will only come if the light is on

35
Q

Definition of operant response?

A

When a behavior is modified by its consequences, the probability of that behavior occurring again may either increase (in the case of reinforcement) or decrease (in the case of punishment).

36
Q

Definition of instrumental response?

A

any response that achieves a goal or contributes to its achievement, such as a response that is effective in gaining a reward or avoiding pain

37
Q

Skinner invented a means of recording responses automatically, what is it called?

A

Cumulative recorder

38
Q

What is a cumulative recorder?

A

A device that records behavioural responses

39
Q

How does a cumulative record work?

A

the pen moved each time the animal responded , but if it dint it would continue to draw a straight line, kind of like heart beat whenever the animal responded. The highest of the line represents the total number of responses that have been made in the entire experiment

40
Q

A response is defined not by a particular pattern of motor actions but rather by…?

A

… by the outcome it produces

41
Q

definition of shaping?

A

an operant condition technique in which successive approximations to a desired response are reinforced

42
Q

Successful shaping of behavior involves three components, what are they?

A

1) Clearly define final response you want performed
2) Clearly assess performance starting level
3) Divide progression from starting point to the final target behavior into appropriate training steps/successive approximations

43
Q

What is chaining?

A

organisms are gradually trained to execute complicated sequences of discrete responses.

44
Q

Example fo when Skinner used chaining?

A

trained a rat to pull a string that released a marble, then to pick up the marble with its forepaws, carry it over to a tube and drop the marble inside the tube.

45
Q

Difference between shaping and chaining?

A

Shaping: the learner learns by first approximately performing the goal behaviour.

Chaining: you take a multi-step task and break it down into a sequence of smaller tasks. Step-by-step.

46
Q

What is backward chaining?

A

Doing chaining, step-by-step, by in the reversed order

47
Q

Definition of magazine training?

A

training needed to familiarise an animal with the mechanism (usually a feeder) that delivers the reinforcer.

48
Q

What are the two instrumental conditioning procedures?

A

Appetite Stimulus

Aversive Stimulus

49
Q

Definition of reinforcer?

A

a consequence of behaviour that leads to increased likelihood of that behaviour occurring again in future

50
Q

Definition of primary reinforcer?

A

a stimulus such as food, water, sex, or sleep, that has innate biological value to the organism and therefore will tend to repeat behaviours that provide access to these things

51
Q

Who created the drive reduction theory?

A

Clark Hull

52
Q

What is the drive reduction theory?

A

proposes that all learning reflect the innate, biological need to obtain primary reinforcers

53
Q

Definition of reinforcement?

A

the process of providing outcomes (reinforces) that lead to INCREASED probability of behaviour

54
Q

Definition of secondary reinforcer?

A

Reinforcers that initially have no biological value but have been paired with primary reinforcers, for example, money. used to exchange for water, food, etc.

55
Q

Definition of token economy?

A

An environments in which tokens functions the same was as money does in the outside world

56
Q

Definition of Negative Contrast?

A

The phenomenon in which the reinforcing value of one reward is reduced because a better reward is expected

57
Q

What is a punisher?

A

a consequence of a behaviour that leads to decreased likelihood of that behaviour occurring again in the future

58
Q

What is punishment in operant conditioning?

A

the process of providing outcomes for a behaviour that decrease the probability of that behaviour occurring again in the future

59
Q

What is the negative contrast effect on the infant experiment?

A

infants will suck at a higher rate for sweet water than for plain water, indicating that the sweetened water is a preferred reinforcer. Infants who were given sweet water before given plain water sucked less vigorously than the infants who received plain water all along.

60
Q

What did Thorndike originally assume regarding punishers?

A

that punishers were simply the inverse of reinforcers; whereas reinforcement increases that probability

61
Q

What did Thorndike and Skinner both potentially conclude regarding punishment?

A

punishment is not nearly as effective as reinforcement at controlling behaviour

62
Q

Many modern researchers argue that punishment can indeed be very effective in modifying behaviour, what are the 4 factors that determine how effective the punishment will be?

A
  • Punishment leads to more variable behaviour
  • Discriminative Stimuli for punishment can encourage cheating
  • Concurrent reinforcement can undermine the punishment
  • initial intensity matters
63
Q

Common punishers for animals?

A

pain
confinement
exposure to predators

64
Q

Common punishers for humans?

A

fines
social disapproval
jail

65
Q

What is the process differential reinforcement of alternative behaviours
(DRA)?

A

a method to decrease a frequency of unwanted behaviours by instead reinforcing preferred alternate behaviours

66
Q

Examples of when differential reinforcement of alternative behaviours can be used?

A

children with autism of developmental disorders showing persistent habits of self-injurious behaviour.

So rather than punishing the child for each instance of the unwanted behaviour, you can reward instances of desired behaviour

67
Q

Difference between positive punishment & positive reinforcement?

A

c

68
Q

Difference between negative punishment & negative reinforcement?

A

c

69
Q

207

A

c