Learning Flashcards

(59 cards)

1
Q

What is learning?

A

A long lasting change in behavior caused by the environment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the law of effect?

A

proposed by Thorndike

a response followed by a favourable consequence will be more likely to occur again

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a response?

A

an item of behaviour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a reinforcer?

A

a favourable consequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does the law of effect say will happen?

A

A response followed by a reinforcer will increase where all other unsuccessful responses will decrease and disappear

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an experimental chamber?

A

A chamber where controlled experiments on animals occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is needed in a control environment?

A
a response (the bar to be pulled)
a reinforcer (the food)
a stimuli (the sound of the hopper)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is operant or instrumental behaviour?

A

behaviour key to the law of effect

operant behaviour is behaviour controlled by its own consequences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is operant conditioning?

A

learn an association between a response and its consequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a contingency?

A

the response causes the reinforcer. Or the reinforcer is contingent on the response

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is shaping?

A

the method of successive approximations.

Reinforcing closer and closer approximations of the response that we want.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are some hints for shaping?

A

identify a suitable reinforcer
reinforce as immediately as possible
too many reinforcers and the animal will satiate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is postive reinforcement?

A

The adding of a favourable event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a postivie contingency?

A

if the event is presented

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a negative contingency?

A

If the event is withdrawn

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is punishment?

A

an adversive event. event is less likely to occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is reinforcement?

A

a favourable event. event is more likely to occur

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Escape and avoidance are two types of what?

A

negative reinforcement

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is escape?

A

response terminates ongoing event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is avoidance?

A

response prevents onset of aversive event

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the Miller-Mowrer box?

A

the box where the dog jumps to avoid getting shocked. initially escape but as he learns it becomes avoidance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the avoidance paradox?

A

Where youre not really avoiding the consequence but you’re avoiding the conditioned anxiety that comes with the consequence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

What is a primary reinforcer?

A

reinforce behaviour because of their innate biological significance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is a secondary reinforcer?

A

previously neutral stimuli that have acquired reinforcing properties by being paired with primary reinforcers in the organism’s history. Also called conditioned reinforcers because reinforcing property is learned not innate.

25
What is continuous reinforcement?
every response is reinforced
26
What is intermittent reinforcement?
not continuous. usually adhering to a schedule of reinforcement. Invented by skinner so he could go away for the weekend
27
What is the partial reinforcement effect?
responding maintained by intermittent reinforcement persists much longer during extinction than one maintained by continuous reinforcement
28
What are schedules of reinforcement?
rules that specify when a response will be reinforced
29
What is a ratio schedule?
a schedule based on the number of responses emitted
30
What is an interval schedule?
A schedule based on the time thats lapsed.
31
What is a cumulative record?
a graph showing the different patterns of learning behaviour based on the schedule used
32
What is a fixed ratio schedule?
reinforcement is contingent on the last of a fixed number of responses emitted since the last reinforcer. Has a burst and break pattern with the break after every reinforcer
33
What is a variable ratio schedule?
reinforcement is contingent on the last of a variable number of responses emitted since the last reinforcer. VR 10 means every 10th response on average is reinforced. High constant rate of responding
34
What is a fixed interval schedule?
a response is reinforced after a fixed amount of time has elapsed since the last reinforcer. Like FR but with a smoother transition from break to burst o becomes scalloped. fixed intervals on the x axis.
35
What is a variable interval schedule?
A response i reinforced after a variable amount of time has elapsed since the last reinforcer. Like VR but usually a lower response rate. Smooth as again can't tell when reinforcers are due.
36
What is differential reinforcement of other behavior (DRO)?
How to get rid of a behaviour. A reinforcer is given when a fixed amount of time has elapsed since the last response. explicitly reinforces not responding. Also called omission training eliminates behaviour more effectively than extinction
37
What is a Fixed Time schedule?
a reinforcer is delivered when a fixed amount of time has elapsed since the lsat response. Like FI but no response required so. non-contingent or response independent.
38
What happens when there is no response/reinforcer contingency?
If by chance a response is followed by a reinforcer you get superstitious responding. with adventitious reinforcement of whatever response was occurring just before the the reinforcement. so superstition may be failure to discriminate between a lack of contingency
39
What is the complex version of the law of effect?
When a response is followed by a favorable consequence it become more likely that the response will occur again in the context in which it was reinforced
40
What are the likely results of stimulus control?
the amount of generalization depends on how close the stimulus is to the training stimulus. (the pigeon and the different coloured keys.
41
What is stimulus control?
the extent to which stimuli that precede or accompany operant behavior come to affect the rate or probability of that behavior.
42
The more stimulus control the more....
discrimination
43
The less generalisation the more ....
stimulus control
44
What are generalisation gradients?
the slope of the gradient measures stimulus control
45
What is a generalisation test?
After reinforcing responses to a particular stimulus, present several different stimuli, in extinction, and measure responses to each
46
What are incremental gradients?
What if we gave punishment at the original stimulus and then did a gen test? The graph would go the opposite way. the response would increase. Incremental gradients are often flatter than decremental gradients
47
What is discrimination training?
making the context relevant by reinforcing responses in presence of one stimulus (s+) but not in the presence of another (s-). across a cumulative record the responding to each stimulus gets more and more discriminative
48
What is intradimensional discrimination training?
S+ and S- are on the same dimension. e.g red key vs green key
49
What is interdimensional discrimination training?
S+ and S= are not on the same dimension. Each yellow key vs white key with a black vertical line.
50
Is the gradient steeper after single stimulus training or interdimensional training?
After interdimensional training. There is more stimulus control so the gradient is steeper
51
What happens to the gradient after intradimensional training?
there is even more control because there is more attention being given to the dimension thats being tested for. There is a positive peak shift. The most responses are not at S+ but on the opposite side of S+ from S-
52
What is concept formation?
A concept is formed if an animal learns that it should both peck the matching stimulus and shouldnt peck the non matching stimulus
53
What is MTS?
Matching to sample
54
What is DMTS?
delayed matching to sample. Where a delay is introduced between the sample stimulus and the comparison stimuli. accuracy decreases as delay increases. Typical decay curve
55
What is respondant conditioning?
Learning the difference between two stimuli. | aka classical or pavlovian conditioning
56
What is and unconditioned stimulus?
A normal stimulus that elicits a normal unconditioned response
57
What is a neutral stimulus
A stimulus that when paired with an unconditioned stimulus becomes a conditioned stimulus that elicits a conditioned response
58
whats the difference between a multiple schedules and concurrent schedules
multiple schedules have two simple schedules of reinforcement available alternately, one at a time. Concurrent schedules have two simple schedules available simultaneously
59
What is the matching law?
so the percentage of responses on one alternative matches the percentage of reinforcers from the alternative.