operant conditioning Flashcards

1
Q

how does operant conditioning work

A

learning occurs through a ‘trial and error’ method in which behaviour is shaped by its consequences

2
Q

what was thorndike’s puzzle box

A

cats placed in a box had to learn to open a latch in order to escape and get a reward (food)

3
Q

what is the difference between classical conditioning and operant conditioning

A

in classical conditioning the response is involuntary and in operant conditioning the response is voluntary

4
Q

what does the ABC model stand for

A

antecedent
behaviour
consequence

5
Q

what does antecedent mean

A

a stimulus that triggers a behaviour. in skinner’s experiment, the skinner box would represent the antecedent

6
Q

what does behaviour mean

A

an observable response made by the animal as an outcome of the antecedent

7
Q

what does consequence mean

A

the reward or punishment following the behaviour

8
Q

what is the aim of a punishment

A

to decrease a certain behaviour

9
Q

what is the aim of a reward

A

to increase a certain behaviour

10
Q

what was skinner’s box experiment

A

a rat or a pigeon was placed in a box which was empty apart from a lever
if the rat pressed the lever, a food pellet was released = positive reinforcement, so the rat pressed the lever more often

11
Q

what is positive reinforcement

A

something that increases the likelihood of the behaviour being repeated by providing pleasant consequences

12
Q

what is negative reinforcement

A

something undesired is taken away in response to a desired behaviour, which increases the likelihood of the behaviour being repeated

13
Q

what is positive punishment

A

adding an aversive stimulus to reduce the occurrence of a behaviour

14
Q

what is negative punishment

A

the removal of a liked/desirable stimulus to reduce the occurrence of a behaviour

15
Q

what is the token economy program

A

based on operant conditioning - behaviour is repeated if rewarded and stops if not rewarded
a form of behaviour modification because it aims to change behaviour into behaviour that is required or desired

16
Q

what is the primary reinforcer

A

anything which satisfies a biological need such as food or warmth

17
Q

what is the secondary reinforcer

A

only rewarding because it is associated with a primary reinforcer, e.g. money, which can be used to buy food

18
Q

what are schedules of reinforcement

A

rules that influence how an operant response is learned and maintained. each type of schedule determines how and when a desired behaviour is reinforced

19
Q

what is continuous reinforcement

A

when every instance of the behaviour gets the consequence. not strictly a schedule, as the consequence always happens

20
Q

what is partial reinforcement

A

when the behaviour is reinforced only some of the time, according to a schedule. more resistant to extinction than continuous reinforcement

21
Q

what is fixed interval reinforcement

A

based on the time of the reward. the reward is given after a fixed amount of time. behaviour increases as the time for the reward approaches

22
Q

what is variable interval reinforcement

A

based on the time of the reward. the time between rewards varies. behaviour is reasonably steady as it is not known when the consequence will happen

23
Q

what is fixed ratio reinforcement

A

based on the number of behaviours. the reward/punishment is given after a set number of behaviours

high rate of responses up to the consequence being given, then a period of few responses once the reward has been given

24
Q

what is variable ratio reinforcement

A

the reward is given after an unpredictable number of behaviours. most effective, as it is not known when the consequence will arrive, so behaviour is continuous

25
Q

what is behaviour shaping

A

when shaping, you must first wait for any behaviour that is even vaguely related to the target behaviour to occur. this first behaviour must be reinforced immediately.
once the animal has been conditioned to perform that action, only behaviour that is closer to the target is reinforced.
when this has been conditioned, only behaviour that is closer still is reinforced. the behaviours reinforced along the way are called SUCCESSIVE APPROXIMATIONS
to maintain the desired behaviour once it has been conditioned, intermittent reinforcement should be sufficient