Learning: Operant Conditioning Flashcards

1
Q

If classical conditioning is learning through pairing,
then operant conditioning is learning through __________

A

If classical conditioning is learning through pairing,
then operant conditioning is learning through consequence

2
Q

Who was the 'founding father' of operant conditioning?

A

Skinner (he originally carried out the research on animals, but his conclusions were applied to humans by behavioural psychologists)

3
Q

What are the three core principles of operant conditioning?

A

Antecedent -> Behaviour -> Consequence

4
Q

If you want to change the ____________, you must change the ____________ (what already happens) or the ____________. It is much easier to change the ____________

A

If you want to change the behaviour, you must change the antecedent (what already happens) or the consequences. It is much easier to change the consequences

5
Q

What is reinforcement?

A

Reinforcement is when the desired behaviour is rewarded, making it more likely to be repeated

6
Q

What is positive reinforcement?

A

Positive reinforcement rewards desired behaviour by adding something pleasant such as food, affection or money

7
Q

What is negative reinforcement?

A

Negative reinforcement rewards the desired behaviour by removing something unpleasant e.g. taking away pain or distress, stopping criticism

8
Q

What is punishment?

A

Punishment is when undesirable behaviour produces unpleasant consequences

9
Q

What is positive punishment?

A

Positive punishment is when undesirable behaviour is punished by adding something unpleasant e.g. an electric shock or criticism

10
Q

What is negative punishment?

A

Negative punishment is when undesirable behaviour is punished by removing something pleasant e.g. deducting money, taking away an Xbox
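
The four definitions above form a two-by-two grid: a stimulus is either added (positive) or removed (negative), and it is either pleasant or unpleasant. A minimal Python sketch of that grid follows; the function name `classify_consequence` and its boolean parameters are illustrative assumptions for revision, not standard terminology.

```python
def classify_consequence(stimulus_added: bool, stimulus_pleasant: bool) -> str:
    """Name the operant-conditioning consequence for one stimulus change.

    Hypothetical helper for revision purposes only.
    """
    # "Positive" means something is added; "negative" means something is removed.
    sign = "positive" if stimulus_added else "negative"
    # Adding something pleasant, or removing something unpleasant, rewards the
    # behaviour (reinforcement). The other two combinations punish it.
    rewarding = stimulus_added == stimulus_pleasant
    kind = "reinforcement" if rewarding else "punishment"
    return f"{sign} {kind}"


print(classify_consequence(True, True))    # giving food
print(classify_consequence(False, False))  # stopping criticism
print(classify_consequence(True, False))   # giving an electric shock
print(classify_consequence(False, True))   # taking away an Xbox
```

Reading the grid this way helps avoid the common mistake of treating "negative" as meaning "bad": it only means that something is taken away.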

11
Q

Skinner found ____________ to be less effective at changing behaviour than ____________

A

Skinner found punishment to be less effective at changing behaviour than reinforcement

12
Q

Effective conditioning must be C and C

A

Effective conditioning must be Contingent and Contiguous

13
Q

What does contingent mean?

A

Contingent means that there is a clear link between the behaviour and the consequence it produces e.g. individuals know exactly what they are being rewarded or punished for

14
Q

What does contiguous mean?

A

Contiguous means that the consequence follows soon after the behaviour e.g. if the delay is too long, the conditioning/ association is weakened

15
Q

Skinner's research deduced that _____________ shapes behaviour better than _____________, and that _____________ reinforcement shapes it better than _____________ reinforcement

A

Skinner's research deduced that reinforcement shapes behaviour better than punishment, and that positive reinforcement shapes it better than negative reinforcement

16
Q

What is the best method of operant conditioning?

A

Positive reinforcement

17
Q

What is primary reinforcement/ a primary reinforcer?

A

Necessary or required items such as food or water

18
Q

What is secondary reinforcement/ a secondary reinforcer?

A

Desired items such as money or a favourite toy

19
Q

What is behaviour shaping?

A

Controlling or moulding behaviour

20
Q

What is a technique commonly used within behaviour shaping?

A

Successive Approximation

21
Q

What is successive approximation?

A

Rewarding behaviours that approximate the final form of the desired behaviour. Behaviour that moves in the direction of the desired one is rewarded, so it gradually becomes closer and closer to the goal

22
Q

What is an example of successive approximation?

A

Skinner's Pigeon Experiment
* Pigeons were rewarded initially for just slightly turning their heads in the correct direction
* Successively and slowly, the pigeon started turning more and more, getting closer to the desired 360° spin each time
* The pigeon was eventually conditioned to respond to the stimulus by rotating the full 360°; it acquired this through behaviour shaping and successive approximation, learning bit by bit
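
The shaping steps above can be sketched as a rising reward criterion: each time the animal reliably meets the current criterion, the bar is raised, until only the full behaviour earns a reward. In this minimal Python sketch, the function names `shaping_criteria` and `earns_reward` and the 45° step size are illustrative assumptions, not values from Skinner's experiment.

```python
def shaping_criteria(target_degrees=360, step_degrees=45):
    """Yield successive reward criteria that approach the target turn.

    Hypothetical helper: the step size is an arbitrary illustration.
    """
    criterion = step_degrees
    while criterion < target_degrees:
        yield criterion
        criterion += step_degrees
    # The final criterion is always the full desired behaviour.
    yield target_degrees


def earns_reward(turn_degrees, criterion_degrees):
    """A turn is rewarded only if it meets the current criterion."""
    return turn_degrees >= criterion_degrees


for criterion in shaping_criteria():
    print(f"Reward only turns of at least {criterion} degrees")
```

A turn that earned a reward early on (for example 50° against a 45° criterion) stops being rewarded once the criterion has moved past it, which is what pushes the behaviour toward the full spin.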

23
Q

What is reinforcement/ punishment at a continuous schedule?

A

When a given behaviour is reinforced/ punished every time it occurs

24
Q

What is reinforcement/ punishment at a fixed ratio schedule?

A

When a given behaviour produces reinforcement/ punishment only after a certain number of responses have been performed

BUT, with…

'Fixed' meaning that the number of responses required is the same each time

25
Q

What is reinforcement/ punishment at a **fixed interval** schedule?

A

When a given behaviour produces reinforcement/ punishment only after a **certain amount of time** has passed

BUT, with…

'**Fixed**' meaning that the **required amount of time** is the **same** each time

26
Q

What is reinforcement/ punishment at a **variable ratio** schedule?

A

When a given behaviour produces reinforcement/ punishment only after a **certain number of responses** have been performed

BUT, with…

'**Variable**' meaning that the number of **responses required** is **random** each time

27
Q

What is reinforcement/ punishment at a **variable interval** schedule?

A

When a given behaviour produces reinforcement/ punishment only after a **certain amount of time** has passed

BUT, with…

'**Variable**' meaning that the **required amount of time** is **random** each time

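
The four partial schedules differ on two axes: what is counted (responses for ratio, elapsed time for interval) and whether the requirement stays the same (fixed) or changes unpredictably (variable). The following is a minimal Python sketch under stated assumptions: the function `reinforced_times` and its parameters are hypothetical, and a "variable" requirement is drawn uniformly from 1 to `2 * requirement - 1` so that it averages `requirement`. That distribution is a simplification, not something the cards specify.

```python
import random


def reinforced_times(response_times, basis, kind, requirement, seed=0):
    """Return the times of the responses that earn the consequence.

    response_times: increasing times at which the behaviour occurs
    basis: "ratio" (count responses) or "interval" (count elapsed time)
    kind:  "fixed" (same requirement each time) or "variable" (random each time)
    requirement: integer requirement, or its average when kind is "variable"
    """
    rng = random.Random(seed)

    def next_requirement():
        if kind == "fixed":
            return requirement
        # Assumed distribution: uniform on 1 .. 2*requirement - 1.
        return rng.randint(1, 2 * requirement - 1)

    reinforced = []
    needed = next_requirement()
    count = 0   # responses since the last consequence
    last = 0    # time of the last consequence (or session start)
    for t in response_times:
        count += 1
        progress = count if basis == "ratio" else t - last
        if progress >= needed:
            reinforced.append(t)
            count = 0
            last = t
            needed = next_requirement()
    return reinforced


bursts = [1, 2, 3, 10, 11, 12, 20, 21, 22]
print(reinforced_times(bursts, "ratio", "fixed", 3))     # [3, 12, 22]
print(reinforced_times(bursts, "interval", "fixed", 5))  # [10, 20]
print(reinforced_times(bursts, "ratio", "variable", 3))
print(reinforced_times(bursts, "interval", "variable", 5))
```

With the fixed schedules the learner can predict exactly when the next consequence is due; with the variable ones any response might be the one that pays off, because the requirement is redrawn after every consequence.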
28
Q

For **Fixed Ratio**, learning is ________ and extinction is ________

A

For Fixed Ratio, learning is **fast** and extinction is **moderate**

29
Q

For **Variable Ratio**, learning is ________ and extinction is ________

A

For Variable Ratio, learning is **fast** and extinction is **slow**

30
Q

For **Fixed Interval**, learning is ________ and extinction is ________

A

For Fixed Interval, learning is **medium** and extinction is **medium**

31
Q

For **Variable Interval**, learning is ________ and extinction is ________

A

For Variable Interval, learning is **fast** and extinction is **slow**

32
Q

The schedules of reinforcement/ punishment that have ________ learning, but with ____________ ________ are the best

A

The schedules of reinforcement/ punishment that have **fast** learning, but with **inconsistent** **rewards** are the best

33
Q

What is the application of schedules of reinforcement/ punishment?

A

**Video Games**
* They utilise the most effective schedules (**Variable Ratio and Variable Interval**) to maintain or increase attention to games and the desire to continue
* e.g. Maze Runner uses **Variable Interval** through **parachute prizes dropping down at random times**, keeping people engaged and motivated to play in the expectation that they 'might get one soon'

34
Q

What was the Skinner Box experiment?

A

**One of Skinner's experiments used to showcase Operant Conditioning**
* A rat was placed in a 'Skinner box'; if it pressed the lever, the light would come on and a food pellet would roll down the chute (**Positive reinforcement**)
* Initially, the rat's learning would have started by hitting the lever **accidentally**
* Slowly, the rat learnt how to get the food (a primary reinforcer) and would therefore press the lever and get the light to flash

35
Q

Outline one of the variations of the Skinner Box experiment

A

* An additional variation of the Skinner box included an **electric grid** at the bottom that was continuously on
* In this variation, the **lever would turn off the electric current for 30 seconds**
* This is **Negative Reinforcement** as it is taking away something unpleasant

Skinner found that in this variation, the rats would learn to press the lever to turn the grid off, but **not as quickly** as the rats that were **positively reinforced** by food **(EVIDENCE FOR POSITIVE REINFORCEMENT BEING THE BEST METHOD)**