Lecture 4 - Learning Flashcards

Question 1

Q

What is learning for?

Answer

A

Learning allows an organism to exploit and benefit from regularities in the environment.

Question 2

Q

What does a moon halo mean about upcoming rain according to first nations peoples?

Answer

A

The halo and the number of stars within the halo predict rain and the day on which the rain will come respectively.

Question 3

Q

Learning is about … and …?

Answer

A

Learning is about REGULARITY and INVARIANCE.
See bee example.

Question 4

Q

What is the example of the bees waggle dance used in the lecture?

Answer

A

Length of waggle gives an indication of distance, direction of path after the waggle give an indication of where the food is relevant to the sun. So, here only the direction and distance are important for the other bees - i.e. the number or colour of the flowers are irrelevant. This is an example of selective attention in learning in non-human animals.

Question 5

Q

What is counterfactual reasoning?

Question 6

Q

In terms of cues and outcomes, what is delta-P and what does it tell us?

Answer

A

Delta-P is the probability of an outcome given a cue.
Or, delta-P is the measure of the ACTUAL strength of the contingency between a cue and an outcome.
As delta-P approaches 1, the strength of the contingency increases, i.e. the presence of the cue is strongly correlated with the outcome.
As delta-P approaches -1 it tells us that the presence of the cue is likely to indicate the absence of the outcome.
Delta-P of zero indicates that there is no relationship between the cue and the outcome.

Question 7

Q

What experiment did Shanks (1991) do regarding delta-P?

Answer

A

In Shanks (1991) experiment participants were told of symptoms that occur or do not occur with certain made-up diseases.
They were then measured on how strongly they think a given symptom is a predictor of a given disease.
The more they were exposed to symptoms and symptom combinations and the made-up diseases the better they were at stating the actual statistical covariation.

Question 8

Q

What is the PROBABILISTIC CONTRAST MODEL and what is its relevance to calculating delta-P?

Answer

A

The probabilistic contrast model was proposed by Cheng and Holyoak.
This model says that we should only calculate delta-p when the cue and outcome and not being interfered with by other factors, i.e. when the background is constant.

Question 9

Q

What are some of the mechanisms involved in learning discussed in lecture?

Answer

A

Co-occurrence, delta rule, learned selective attention, and belief updating.

Question 10

Q

What is the blocking paradigm?

Answer

A

Learning isn’t just a reflection of co-occurance (see example of mice and red, blue lights and juice).

Question 11

Q

How does the experiment of the mouse, lights and juice illustrate the Blocking Paradigm?

Question 12

Q

What is the Delta Rule?

Answer

A

The Delta-Rule suggests that the learning is error-driven updating of associations.

Question 13

Q

What is the Delta Rule Explanation of Blocking?

Question 14

Q

What are some of the criticisms of the Delta-Rule?

Answer

A

The Delta-Rule cannot predict the Highlighting Effect.
The Delta-Rule also does not predict how quickly people learn associations.
The Delta-Rule cannot explain the Retrospective Revaluation.

Question 15

Q

What is the Highlighting Effect?

Answer

A

The Highlighting effect occurs in contingency learning in the following example:
A rat is trained to associate red light and bell with food. Then they are trained that red light and alarm lead to juice.
When they are presented with the bell and the alarm the rats go to the juice, even though they have had twice as much exposure to the bell being associated with food.
Because the red light is already associated with food, during the training phase it is the alarm that is learned to be the predictor of the juice regardless of the other cue and therefore when presented with the bell and alarm, the rat chooses the juice.

Question 16

Q

What is Retrospective Revaluation?

Answer

A

Seen in the “blicket” experiment, where a child is able to learn an association and then quickly update that association when presented with new data that contradicts or is not aligned with their prior associations.

Question 17

Q

What is the Attention Account of Highlighting and Blocking Effect?

Question 18

Q

Learning is a balance of belief and data.

Question 19

Q

Can Belief Updating explain Retrospective Revaluation?

Question 20

Q

Is human associative learning sensitive to the degree of statistical covariation of events?
In other words, are the association strengths between cues and outcomes reflective of the actual statistical covariation of cues and outcomes?

Answer

A

Yes, to some degree.
The more trials and exposure to cues and outcomes the closer people’s perceived strength of association between a given cue and outcome is to the actual statistical covariation/correlation.

Question 21

Q

When the background is variable, why is delta-P unreliable?

Answer

A

Because when the background is variable delta-P misestimates the strength of a correlation.
That is why we need to use the probabilistic contrast model and ensure the background is constant.

Question 22

Q

What is the experiment of the rat in the Skinner box with the lights, sounds and food/juice?
And how is this an example of the BLOCKING PARADIGM?

Answer

A

The rat was first taught that a red light was associated with food.
Then the rat was exposed to the red light and a bell and got food.
Then the rat was exposed to blue light and an alarm and got some juice.
They then exposed the rat to the blue light and the bell adn overwhelmingly the rat would go to get the juice.
If co-occurence was all that was involved in us learning contingencies or associations then the rat should go to the food and juice in a 50:50 split.
This is known as the blocking paradigm, where the red light and food association essentially blocks the learning of the association betweem the bell and the food and so when presented with blue ight and bell then the rat just uses the learned association of the blue light and the juice.

Question 23

Q

What is the DELTA RULE?

Answer

A

The Delta-Rule is used to described how we learn contingencies, or correlations/associations between cues and outcomes.
In this rule, learning is seen to be driven by our expectations for a cue and the actual outcome of that cue. If the outcome is a lot different to our expectation then then there is a stronger and quicker association formed between the cue and the outcome. e.g. when you first see a stove and touch it and it is hot.
delta V = gamma * (what actually happens - what you think will happen).
Gamma is the rate of learning.
Called the delta rule because it refers to the difference between what we expect and what occurs.
The difference between what actually happened and what we expected to happen (multiplied by the rate at which we learn the association) gives us a REWARD PREDICTION ERROR SIGNAL.
The bigger the reward predicition error signal the quicker we learn and the stronger the association we learn between a cue and an outcome.

Question 24

Q

What is the RESCORLA-WAGNER model?

Answer

A

The Rescorla-Wagner model is model that attempts to predict the learning seen in classical conditioning between unconditioned and conditioned stimuli, or cues and outcomes.
An example of the delta-P rule.

Question 25

Q

How does the Delta-Rule explain the blocking effect?

Answer

A

When we have learned about a cue and an outcome, then when another cue is added but the conditioned cue remains then the reward prediction error is not very big and we therefore do not form as strong an association between the added cue and the outcome.

Question 26

Q

Can the delta-rule explain the HIGHLIGHTING EFFECT?

Answer

A

No.
The delta rule cannot explain the highlighting effect.

Question 27

Q

What are some criticisms of the DELTA-RULE of contingency learning?

Answer

A

It cannot explain the highlighting effect.
Cannot explain retrospective revaluation.

Question 28

Q

What is RETROSPECTIVE REVALUATION?

Answer

A

Retrospective revaluation occurs when we update our learned associations/contingencies based on the new data we are presented with.
See the “Blicket” experiment with children.

Question 29

Q

Why can the DELTA-RULE not explain retrospective revaluation?

Answer

A

Because according to the delta-rule, we learn an association and then there is little room for us to learn new cue associations with the outcome if paired with the old cue.
Furthermore, the delta-rule does not account for how quickly we learn new associations or unlearn old associations.

Question 30

Q

Can a combination of the delta-rule and selective attention provide an account for THE HIGHLIGHTING effect?

Question 31

Q

Can Selective Attention explain the BLOCKING EFFECT?

Answer

A

Yes.
Similar to the delta-rule, a Learned Selective Attention account of the Blocking Effect would say that when we learn the association between the cue A and the outcome X, when cue A and B are presented together and lead to X, our limited capacity of attention does not allow for a strong association between B and X to form, as we are already attending significantly to A.

Question 32

Q

Can an the LEARNED SELECTIVE ATTENTION MODEL of contingency learning explain RETROSPECTIVE REVALUATION?

Question 33

Q

What is the idea of BELIEF UPDATING as a way of learning?

Answer

A

Belief updating as a way of learning contingencies is based on the idea that e have expectations of associations in the environment (aka beliefs). We only change our beliefs if we have enough evidence (data) to override our prior expectations/beliefs. So, our learning of new contingencies is dependent on how strong our beliefs are.

Question 34

Q

What can Belief Updating explain about learning contingencies?

Answer

A

The Blocking Effect and Retrospective Revaluation.
But it cannot explain the highlighting effect.

Question 35

Q

What combination of theories discussed explain the blocking effect, the highlighting effect, and retrospective revaluation?

Answer

A

Belief Updating and Learned Selective Attention.

Question 36

Q

What is Bayes Rule?

Answer

A

Bayes Rule states that we update our beliefs based on the data we are presented with.

Question 37

Q

In the RESCORLA-WAGNER model of contingency learning are the strength of cues considered separately?

Answer

A

No.
The strength of each of the cues is summed together.

Brainscape's Knowledge GenomeTM

Lecture 4 - Learning Flashcards

Brainscape's Knowledge Genome^TM