Perception 5: Multisensory perception Flashcards

(31 cards)

1
Q

In our perception of something using multiple senses, we seem to be quite visually dominant.
Rock and Victor (1964) investigated this
Completed either condition:
Subjects felt an object from underneath without seeing own hand (no vision)
Subjects visual impression of the size and shape of the object is distorted by a lens (no haptics)
Subjects underwent vision and haptics conditions
Subjects had to give their impression of the size of the square

What did they find?

A

Subjects’ impression was dominated by visual input (regardless of response type)

Only 1 in 5 subjects reported awareness of the conflict

Lens distortion dominated over correct haptic information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the ventriloquist illusion?

A

Vision stronger than hearing as it looks like speech is coming from puppets mouth

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What does hearing allow us to do, unlike touch?

A

Localise an object in extrapersonal space

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Jackson (1953) tested whether our estimate of an object’s location is dominated by vision or hearing.

Two types of stimulus combinations:
E1. Ecologically meaningless (bells and lights)
E2. Ecologically meaningful (visual impression of steam rising and sound of kettle whistling)

Asked where sound came from
What were results?

A

Strong visual capture of auditory localisation
Less capture with increasing spatial congruence (more on this later)
More capture with ecologically meaningful combinations

Visual dominance effect - only when there is small difference between auditory and visual stimulus location
E1 = bells and lights - lower response to visual (capture) as this was a less meaningful combination than kettle and whistle

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the McGurk effect?

A

Effect of vision on heard speech - seeing mouth move in different ways can make us perceive the same sound differently

Is interesting because the resulting perception is a novel composite resulting from fusion
e.g hearing /b/ and seeing /g/ results in perceiving /d/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What does Posner et al. (1976) suggest sensory dominance is the result of?

A

An adaptive attentional mechanism
=> vision is less capable of exogenous orienting (involuntary attention to new stimulus) over other modalities (much harder to startle someone with visual stimulus than with auditory stimulus)
=> humans preferentially attend to vision as a result - more reliable?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

In what two possible scenarios might hearing dominate over vision?

A
  1. Resolving ambiguous visual events
  2. Resolving temporal differences

Vision is sluggish - photoreceptor chemical changes are slower than vibrations in ear, auditory signals also get to brain and are processed more quickly

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Sekuler et al. (1997) - bouncing balls illusion:
Objects either cross over each other or bounce off each other - adding click sound makes it seem like they bounce, don’t pass through (auditory perception dominates)
What happens in sound offset version? (Playing sound until visual coincidence period, then sound stops)

A

If sound plays at or just before the balls meet in the middle, they will judge them to be bouncing off each other
If sound plays just after balls meet in the middle, they judge them to be passing through - no effect

If sound is playing up until balls meet, and then stops while they meet, then continues, there is no effect - shows it is due to multisensory perception and not change in attention

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a criticism of tasks aimed to test visual dominance?

A

Have we just found visual dominance because tasks target vision better than audition?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How did Shams et al (2000) demonstrate the first evidence of auditory dominance over vision by considering temporal properties of a stimulus rather than spatial?

A

When a flash is shown with two beeps, people perceive this as two flashes
However, when one beep or no beeps accompany one flash, people perceive this correctly as one flash

Number of flashes perceived can only be increased by auditory stimulus, not decreased

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the discontinuity hypothesis? (Shimojo and Shams, 2001)

A

“the modality that carries a signal which is more discontinuous (and hence more salient) becomes the influential or modulating modality”

Would explain why two beeps influence perception of the single flash

This appears to have some effect but can’t explain all cases of visual dominance or other forms of multisensory integration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the modality appropriateness hypothesis? (Welch and Warren, 1980)

A

In spatial tasks, vision dominates.
In temporal tasks, hearing dominates.

Other similar theories considered modality precision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Stein studied neurons in the superior colliculus because it is small, easy to map out (receives inputs from retina), and is known to be involved in spatial orienting.
Studied cat’s response to bird sound vs sight of bird
What did they find?

A

Superficial layers were almost exclusively visual neurons
Deep layers had diverse preferences - multisensory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are the three laws of multisensory neuron behaviour?

A

Law 1: Super-additivity
Law 2: Law of inverse effectiveness
Law 3: Spatiotemporal coincidence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the law of super-additivity?

A

Neural response to combined visual + auditory stimulation is greater than sum of unisensory responses

But super-additivity is dependent on some conditions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the law of inverse effectiveness?

A

The degree of the additive response is inversely related to the strength of the unisensory responses

Weak unisensory cues are more effective as multisensory ones
With optimal unisensory cues, multisensory responses can be sub-additive

17
Q

What is the law of spatio-temporal coincidence?

A

The additive response also depends on spatial and temporal congruity factors…

Response enhancement when visual and auditory stimuli places within respective receptive fields

Response depression when auditory stimulus placed outside of receptive field

18
Q

How does spatial coincidence affect mutlisensory perception?

A

Only have multisensory effects when events are perceived to be occuring in same region of space

19
Q

How does temporal coincidence affect multisensory perception?

A

Greatest multisensory response when auditory stimulus is later (for short distances) - as audition is processed faster, however opposite for long distances as light travels faster than sound

20
Q

Stein et al (1989)
Cat orients to V+A stimuli to get food reward
Unisensory stimuli are near threshold - weak so it is difficult to tell whether they are in a different place
They found that performance with both cues together was better than each cue alone - a multisensory effect - why was this?
(They followed up with another experiment)

A

Cat orients to V stimuli (ignoring A) to get food reward when V and A stimuli spatially disparate (incongruent)
Using V results in correct orienting
Using V and A results in less correct orienting

Suggests spatial coincidence effect - only superadditivity (better performance due to multisensory perception) when stimuli are in same spatial receptive fields

21
Q

Risberg & Lubker (1978) – visual and auditory speech perception shows large super additivity in perception.
How did their experiment show this?

A

1 - weak sound of speech
2 - weak visual stimulus (lip reading)
3 - combined = super-additive effect - most correctly perceived keywords

22
Q

Calvert et al (2000)
Participants see and hear a person reading “1984” by George Orwell
Separate periods of V, A, and VA
fMRI BOLD response recorded
What did they find?

A

Calvert et al (2000) found super and sub additive responses to visual/auditory speech perception in superior temporal sulcus
Superadditive when there was audio-visual congruence
Sub additive when they were incongruent

23
Q

Ernst and Banks (2002) transformed the way we think of multisensory integration:

Maximum likelihood integration

Their findings imply that humans integrate visual and other cues in a statistically optimal way

-> Each cue is weighted based on its reliability (or precision)
-> Does not require a high-level “appropriateness” explanation

What was the task they used?

A

Use a 2 interval forced choice task:

See each stimulus one at a time - could ask which one was bigger
Often take one of the two to be the standard stimulus - always same size throughout experiment
Comparison stimulus can be a range of different sizes

When asking someone which of the two is bigger, you are asking whether comparison or standard is bigger:

If someone is worse at the task, they should still 30% of the time think comparison is bigger when comparison size is 2 and standard size is 6 - cumulative distribution function would flatten out

If someone is better at the task, they should 0% of the time think comparison is bigger when comparison size is 2 and standard size is 6 - cumulative distribution function stretches out (taller)

If someone is biased - their standard size perception is shifted, so whole curve should shift

24
Q

Our sensory estimate of some physical property varies due to internal noise. What is this noise assumed to be?

A

Normally distributed

25
If the standard and comparison are the same size, what is the probability that you will say the comparison is bigger than the standard? (Forced choice - have to say one is bigger)
50%
26
If the standard is bigger than the comparison, what is the probability that you will say the comparison is bigger than the standard? (Forced choice - have to say one is bigger)
Low
27
If the comparison is bigger than the standard, what is the probability that you will say the comparison is bigger than the standard? (Forced choice - have to say one is bigger)
High
28
What makes a normal distribution curve slope flatten out?
The slope of the curve decreases with more noise (i.e. a “fatter” distribution, or greater standard deviation)
29
Ernst and Banks tested whether humans combine cues by weighting them based on the precision of their relative sensory estimates. What does this mean and how did they test this?
In other words, based on the sigma (standard deviation) of the underlying noise distribution i.e. a less steep distribution curve suggests less precise sensory estimate, so this would be less likely to be combined in multisensory perception They test this using the maximum likelihood estimation rule (a statistical concept) “the optimal means of estimation (in the sense of producing the lowest-variance estimate) is to add the sensor estimates weighted by their normalized reciprocal variances”
30
How does psychometric function change based on whether visual or haptic cues possess more precise information?
Psychometric function based on two normal distributions = correct because haptic and visual cues suggest the same thing Psychometric function with one steep and one flat distribution = visual has more precise info than haptic - shifts psychometric function to the right and is steeper
31
2IFC task – “which bar was higher?” The bar was virtually simulated in one of two modalities Visual (using binocular disparities) Haptics (using force feedback device) When there is discrepancy, we expect vision to “dominate”… … but what if vision is degraded to be much less reliable?
With increasing visual noise, judgment of object height is biased towards haptic height (haptic capture) This observed weighting of the two cues closely follows those predicted based on MLE weights In accordance with MLE, sensitivity is greater (threshold is lower) with both cues combined than either alone The brain does not simply select the “most appropriate” modality It combines sensory information from different modalities into a single estimate in order to reduce variability (increase precision)