Perception 3: Low level hearing Flashcards
(37 cards)
What is the difference between our perception of vision vs sound?
Map of vision, but no map of sound - different sounds converge into one
What are sound waves?
Variations in air pressure
(sound causes vibration in air particles, change in air pressure is picked up by ear)
This is where auditory perception begins
The physical properties of a sound wave determine the perceptual properties
What perceptual properties do amplitude and frequency determine?
Amplitude -> loudness
Frequency -> pitch
However, relationship between frequency and pitch is more complicated than that for amplitude and loudness
More frequent oscillations = higher pitch
What does Fourier’s theorem state?
Any complex sound waveform can be created using a finite number of sinusoids
The lowest frequency present is called the fundamental frequency, or f0 (can also be f sub1)
What are harmonics?
Integer multiples of f0
Sounds vibrate at fundamental frequency but also at harmonic frequencies
Harmonics are found in natural sounds
What are the three stages of how the human brain interprets sound waves?
- Delivering the sound stimulus to the receptor
- Converting the physical stimulus into an electrical signals
- Inferring perceptual qualities (e.g. loudness, pitch) from electrical signals
What do the outer and middle ear structures do?
Translate air vibrations into liquid vibrations of the inner ear
Sound hits ear drum - ossicles vibrate (malleus, incus, stapes)
Vibrate onto the oval window and cochlea
What does the cochlear partition contain? What does this do?
The basilar membrane - this reacts to liquid vibrations in the scala vestibuli and scala tympani
Vibrates up and down in response to liquid vibration
What does the movement of the basilar membrane cause?
Causes cilia (hair cells) to move laterally
What does the corti do?
Translate vibration info into a neural impulse
What direction does the tectorial membrane vibrate? What does this result in in combination with the basilar membrane vibrations?
Left to right
Since basilar membrane vibrates up and down, this causes friction on tips of hair cells - causing lateral movement
Cilia tips move left to right
What happens when hair cells bend in one direction?
Links are stretched, these are connected to ion flow gates, so when they are pulled, these gates open - positively charged potassium ions flow inside of the hair cell - inside of hair cell has positive charge relative to outside which causes depolarisation - releases neurotransmitter
When they bend in other direction - hyperpolarisation
The vibrational stimulus causes oscillations of excitation and inhibition in the hair cells
The basilar membrane responds like a travelling wave. What does this mean about the physical properties of it?
Physical properties of the membrane mean that different places along it respond preferentially to different frequencies
Base is stiff and narrow => high frequencies
Apex is flexible and wide => low frequencies
Sound travels from base to apex like a travelling wave
Can map hz onto basilar membrane
What happens to frequency tuning curves as frequency (hz) increases (higher pitch)? What consequences does this have for perception?
The tuning curves get increasingly wide
So wide that higher frequencies cannot be distinguished from one another as easily as low frequencies
Higher frequencies are more perceptually similar to one another
Is pitch derived from place encoding? (area of basilar membrane that is vibrating the most)
Not quite.
Pitch of a complex sound corresponds to its f0
BUT Removing f0 does not change its perceived pitch
Sound sounds the same without f0 - so perception of pitch cannot just be explained by part of basilar membrane vibrating
Therefore, brain must engage in some kind of pattern recognition - we hear the difference between different frequencies
Is pitch derived from time encoding - what is the temporal theory of pitch?
In addition to frequency specificity, auditory nerve fibers are phase-locked with respect to the stimulus
- action potential in nerve fiber only responds to increase in sound pressure, not decrease - each nerve fiber responds to peaks
This is because action potentials only occur when an increase in sound pressure occurs (i.e. a peak in the waveform)
- Time intervals between action potentials are integer multiples of the period (p) of the waveform
- The population of responses across many fibers convey the sound’s frequency
Why is phase locking in the temporal theory of pitch a favourable explanation?
Phase locking is only reliable up to 5 kHz, AND human pitch perception breaks down above 5 kHz
Ability of action potentials to rise and fall at 5k hz is lost - neurons cannot fire at that frequency - shows this is down to time
Summarise place and time theory of pitch perception:
- Place theory
There is a frequency to place conversion in the cochlea;
The perceived pitch is simply related to the place of maximum response on the basilar membrane - Time theory
The time pattern of neural impulses reflects the frequency of the stimulus (phase locking)
The perceived pitch is related to the time intervals between nerve spikes.
In reality both place- and time-based coding contribute to pitch perception.
In hearing what are sounds from all locations combined into?
A single waveform that the ear detects - superimposed together (no spatial mapping in the cochlear)
On what three dimensions can sound be localised?
1) Azimuth - left/right
2) Elevation - up/down
3) Distance - how far away
Each requires different types of acoustic cue (some monoaural, some binaural)
What is the interaural level difference for detecting azimuth (left or right localisation) of sounds?
- Sounds reaching the contralateral ear (opposite ear to where the sound came from) are attenuated - head has shadowing effect
- Better for higher frequencies
What is the interaural timing difference for detecting azimuth (left or right localisation) of sounds?
- Sounds detected in the contralateral ear arrive later than at the ipsilateral ear due to further travel distance
- Better for lower frequencies
Since the interaural level difference and the interaural timing difference are binaural cues, what do they require?
Central processing in the CNS
Wightman and Kistler (1992)
Artificially manipulated interaural timing differences (ITD) of broadband sounds.
Other cues (e.g. ILD) held constant)
i.e. they simply digitally delayed the arrival of one sound at one ear
Ppts had to locate where sound was coming from
What did results show?
ITD manipulated to be 0 = listeners think sound is at 0 degrees
Then far left and far right clusters for -45 and +45
Subjects become more accurate when low frequencies are removed
This suggests the following dual mechanism:
1) ITDs are dominant when low frequencies are present
2) ILDs are dominant when high frequencies are present