Valve at the top of the trachea that hold vocal cords
Larynx
Why is the development of the larynx/speech risky?
In nonhuman animals and human infants the larymx is high in the troat, allowing eating and nasal breathing. Only in older humans does the larynx is low enough that it allows for vast sound production, but also you can’t breathe while eating at the same time (choking)
According to Ladeforged, how many distinct speech sounds can we make?
600 consonants, 28 vowel sounds
What is the International Phonetic Alphabet?
Refers to every speech sound that appears across the word
Row headers in the IPA (consonant)
Manner of articulation (trilling)
Column header in IPA (consonant)
Place of articulation: where in the mouth is a sound made
What do blank white spots in the IPA mean?
Sounds that could theoretically be made but hasn’t been used in any alphabet
What do gray spots in the IPA mean?
Sounds that cannot be made
Row header in vowel IPA
Toungue’s vertical position row
column headers in IPA (vowels)
Tongue’s horizontal position
What are the amount of phonemes in English?
20 vowels, 24 consonants due to different pronunciations of vowels and combined sounds (such as /th/)
Describe the Spectrograph study
Creates a Spectrogram to see the energy within the sound waves in a sentence such as “I can see you”. Observes the frequency of what is said. We make breaks while making discrete words, and darker bands called formants- carry the weight of the sound even without the other segments
Frequency specific band of energy corresponding ot speech sounds- physical properties of speech revealed through a spectogram
Formant
Coarticulation
Whenever we produce a consonant , our articulators are already being shaped to produce the following vowel
Why did it take so long after the creation of formants to create a speech synthesizers?
Formants change location due to coarticulation, as well as shape- changes depending on what comes after it to make two sounds at once rather than just individual phonemes
Is there 1 to 1 mapping between segnents of the speech stream and individual phonemes?
No, because sounds are made based on what comes next
Speech is processed similarly to other auditory inputs
Auditory theory
Speech is considered special and it is processed using specific motor components of speech production- capitalize on producing speech motors used to interpret what other people are speaking
Motor theory
What are the two theories of perceiving speech?
Auditory theory and motor theory
How is motor theory related to face perception?
Face are treated as a special kind of object by the brain and are perceived differently by the brain, similar to sound in motor theory
What are the main claims of motor theory?
Speech perception is tied to speech production
Speech perception is unique to humans because only humans can produce speech (untrue)
What are the three main sources of evidence for motor theory?
Speech perception is bimodal- not just auditory input but also visual inputs (lip reading)
Speech perception is categorical- we hear speech as discrete sounds, we produce discrete phonemes and we hear them as such rather than continuous (found to be untrue)
Speech perception and production are neurologically similar
We spontaneously integrate visual and auditory input into a unified perception
Bimodal perception
Describe the McGurk Effect
Artificial auditory illusion: hear “ba” and see “ga”, perceive “da”- ambiguity in both forms of input because “ba” and “da” are adjacent in speech, and “da” and “ga” are adjacent in visual input, leads to perceiving ‘’da”.