Speech recognition Flashcards
Acoustic signal
Speech sounds or patterns of pressure changes
Articulators
Structures involved in speech production, including lips, teeth, tongue, jaw, soft palate
Sound spectrogram
Graph depicting intensity of speech sound frequencies
Y-axis - frequency
X-axis - time
Intensity indicated by darkness
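A minimal sketch of how a spectrogram can be computed, assuming a short-time Fourier analysis with a Hann window. The function name, frame length, hop size, and the 1000 Hz test tone are illustrative assumptions, not part of the cards. Each returned frame is one time slice (X-axis), each entry in a frame is one frequency bin (Y-axis), and the magnitude is what would be drawn as darkness.

```python
import cmath
import math


def spectrogram(signal, frame_len=64, hop=32):
    """Return a list of time frames; each frame lists one magnitude per frequency bin."""
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        # Taper the frame with a Hann window to reduce spectral leakage.
        windowed = [
            s * (0.5 - 0.5 * math.cos(2 * math.pi * n / (frame_len - 1)))
            for n, s in enumerate(frame)
        ]
        # Naive discrete Fourier transform, keeping only non-negative frequencies.
        mags = []
        for k in range(frame_len // 2 + 1):
            acc = sum(
                x * cmath.exp(-2j * math.pi * k * n / frame_len)
                for n, x in enumerate(windowed)
            )
            mags.append(abs(acc))
        frames.append(mags)
    return frames


# Hypothetical test signal: a pure 1000 Hz tone sampled at 8000 Hz.
sample_rate = 8000
frame_len = 64
signal = [math.sin(2 * math.pi * 1000 * t / sample_rate) for t in range(256)]

spec = spectrogram(signal, frame_len=frame_len, hop=32)

# The darkest point of the first time slice should sit at the tone's frequency.
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
peak_hz = peak_bin * sample_rate / frame_len
print(peak_hz)  # 1000.0
```

Real speech would show several such intensity peaks per time slice rather than one.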
Formants
Frequencies at which sound intensity peaks occur
Formant transition
Rapid shifts in frequency preceding or following formants
Manner of articulation
How a speech sound is produced by the interaction of articulators
Place of articulation
Locations in the vocal tract where articulation occurs during a speech sound
Phoneme
Smallest unit of sound that, if changed, would change the meaning of a word
What are the three causes of speech acoustic signal variability?
Coarticulation
Sloppy pronunciation
Individual differences
Coarticulation
Overlap between the articulation of neighbouring phonemes
Categorical perception
When stimuli existing along a continuum are perceived as divided into discrete categories
Voice onset time (VOT)
Time delay between when a speech sound begins and when the vocal cords begin vibrating
Phonetic boundary
VOT at which the percept of a sound changes categories (e.g., “da” to “ta” as VOT passes 40 ms)
Multimodal
The involvement of multiple senses in determining speech perception
Categorical perception experiment
Test to discern a sound’s phonetic boundary
Discrimination test
Part of a categorical perception test, in which the VOTs of two stimuli are simultaneously raised until they are on opposite sides of the phonetic boundary
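The phonetic boundary and discrimination cards above can be sketched as an idealized model. It assumes the 40 ms "da"/"ta" boundary from the example, a perfectly sharp category switch, and that two stimuli are told apart only when they fall on opposite sides of the boundary. The function names are hypothetical, and real listeners show a steep but not perfectly sharp transition.

```python
PHONETIC_BOUNDARY_MS = 40  # boundary value taken from the "da"/"ta" example


def perceived_category(vot_ms, boundary_ms=PHONETIC_BOUNDARY_MS):
    """Label a stimulus by which side of the phonetic boundary its VOT falls on."""
    # Assumption: a VOT exactly at the boundary is treated as "ta".
    return "da" if vot_ms < boundary_ms else "ta"


def discriminable(vot_a_ms, vot_b_ms, boundary_ms=PHONETIC_BOUNDARY_MS):
    """Idealized discrimination: stimuli differ only if their categories differ."""
    return perceived_category(vot_a_ms, boundary_ms) != perceived_category(
        vot_b_ms, boundary_ms
    )


# A continuum of VOTs is heard as two discrete categories, not a gradual change.
labels = [perceived_category(vot) for vot in range(0, 81, 20)]
print(labels)  # ['da', 'da', 'ta', 'ta', 'ta']

# Same 20 ms physical difference, different outcomes depending on the boundary.
print(discriminable(30, 50))  # True  (opposite sides of 40 ms)
print(discriminable(50, 70))  # False (both heard as "ta")
```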
McGurk effect/audiovisual speech perception
The effect of visual perception on the perception of speech sound
Phonemic restoration effect
When an obscured phoneme of a word is restored in the perception of the word
Shadowing
Experimental technique in which listeners repeat aloud what they hear through earphones as they hear it
How do listeners decode speech into words and meanings?
Previous knowledge of words and meanings to perceive words in sentences
Speech segmentation via knowing transitional probabilities
Experiential learning (“pop-out” effect) to perceive degraded speech
Speech segmentation
Process of decoding words from continuous acoustic signal (perceiving breaks between continuous words)
Transitional probabilities
The chances that one sound will follow another sound
Statistical learning
The process of learning about transitional probabilities from an early age
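A minimal sketch of how transitional probabilities could be estimated from exposure to a continuous stream and then used for speech segmentation. The nonsense words, the syllable stream, the function names, and the 0.75 threshold are all illustrative assumptions. Within a word each syllable always predicts the next, while across a word boundary the next syllable is less predictable, so a drop in probability marks a likely break.

```python
from collections import Counter


def transitional_probabilities(syllables):
    """Estimate P(next syllable | current syllable) from adjacent pairs."""
    pair_counts = Counter(zip(syllables, syllables[1:]))
    first_counts = Counter(syllables[:-1])  # the last syllable has no successor
    return {(a, b): n / first_counts[a] for (a, b), n in pair_counts.items()}


def segment(syllables, probs, threshold=0.75):
    """Insert a word break wherever the transitional probability is low."""
    words, current = [], [syllables[0]]
    for a, b in zip(syllables, syllables[1:]):
        if probs[(a, b)] < threshold:
            words.append("".join(current))
            current = []
        current.append(b)
    words.append("".join(current))
    return words


# Hypothetical stream of three made-up words with no pauses between them.
stream = (
    "bi da ku pa do ti go la bu pa do ti "
    "bi da ku go la bu bi da ku"
).split()

probs = transitional_probabilities(stream)
print(probs[("bi", "da")])  # 1.0  (within a word)
print(probs[("ku", "pa")])  # 0.5  (across a word boundary)
print(segment(stream, probs))
# ['bidaku', 'padoti', 'golabu', 'padoti', 'bidaku', 'golabu', 'bidaku']
```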
Noise-vocoded speech
Experimental technique in which speech signal is divided into frequency bands and then noise is added to each band