speech perception Flashcards

(22 cards)

1
Q

Phoneme

A

The smallest unit of speech; changing a phoneme can change the meaning of a word

English has 37 phonemes
13 vowels and 24 consonants

The number of Phonemes vary between languages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

The acoustic signal for speech

A

Speech sounds are produced by the air pushed up against the lungs through the vocal cords and into the vocal tract

Some parts are fixed (e.g nasal cavity, hard palate)

Other parts can move, such as the vocal cords and the articulators (the tongue, lips, jaw, and soft palate)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The acoustic signal for vowels

formants

A

Vowel sounds are produced by vibration of the vocal cords, changing the shape of the vocal tract.

Formants: the frequencies corresponding to the peaks in a vowel sounds pressure wave

The first formant is the lowest frequency the second is the next highest and so on…

simple explaination- Formants are the special peak frequencies that make each vowel sound unique—like a vocal ID card When you say “ah” vs. “ee,” your mouth shapes these peaks differently.

1st Formant (F1): Lowest frequency peak → Tied to mouth openness.

High F1 = “ah” (open mouth, like “father”).

Low F1 = “oo” (small mouth, like “food”).

2nd Formant (F2): Next peak → Tied to tongue position.

High F2 = “ee” (tongue forward, like “see”).

Low F2 = “aw” (tongue back, like “dog”).

Higher Formants (F3, F4…)

Fine-tune sounds further (e.g., distinguishing “er” vs. “uh”).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

resonant frequency in vowels

A

Changes in the shape of the vocal tract produce different resonant frequencies; each vowel has a different resonant frequency signature

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Sound spectrograms

A

show the changes in frequency and intensity over time for speech

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

The acoustic signal for consonants

A

Consonants are produced by constrictions of the vocal tract

making consonant sounds requires substantial stereotypical movements of the articulators, with each consonant having a signature movement

The vocal cords are largely silent for consonant sounds, they are used primarily for vowels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Formant transitions

A

rapid changes in frequency preceding or following consonant sounds

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

The segmentation problem

A

There are no physical breaks in the continuous acoustic speech signal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The variablity problem

A

there is no simple correspondence between the acoustic signal and the individual phonemes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

variability from a phoneme’s context

A

The acoustic signal is associated with phoneme changes depending on its context

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

coarticulation

A

The blending of neighboring sounds during speech

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Variability from different speakers

A

Different people speak differently
fast/slow
high-pitched/low-pitched
accent/ no accent
clear/ sloppy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Categorical speech perception

A

The mapping of a range of acoustic signals onto the perception of a limited number of sound categories

Your brain lumps similar speech sounds into distinct categories

Listeners do not hear the incremental changes between sounds, instead they hear sudden changes at phonetic boundaries

we experience perceptual constancy for the phonemes within a given voice onset time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

voice onset time

A

The delay between when a sound begins and when the vocal cords start to vibrate (aka Voicing)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Elimas and Corbit (voice onset time)

A

Used a computer to create a range of “da” and ‘ta” sounds with VOTs varying from short to long

at VOTs <35msec subjects perceive “da” at >40msec subjects precieve “ta”

Phonetic boundary- the VOT when perception transitions from “da” to “ta”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Speech perception is multimodal

A

Perception of speech is influenced by information from more than one sense

16
Q

The McGurk effect

A

Auditory stimulus has a speaker saying “ba-ba”

Visual stimulus has a speaker saying “ga-ga”

observer watching and listening hears “da-da” which is the midpoint between “ga” and “ba”

observer with eyes closed will hear “ba”

17
Q

The effect of meaning on speech perception

A

Top-down processing- including knowledge a listener has about a language, affects the perception of the incoming stimulus

Phoneme interpretation is affected by context and meaning

18
Q

Turvey and Van Gelder- speech perception

A

presented short words (sin, bat, and leg) and short non words (jum, baf, and teg) to listeners

The task was to press a button as quickly as possible when they heard the target phoneme

on average, listeners were faster with words than non words

actual words were recognized, enabling top down factors to speed phoneme detection

19
Q

Warren speech perception

A

presented listeners with a sentence that had a phoneme covered by a cough

The task was to state where in the sentence the cough occurred

Listeners could not correctly identify the position and even didn’t notice that a phoneme was missing

20
Q

Phoneme restoration effect

A

A phoneme that is missing from the auditory signal gets restored by top-down knowledge

21
Q

Experience-dependent plasticity Kuhl

A

Before age 1, human infants can tell the difference between all the sounds that create all languages

But the brain becomes tuned to respond best to speech sounds that are in the environment, and we lose the ability to differentiate sounds that we don’t hear during development

As adults, we are impaired in our ability to perceive and recognize phonemes not used in our native language

An example of categorical perception- as we become experts in recognizing our native language, we lose our ability to recognize others