Speech segmentation Flashcards
(27 cards)
Jusczyk and Aslin (1995) 7.5 months repetitions of sentences containing 2 target words
Tested on individual words. Infants listened longer to words embedded in familiarisation sentences. By 7.5 months, at least some rudimentary ability to detect words when they occur in fluent speech contexts.
6 possible ways to help segment speech stream
- Prosodic cues.
- Phonotactic regularities.
- Allophonic variation.
- Isolated words.
- Transitional probabilities.
- Multiple cues approach.
What are prosodic cues?
Rhythm, stress pattern, and intonation of speech. English has trochaic patten (strong-weak stress) so segmentation through high probability that strong syllable belongs to onset of new word.
Jusczyk et al. (1999) prosodic cues familiarisation to individual strong-weak words
7.5 month infants listened longer to passages containing words they were exposed to (suggests segmentation). Effect not seen for weak-strong words (but is at 10.5 months).
2 critiques of prosodic cues
- Not a perfect cue (is segmentation seem before 7.5 months?).
- Not relevant for all languages.
Phonotactic regularities
There are patterns as to which sounds can co-occur within a language (eg. [br] at beginning of word, [nt] at end, [vzg] not legal in Englsih). Sensitivity could provide reasonable cue to word boundaries.
When do we see evidence of sensitivity to phonotactic regularities?
About 9 months.
Allophonic variation
A phoneme might be pronounced differently at the beginning/ends of words (eg. aspirated vs. non-aspirated).
Jusczyk et al. (1994) allophonic variation ‘nitrates’ vs. ‘night rates’
Infants heard examples of ‘nitrates’ vs. ‘night rates’ embedded in continuous speech. Infants listened longer to sentence containing the word they were familiarised to (seem at 10.5 months but not 9 months).
Big problem in speech segmentation
Need words to segment and need to segment to get words. Infants need to learn what’s relevant for their language before they can use it for segmentation.
Isolated words
Might help with early segmentation. Around 9% of words in IDS occur in isolation and tend to be words learnt first. But this method also identifies phrases as words.
Statistical learning
Discovery of patterns or structure in the input. Learning is implicit (without instruction or rules). Infants highly sensitive to distributional info in input.
Transitional probabilities
Conditional probability of Y given X in sequence XY.
Saffran et al. (1996) familiarised infants to continuous speech of trisyllabic words strung together in random order
8-month infants. Test word vs. non-word vs. part-word (syllable sometimes follows). Looked longer to near novel stimulus (non-word or part-word). Suggests babies are sensitive to transitional probabilities.
Factors influencing direction of preference for novelty vs. familiarisation
- Length of familiarisation.
- Meaningfulness of stimuli.
Need pre-registration.
Gambel and Yang (2004) computer model of English utterances and use TP to identify word boundaries
Only 41% were real English words. 77% of words were missed. TP not a sufficient cue to identify words from continuous speech on its own.
Multiple cues
Use all possible cues (prosody, phonotactic, allophonic, isolated words, TPs). Once some words identified, they can pick up common patterns that mark beginning and ends of words.
Thiessen and Saffran (2003) artificial speech stream with colliding cues
7 months-olds: more attention to TPs to segment speech.
2. 9 month-olds: prioritised stress cues over TPs.
Developmental change.
Another form of conditional probability
Meaning predicting form. Infant regularly hears ‘mummy’ in speech stream and mother is generally present at the same time. Link the 2 together (correlations).
Saffran (2020) statistical learning process in language acquisition
Ability to track probabilistic relationships between different aspects of language. Input-driven approach.
2 challenges for statistical learning
- Do infants actually use it?
- Rules? Could abstractions about patterns be generalised to novel elements. Language is structured, not just statistical.
Graf Estes et al. (2007)
17 month infants. Exposed to continuous speech stream with TP as only clue to word boundaries. Then, habituated to object-label association (objects paired with words and nonwords).
Test: switch or same object-label pair. Switch condition led to longer looking time for words but not nonwords or partwords. Suggests that words treated as linguistic units that can be mapped to meanings.
Marcus et al. (1997) what else is needed in addition to statistical learning?
Need an abstract rule learning mechanism. Necessary to generalise to new exemplars, which is critical to language learning.
Marcus et al. (1999) familiarised to different patterns ABA or ABB and then tested on preferential listening for same or novel structure
Infants show a looking-time preference for novel sentences. Claimed that this is evidence for a rule-learning system that operates alongside statistical learning.