Week 13 Flashcards

(25 cards)

1
Q

What are features of OpenFace

A

▶ Comprehensive Features:
▶ Facial Landmark Detection
▶ Head Pose Estimation
▶ Facial Action Unit Recognition
▶ Eye-Gaze Estimation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is OpenFace

A

▶ A toolkit developed initially at Carnegie Mellon University
▶ Provides real-time analysis of facial behavior
▶ Non-invasive; real-time data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is head pose estimation

A

▶ Measures tilt, rotation, nodding to interpret conversational cues
▶ Application in Communication Studies:
▶ Enhances understanding of non-verbal cues in dialogues

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is facial action unit recognition

A

▶ Identifies and quantifies muscle movements for emotional analysis
▶ Provides insights into emotional underpinnings of speech

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is eye-gaze estimation

A

▶ Provides data on where and what people focus on during interactions
▶ Crucial for understanding how infants associate visual cues with verbal information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a context-sensitive model for speech coordination

A

▶ Sensitive to upcoming context.
▶ Movements are pre-planned to anticipate future articulations.
▶ Key question: How do my articulators usually coordinate the upcoming sequence?
▶ Use of motor plans: stored motion patterns.
▶ Built from:
▶ Motor primitives: smallest planning units (e.g., a single articulator movement).
▶ Motor programs: combinations of motor primitives.
▶ Motor plans: larger structures (phrases, sentences)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are context-invariant models in speech coordination

A

▶ No “look-aheads”
▶ Articulators average together adjacent movements automatically.
▶ Key question: What’s the fastest way to get articulators from here to there?
▶ Example: Task Dynamics approach

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the motor planning and task dynamics unifying theory for the coordination problem?

A

Formal theories propose a combination of the two
▶ They act in two different levels:
▶ Motor program level :
▶ Stores abstract, reusable motor patterns.
▶ Guides the selection and sequencing of articulatory targets.
▶ Responsible for higher-level planning (e.g., syllables, words, phrases).
▶ Motor system level :
▶ Handles real-time execution of motor plans.
▶ Implements the physical dynamics needed to reach specified articulatory targets.
▶ Resolves biomechanical constraints and fine-tunes motion paths.
▶ Together, these levels allow for both pre-planned structure and real-time
adaptability in speech

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are complex sounds

A

▶ Sounds that involve simultaneous use of multiple articulators
▶ Examples: liquids, clicks, nasalized and voiced sounds.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the Task Dynamics approach

A

Task Dynamics
▶ Define a task as movement toward a physical target.
▶ Dynamics: calculate forces needed to get from current to target state.
▶ No need for stored plan; things are computed on the fly.
▶ Pseudoinverse solutions determine efficient paths

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the inverse problem in speech

A

The Inverse Problem in Speech
▶ Definition: Inferring the cause from an observed effect.
▶ In speech research: Given an acoustic signal, determine the articulatory movements
that produced it.
▶ Challenge: Multiple articulatory configurations can yield similar acoustic outputs.
▶ Example:
▶ The vowel [i] can be produced with slightly different tongue and lip positions, but
still sound the same

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the pseudoinverse solution in speech modelling

A

Definition: A mathematical way to find the best possible solution to an inverse
problem when no exact solution exists.
▶ In speech research: Used in task dynamics to compute optimal articulator
trajectories to reach a goal.
▶ Why pseudoinverse?
▶ The vocal tract is redundant — multiple ways to achieve the same sound.
▶ The pseudoinverse finds the most efficient combination of articulatory commands

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the timing problem

A

▶ Many speech sounds involve multiple articulatory movements.
▶ Question: How is timing between movements controlled?
▶ One idea: a central “clock” regulates movement timing.
▶ Known as extrinsic timing.
▶ Problem: No strong evidence for such timing device

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How does task dynamics offer a solution to the timing problem

A

▶ Task Dynamics offers an alternative: intrinsic timing.
▶ Relative movement cycles rather than absolute time.
▶ Phase alignment = timing relationship between gestures.
▶ Languages tend to show recurring global phase patterns

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

what is anatomical coupling

A

▶ The human vocal tract is a connected system of articulators.
▶ Some articulators move together due to physical or muscular linkages.
▶ Anatomical coupling refers to the degree to which two articulators influence each
other’s movement.
▶ Tightly coupled articulators: movements in one often affect the other.
▶ Independent articulators: can move without significantly affecting each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Give examples of tightly coupled articulators

A

▶ Tongue tip and tongue body — shared musculature leads to interdependence.
▶ Lips and jaw — jaw lowering often affects lip position.
▶ Example from the book: A three-legged race is difficult b/c of reduced
independence

17
Q

Give examples of relatively independent articulators

A

▶ Velum — moves independently from most oral articulators.
▶ Larynx — voicing and glottal gestures can co-occur with almost any other
articulation

18
Q

Why are anatomically independent articulators important

A

▶ Ease of Coordination: Independent articulators are easier to coordinate → more
combinations are possible.
▶ Typological Trends: Sounds that combine independent articulators (e.g., nasal +
voiced) are more common across languages.
▶ Learning Difficulty: Sounds requiring tightly coupled articulators (e.g., multiple
lingual gestures) are harder to learn, especially for second-language learners

19
Q

What are lingual-lingual sounds

A

▶ Tongue = jointless hydrostat; complex muscle interactions.
▶ Lingual-lingual sounds require two distinct tongue constrictions.
▶ Due to tight coupling, these sounds are:
▶ Hard to learn.
▶ Relatively rare cross-linguistically.
▶ Focus: Liquids and Clicks

20
Q

Do you think dark /~l/ is harder to learn than light (clear) /l/?
A. Yes
B. No

21
Q

Why is dark l rare in languages

A

Involves more coordination of anterior and posterior tongue gestures, which are
anatomically coupled

22
Q

What is articulatory overlap

A

Overlap = simultaneous movement of different articulators
Common when articulators aren’t anatomically coupled

23
Q

What is articulatory conflict

A

when adjacent speech sounds require anatomically coupled articulators to move in
opposite directions, we get a conflict

24
Q

What are ways to resolve articulatory conflict

A

▶ Deletion: we can delete one of them
▶ Transition: we move the articulator from one position to the other
▶ Compromise: we pick a middle ground between them

25
How do you say “bestbuy" in casual/fast speech? If not, which one applies here? A. Deletion B. Transition C. Compromise
A