EXAM: Object recognition Flashcards

(12 cards)

1
Q

What are structural description models of object recognition

A

Structural description theories of object recognition propose that we recognize objects by analyzing their structure, breaking them down into simpler parts and then understanding how those parts relate to each other.These theories focus on creating an internal representation of an object’s shape and spatial arrangement, independent of the viewer’s perspective.Essentially, they suggest we build a mental model of an object’s structure and then match that model to a stored representation in memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

INTRO

A
  • Define object recognition and introduce structural description models.
  • State that both Marr (1982) and Biederman (1987) propose that recognition involves analyzing an object’s structure and parts.

Thesis: While both theories share a similar viewpoint-invariant, bottom-up framework, they differ in complexity, volumetric parts used, and empirical support—and both face limitations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

PARAGRAPH 1 - MARR’S COMPUTATIONAL MODEL (1982)

A

Marr proposed a multi-stage, bottom-up process:
1. Primal sketch – detects basic edges and contrast.
2. 2.5D sketch – depth and surface layout, viewpoint-dependent.
3. 3D model representation – full, abstract, viewpoint-invariant model using generalised cylinders.

Influential for integrating neuroscience, computer vision, and psychology.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

PARAGRAPH 2: BIDERMAN’S RECOGNITION BY COMPONENTS MODEL (1987)

A

Proposes that object recognition occurs through detecting and assembling 36 geons—simple 3D shapes like cylinders, cones.

Emphasizes non-accidental properties (e.g., curvature, edges) and viewpoint-invariant recognition.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

PARAGRAPH 3: SIMILARITIES BETWEEN THEORIES

A
  • Both are structural description models with a bottom-up, hierarchical process.
  • Both argue recognition is viewpoint-invariant at the final stage.
  • Use volumetric primitives: geons (Biederman) vs. generalised cylinders (Marr).

*Propose that objects are recognized by matching perceptual input to stored abstract representations in memory.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Key point: how do people use the volumetric parts to recognise objects

A

By matching perceptual input to stored abstract representations in memory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

PARAGRAPH 4: DIFFERENCES BETWEEN THEORIES

A
  • Scope: Marr covers early to late visual processing; Biederman focuses specifically on recognition.
  • Primitives: Marr uses a generalised cylinder; Biederman uses a limited set of geons.
  • Complexity: Marr’s model is computationally intensive, while Biederman’s is simpler and more empirically testable.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

CRITICISMS

A
  • Both theories:

o Overemphasize bottom-up processing, ignoring top-down influences (e.g., expectations, context).

o Do not fully explain recognition for all object classes (e.g., distinguishing dog breeds or faces).

o Focus on static objects, ignoring motion and learning effects.

Evidence from neuroimaging and behavioral studies (e.g., Gauthier & Tarr, 2016; Zimmermann & Eimer, 2013) shows recognition can be both viewpoint-dependent and viewpoint-invariant, depending on task and familiarity.

  • Visual perception is not always accurate—contradicting Marr’s assumption that vision “tells the truth.”
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

CONCLUSION

A
  • Both models provide valuable frameworks for understanding how we recognize objects based on structural parts and spatial relations.
  • Marr offers a comprehensive theoretical foundation, while Biederman presents a practical, testable model.
  • However, a complete theory of object recognition must also include top-down influences, learning, and contextual flexibility not addressed by either model.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Neuroimaging evidence

A

Evidence from neuroimaging and behavioral studies (e.g., Gauthier & Tarr, 2016; Zimmermann & Eimer, 2013) shows recognition can be both viewpoint-dependent and viewpoint-invariant, depending on task and familiarity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Object recognition definition

A

The process of identifying objects by providing labels or names based on visual information.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What pathway is associated with object recognition? Key regions in it?

A

The Ventral Pathway

This runs from the Occipital lobe to the temporal lobe
It is called the “what” pathway

Key brain regions in this pathway include:

Lateral Occipital Complex (LOC): Involved in processing the shape and structure of objects.

Fusiform Gyrus (especially the Fusiform Face Area, FFA)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly