vision- theories of visual processing Flashcards

Question

state a problem with seeing in 3D

Answer 1

translating a 3d environment onto a 2d surface you lose information An infinite number of shapes could give rise to the same retinal image Distance itself is invisible

Answer 2

accommodation vergence eye movements binocular disparity

Answer 3

How far do you have to distort the lens of the eye to focus on the object Ciliary muscles contract-> lens changes shape Codes distances not depth Only forks up to 2m

Answer 4

compare disparity in angle of eyes when focused on an object to discern depth Only useful to about 6m or less Berkeley discounted as a viable source of information

Answer 5

compare difference between two images on each retina to gage depth codes depth not distance 3D red green glasses ar an example of how this disparity can be tricked to generate perception of 3D Must be scaled by viewing distance to be interpreted correctly

Answer 6

Pictorial cues Interposition(occlusion)- nearer things preclude things further away Shadow- shape from shading we expect light to come from above hence flipping this flips our perception. Cast shadows also provide a cue for height, Aerial perspective- texture gradients height relative to horison

Answer 7

parallel lines convergence

Answer 8

Insufficient information in image Therefore we must make inferences about our environment to perceive it We use rules of thumb- heuristics to attain information from the image

Answer 9

No ambiguity There is more than enough information for direct pick up One of the richest sources of information is movement and the changing image to the observer Referred to as optic flow - J.J.Gibson

Answer 10

With movement close objects exhibit greater relative image change than distanced objects. Gradient of motion vectors that reside into the distance Expansion or contraction- works the same as motion parallax

Answer 11

Rotation of an object enables decoding of shape. because of occlusions shadow ect

Answer 12

Change in a potentially 3d object enables us to infer shape

Answer 13

Extero-specific information (about 3D structure of the environment, objects and layout) Proprio-specific information (about our own movements within the world) this can be demonstrated by The swinging room(lee) the room created distorted version of optic flow- observers would then compensate by moving with the room.

Answer 14

Starting point in vision is optic array, spatial pattern of light movements - transforming optic array Identify invariants in field of view Perception is direct Affordances is the end product of the visual system how does the thing relate to the person what can be don with it

Answer 15

Optic ataxia(parietal cortex damage)- able to understand orientation of objects but not adjust movement to orientation. trouble interacting with objects

Answer 16

Visual agnosia(occipital cortex damage)- unable to describe orientation of object(cannot recognize object) but able to interact correctly with it

Answer 17

These two conditions seem to imply separate processing streams for perception and action

Answer 18

Psychological: Psychophysics techniques for systematically probing the characteristics of our senses e.g. measuring thresholds webber Phenomenology e.g. illusions and aftereffects (Lectures 4–6) - Mach, Hering Physiological: Single cell recording e.g. Hubel & Wiesel (Lecture 2) Computational Modeling of underlying process

Answer 19

Step Computational goal (“Make explicit …”) Grey-level representation … light intensities Raw primal sketch … intensity changes Full primal sketch … contours, boundaries 2½ D sketch … surfaces and their orientations 3D representation … 3 structure in object-centered coordinates

Answer 20

The visual image can represented by a set of parameters • image = f(x, y, I, λ, t) everything we see can be represented explicitly as some mathematical combination of x, y, I, λ, t ( and d, in binocular vision) e.g. an “edge” is computed as dI/dx

Answer 21

as a zero crossing sensor effectively taking the second derivative of the boundary mathematically this is known as a Laplacian filter, 2, or Mexican Hat filter, or DOG (difference of gaussians) filter

Answer 22

* pragnanz - the law of simplicity * Similarity - group similar objects together * Good continuation - join the dots by the smoothest path * Proximity - group near things together * Common region - group objects in the same region of space together * Uniform connectedness * Synchrony - simultaneous events perceived as belonging together * Common fate - things moving in the same direction belong together * Meaningfulness/familiarity - things that form recognizable patterns are grouped together

Answer 23

break things down into general shapes we perceive as consistent however responses of 'face' neurons exhibit nothing like this in macaque monkeys

Answer 24

Marr’s model is bottom-up i.e. all the information needed is present in the grey-level image bottom-up strategy doesn’t work computer scientists have found it impossible to implement without using feedback from later stages in the model to guide processes in earlier stages every level in the visual hierarchy in the brain projects back to the area form which it receives information (and sometimes lower ones too) Marr’s model is far too simple