Perception 1: Low level vision Flashcards
(40 cards)
What is the receptive field of a neuron?
Part in visual space where stimulus give rise to stimulating a neuron - receptive field of a neuron
Excite/inhibit ganglion cell
How are ocular dominance columns (right or left eye dominant) arranged in relation to orientation columns? (columnar arrangement in V1)
Perpendicular
What is a hypercolumn? (in V1)
A cortical processing module for a stimulus that falls within a particular retinal area
Formed from one set of ocular dominance columns and one set of orientation columns
What is feature detection theory of visual processing?
As you go through ventral stream, cells respond to larger and more complex stimuli
Hierarchy of cells
e.g.
V1 - edges and lines
V2 - shapes
V4 - objects
IT - faces
(receptive field size increases)
What two variables can affect spikes per second? (response rate in a particular cell)
Orientation and contrast
What is the Fourier analysis framework?
Visual system deconstructs an image into discrete channels
Each channel conveys information contained in the image at a specific spatial scale and orientation
High spatial frequencies (low scale) for visual detail
Low spatial frequencies (high scale) for broad structure
Visual system then recombines them to form a coherent representation of the scene
What is coarse vs fine luminance?
Coarse luminance changes in an image - reveal large-scale structure
Fine luminance changes in an image - reveal small-scale detail
What happens if you remove high vs low spatial frequency content from an image?
High removed = coarse (blurry)
Low removed = fine (detailed but no sense of bigger picture)
Info present at different scales within the same image
What does the mathematical theorem behind Fourier analysis state?
Any complex signal can be constructed from a set of simpler sinusoidal functions
So, vision can be broken down into simpler parts (frequency and contrast) to explain its complexity
What does contrast sensitivity vary as a function of?
Spatial frequency
This is the contrast sensitivity function (CSF)
What is the contrast sensitivity function?
How well you can see detail across a full range of spatial frequencies
When is sensitivity (CSF) best and why?
Sensitivity is maximum between 2-5 cpd
Sensitivity is better at central range of spatial frequencies as you need less contrast in the image to resolve a pattern
At extremes of spatial frequency - more contrast is needed to resolve a pattern
More sensitive = you don’t need as much contrast to perceive something
What did Blakemore and Campbell want to show with their spatial frequency adaptation experiment?
Show that someone’s sensitivity can change within a narrow range of spatial frequency as a result of adaptation
What is contrast threshold?
The contrast of an image, below which the pattern looks homogenous (cannot see any detail)
What was Blakemore and Campbell’s (1969) spatial frequency adaptation experiment?
Participants set the contrast of an image (grating - stripes) so that they could just perceive luminance differences
Then, they introduced a high contrast adapting stimulus (clear stripes) for 60 secs
- This was at a specific spatial frequency
- Neurons responding to this dull and habituate, becoming less sensitive
Then, participants redo the first task (for contrasts at the same spatial frequency)
What happens to contrast threshold after adaptation? What does this show about neurons involved? (Blakemore and Campbell)
It increases, contrast has to be higher for people to perceive differences
Harder to see stimuli at a lower threshold
Higher threshold = easier to see stimuli
Shows that neurons for that spatial frequency habituated to adaptive stimulus and became less sensitive
After adaptation, how do threshold and sensitivity change in response to spatial frequencies similar to the adapting frequency?
What does this selective adaptation effect imply the existence of?
Threshold is increased
Sensitivity is decreased
This selective adaptation effect implies the existence of multiple, overlapping, spatial frequency channels
What electrophysiological evidence supports conclusions from the spatial frequency adaptation effect - that there are multiple, overlapping, spatial frequency channels?
Contrast sensitivity functions of V1 cells in macaque monkey
Acquired by drifting sinusoidal gratings over receptive fields and measuring response
All acquired from the same location on retina
Cells respond preferential to specific spatial frequency bands - different but overlapping spatial frequency selectivities, they do Fourier analysis of the visual image
What are David Marr’s 3 levels of analysis that must be applied in order to understand how a system works?
1) Computational level
What is the goal of the system?
i.e. what is purpose of vision - could be to recognise an object
2) Algorithmic level
What rules and representations can achieve this goal?
3) Implementational level
How is it achieved physically?
Vision serves multiple goals, including determining where objects are, what shape they have, and how to interact with them.
Each of these goals relies on numerous algorithmic steps.
Edge detection is one of the algorithmic steps that helps us to determine what objects are and what shape they have.
Where do edges exist?
They only exist in interaction between observer and image
We focus on edges even when they are not inherently contained in an image
What are the four steps in Marr’s model of object perception?
Gray-level - photoreceptors
Primal sketch - identifies object boundaries (edges)
2.5D sketch - depth perception
3D model
What does Marr and Hildreth’s model of edge detection assume?
Assumes that edges of an object coincide with gradients in luminance
Only works if coincided luminance change where there is an edge
e.g. luminance gradient would seperate lake from tree
What is involved in Marr and Hildreth’s (1980) model of edge detection - what are the steps? (first and second derivative)
Luminance gradient between two things
e.g. dark tree on light lake background
Take the first derivative of this - shows us sharp luminance changes that correspond to peaks/valleys where there is an intensity gradient
First derivative = rate of change in signal (across edge)
The second derivative is then taken, to stop signal being susceptible to noise - otherwise could be difficult to decide where peaks and valleys are
The second derivative gives us zero crossings (luminance changes across an edge - goes from negative to positive) where there is a luminance gradient
Edge not represented by peak or valley anymore
What does Marr and Hildreth’s model allow computers to do?
Created algorithm so that computers can change image so only edges are highlighted