What is the problem with object recognition?
There is an infinite number of retinal images that could correspond to a particular object
So how are our brain able to look at all of these different images, and interpret them as the same thing
What is the solution to the problem of object recogintion?
= Our brain uses the visual input and matches this with internal object representations held in object stores ( eg held in memory)
eg if asked to imagine an elephant, you can do that because you have seen an image of an elephant before, you can activate a visual representation of an elephant in memory
(However you cannot do this if you have aphantasia)
So object representation really mean - mental image of what we think an object looks like
So if the visual input matches the representation, then we have object representation
If the visual input does not match the representation, then we do not have object recognition
How does the matching process work?
We have possible different theories for this
What is the main idea of MArrs structural analysis, give analogises
Two objects may differ in their 2-dimensional appearance, but they become equalaivulent when they have a 3 dimensional representation
ie visual inputs must be analysed at different levels
-WHen presented with this 2 dimensional elephant along the bottom, difficult to match with the two dimensional elephant on the top
Its not easy to compare the two images and decide that they are the same
While, if you have a 3-dimensional model of an elephant in front of your inner eye, which you can mentally rotate in all directions, it is not as difficult to decide that this is all of the same elephant
What are the 4 sequential levels of structural analysis?
What is the raw primal stage of structural analysis?
What is the full primal stage of structural analysis?
-This is a little more elborate - puts together basic geometrical organisations
eg edges become clear, contours become clearer
Basic shapes may appear
What is the 2 1/2 D stage of structural analysis?
So we just perceive whether some objects are closer or further away from us
That is why it is called the 1/2 dimensional sketch, because it includes some depth information
It is viewer centred representation
It doesn’t necessarily help us to recognise an object, but it can ( only if the representation is prototypical)
What is the 3D stage of structural analysis?
-In order to fully recognise an objec, we need a 3 dimensional model ( not a sketch)
-Includes depth in that we can rotate the object mentally
So this is an object centred representation , not dependant on our viewpoint
This can then be compared and matched with the representation that we hold in memory
What is so good about Marrs hypothesis?
-They have inspired a cognitive model
-This is important because cognitive models make predictions about human behaviour
-This is good because we can then test test predictions, and thus verify or falsify the model
-The more evidence that we find the confirms the models predictions, the more reliable and valid the model is
The cognitive model that this inspired is the Ellsion and Youngs model of object recognition
What is Ellion and Youngs overall model look lile - Draw this out
Explain how cognitive models work
What is the initial represenation stage of the model?
We then generate an initial representation of this object (this is what Marr would call the primal sketch)
So this is the levels of brightness on our retina, first definitions of edges and contours ( eg we talked about receptive fields which are specialised for this)
So all about the grouping and segmentations -in terms of gestalt psychologists
What is the viewer centred recognition stage of the model?
This primal sketch, is then transformed into a 2 1/2 dimensional sketch
So these edgess and contours are now assembled - so that there is an object representation
However, is it an object representation that is solely based on our viewpoint
What is the object centred recognition stage of the model?
. The 2 1/2 D sketch is then transferred into a 3dimensional model
This objected centred representation can be manipulated mentally, it’s the real shape and position of objects in the visual scene
Independent of the observers viewpoint
This is the object representation that we can then use in order to compare to the object representations that we hold in memory
What is the object recognition stage of the model?
This is where Ellison and Young expand Marrs original theory)
So we have the memory storages/spaces that holds all of the object representations that we accumulate in a lifetime
Every new object that we meet, will create a new representation in our object representation units
And when we find an object, we will pull out a representation that matches these visual inputs the best, which will hopefully allow us to recognise the object
( however in the model there is also a shortcut from the viewer centred representation to object recognition units)- Recognition shortcut
So this is for unambiguous viewer centred representations ( eg where you can still recognise the object and dont need the 3dimensional model
It is a prototypical representation of an object
What is the semantic system stage of the model?
Once the visual input is matched with the object recognition units, that allows us to gain access to our semantic memory system in which we store general information, but also personal facts about objects. This also includes the name of the object
What is the name retrial stage of the model?
Accessing these facts generates the name retrieval
How can these 7 stages be grouped into 3 sections?
We have the initial representations- which we can refer to as the sensory processing
We have viewer centric and object centric representations - Which can be put together as perceptual processing
And then we have the recognition units - Which we can put together and refer to as recognition
What does it mean by testing the cognitive model?
What are the 3 examples we use to test the cognitive model?
Describe patient Mr M
Who- Reported by Benson and Greenberg
What happened - Suffered carbon monoxide poisoning
What are the neurological effects -
CAN-
-Maintain fixation
-Name colours
-Describe sensations
-Detect small changes in brightness and wavelength
-Detect movement of small objects
CANNOT
-Percive shape or form
-Cannot copy simple figures, letters, digits
-Cannot match simple objects
-Recognise or name objects and pictures of objects
We can seen that he does not have viewer-centred representation, as he has no pereception of basic shapes
Therefore, he does not have the rest of the model either
Describe the pateint with the right posterior lesion
Who - Reported by Warrington
What happened- Patients had right poster lesions
Neurological effects:
CAN-
Can match objects for size, colour, brightness
Can recognise objects from pictures of usual views
No impairment with object recognition in real life
Can explain the use of objects
CANNOT
Cannot recognise objects in unusual views
In order to perceive objects in unusual views, we need 3D perception, in order to mentally rotate the object
But despite this, they can still access the object recognition units via the recognition shortcut - for usual objects
Which is why they can correctly explain the use of the object as they can assess the semantic system and they can also name the objects
Describe pateint AB
Who - Reported by Warrington
What happened - Cerebral atrophy ( loss of neurons and connections between them)
Neurological affects:
CAN-
Distinguish basic shapes
Identify colours, numbers, letters
Match pictures of different views ( usual and unusual)
CANNOT-
-Cannot Name common objects or famous faces - says they look familiar but cannot name them
-Makes subsitiuons when categorising objects eg a donkey is a horse, a dog is a cat - eg can’t explain what objects do
-Cannot identify meaningful sounds
So, the whole semantic part of the system has been iterrupted
So whether is a problem with the object recognition unit or accessing the semantic system, we dont know for sure