Abstract
The topic of vision is “often studied as if our conscious experience were the ultimate end-product of visual processing” (Hayhoe 2009), with research sometimes overly focused on immediate visual perception—processes aimed at understanding the current visual scene or episode. However, in many situations, visual perception alone is insufficient, and what is also needed is visual memory. In particular, natural visual behavior in the real world requires much more than just understanding the present scene, and often involves the deliberate choice and planning of a sequence of routines or operations involving both vision and memory. These routines are under the active control of the system, and dynamically affect what is seen: the choice of routine affects what is seen, and conversely, what is seen affects subsequent choice of routine. Thus, in real-world tasks, visual perception and memory are intrinsically linked. In this chapter, we explain how for real-world visual tasks, immediate visual perception is not sufficient, and it interacts with visual memory virtually all the time.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aivar MP, Hayhoe MM, Chizk CL, Mruczek REB (2005) Spatial memory and saccadic targeting in a natural task. J Vis 5(3):3
Alba JW, Hasher L (1983) Is memory schematic? Psychol Bull 93(2):203
Bainbridge WA, Isola P, Oliva A (2013) The intrinsic memorability of face photographs. J Exp Psychol Gen 4(142):1323–1334
Balcetis E, Dale R (2003) There is no naked eye: higher-order social concepts clothe visual perception. In: Proceedings of the twenty-fifth annual meeting of the cognitive science society. pp 109–114
Botvinick M, Cohen J (1998) Rubber hands feel touch that eyes see. Nature 391:756
Brady TF, Konkle T, Alvarez GA (2011) A review of visual memory capacity: beyond individual items and toward structured representations. J Vis 11(5)
Brady TF, Konkle T, Alvarez GA, Oliva A (2008) Visual long-term memory has a massive storage capacity for object details. Proc Natl Acad Sci USA 105:14325–14329
Brockmole JR, Henderson JM (2006) Using real-world scenes as contextual cues for search. Vis Cogn 13(1):99–108
Bruce V, Burton AM, Dench N (1994) What’s distinctive about a distinctive face? Q J Exp Psychol A 47(1):119–141
Busey TA (2001). Formal models of familiarity and memorability in face recognition. In: Wenger MJ, Townsend JT (eds) Computational, geometric, and process perspectives on facial cognition: contexts and challenges, Lawrence Erlbaum Associates, pp 147–191
Castelhano M, Henderson J (2005) Incidental visual memory for objects in scenes. Vis Cogn 12(6):1017–1040
Chandrasekhar V, Wu M, Li X, Tan C, Mandal B, Li L, Lim JH (2014) Efficient retrieval from large-scale egocentric visual data using a sparse graph representation. In: IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 541–548
Chandrasekhar V, Wu M, Li X, Tan C, Li L, Lim JH (2014) Incremental graph clustering for efficient retrieval from streaming egocentric video data. In: international conference on pattern recognition. IEEE, pp 2631–2636
Chelazzi L, Miller EK, Duncan J, Desimone R (1993) A neural basis for visual search in inferior temporal cortex. Nature 363:345–347
Craighero L, Bello A, Fadiga L, Rizzolatti G (2002) Hand action preparation influences the responses to hand pictures. Neuropsychologia 40(5):492–502
Dalal N, Triggs W (2005) Histograms of oriented gradients for human detection. In: ieee conference on computer vision and pattern recognition. IEEE, pp 886–893
Damasio AR (1989) Time-locked multiregional retroactivation: a systems-level proposal for the neural substrates of recall and recognition. Cognition 33(1–2):25–62
Duncan J, Humphreys GW (1989) Visual search and stimulus similarity. Psychol Rev 96(3):433
Edelman JA, Cherkasova MV, Nakayama K (2002) A spatial memory system for the guidance of eye movements in crowded visual scenes. J Vis 2(7):572
Felleman DJ, Van Essen DC (1991) Distributed hierarchical processing in the primate cerebral cortex. Cereb Cortex 1(1):1–47
Franconeri SL, Alvarez GA, Cavanagh P (2013) Flexible cognitive resources: competitive content maps for attention and memory. Trends Cogn Sci 17(3):134–141
Fuster J, Jervey J (1982) Neuronal firing in the inferotemporal cortex of the monkey in a visual memory task. J Neurosci 2(3):361–375
González J, Barros-Loscertales A, Pulvermüller F, Meseguer V, Sanjuán A, Belloch V, Avila C (2006) Reading cinnamon activates olfactory brain regions. NeuroImage 32(2):906–912
Gregory R (1970) The intelligent eye. McGraw-Hill
Hayhoe MM (2009) Visual memory in motor planning and action. In: Brockmole JR (ed) The visual world in memory. Psychology Press, pp 117–139
Henderson JM (2008) Eye movements and scene memory. In: Luck S, Hollingworth A (eds) Visual memory. Oxford University Press, pp 87–121
Hollingworth A (2004) Constructing visual representations of natural scenes: the roles of short-and long-term visual memory. J Exp Psychol Hum Percept Perform 30(3):519
Hollingworth A (2005) The relationship between online visual representation of a scene and long-term scene memory. J Exp Psychol Learn Mem Cogn 31(3):396
Hollingworth A (2008) Visual memory for natural scenes. In: Luck S, Hollingworth A (eds) Visual memory. Oxford University Press, pp 123–161
Hollingworth A (2009) Memory for real-world scenes. In: Brockmole JR (ed) The visual world in memory. Psychology Press, pp 89–116
Hollingworth A (2012) Guidance of visual search by memory and knowledge. In Dodd MD, Flowers JH (eds) The Influence of Attention, Learning, and Motivation on Visual Search. Springer, New York, pp 63–89
Hollingworth A, Henderson JM (2002) Accurate visual memory for previously attended objects in natural scenes. J Exp Psychol Hum Percept Perform 28(1):113
Hollingworth A, Richard AM, Luck SJ (2008) Understanding the function of visual short-term memory: transsaccadic memory, object correspondence, and gaze correction. J Exp Psychol Gen 137(1):163
Irwin DE, Yantis S, Jonides J (1983) Evidence against visual integration across saccadic eye movements. Percept Psychophys 34(1):49–57
Isola P, Xiao JX, Torralba A, Oliva A (2011) What makes an image memorable? In: IEEE CVPR. pp 145–152
Kapoula Z (1985) Evidence for a range effect in the saccadic system. Vis Res 25(8):1155–1157
Karn KS, Hayhoe MM (2000) Memory representations guide targeting eye movements in a natural task. Vis Cogn 7(6):673–703
Khosla A, Bainbridge WA, Torralba A, Oliva A (2013) Modifying the memorability of face photographs. In: International conference on computer vision. pp 145–152
Kiefer M, Sim E-J, Herrnberger B, Grothe J, Hoenig K (2008) The sound of concepts: four markers for a link between auditory and conceptual brain systems. J Neurosci 28:12224–12230
Klein RM (2000) Inhibition of return. Trends Cogn Sci 4(4):138–147
Konkle T, Brady TF, Alvarez GA, Oliva A (2010) Conceptual distinctiveness supports detailed visual long-term memory for real-world objects. J Exp Psychol-Gen 139(3):558–578
Land M, Mennie N, Rusted J, others (1999) The roles of vision and eye movements in the control of activities of daily living. Perception 28(11):1311–1328
Lander K, Chuang L (2005) Why are moving faces easier to recognize? Vis Cogn 12:429–442
Loomis JM, Beall AC (2004) Model-based control of perception/action. In: Optic flow and beyond. Springer, pp 421–441
Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110
Luck S (2008) Visual short-term memory. In Luck S Hollingworth A (ed) Visual memory. Oxford University Press, pp 43–85
Luck SJ, Vogel EK (2013) Visual working memory capacity: from psychophysics and neurobiology to individual differences. Trends Cogn Sci 17(8):391–400
Mandler JM, Parker RE (1976) Memory for descriptive and spatial information in complex pictures. J Exp Psychol: Hum Learn Mem 2(1):38
Mandler JM, Ritchey GH (1977) Long-term memory for pictures. J Exp Psychol: Hum Learn Mem 3(4):386
Markman AB, Gentner D (2000) Structure mapping in the comparison process. Am J Psychol Psychol 113(4):501
Martin T, Riley ME, Kelly KN, Hayhoe M, Huxlin KR (2007) Visually-guided behavior of homonymous hemianopes in a naturalistic task. Vis Res 47(28):3434–3446
Matin E (1974) Saccadic suppression: a review and an analysis. Psychol Bull 81(12):899
McGurk H, MacDonald J (1976) Hearing lips and seeing voices. Nature 264(5588):746–748
Messinger A, Squire LR, Zola SM, Albright TD (2001) Neuronal representations of stimulus associations develop in the temporal lobe during learning. Proc Natl Acad Sci USA 98(21):12239–12244
Meyer K, Damasio A (2009) Convergence and divergence in a neural architecture for recognition and memory. Trends Neurosci 32(7):376–382
Oliva A, Torralba A (2001) Modeling the shape of the scene: A holistic representation of the spatial envelope. Int J Comput Vis 42:145–175
Orhan AE, Jacobs R (2014) Toward ecologically realistic theories in visual short-term memory research. Atten Percept Psychophys 76(7):2158–2170
Palmer SE (1999) Vision science: photons to phenomenology. MIT Press, Cambridge, MA
Palmeri TJ, Tarr MJ (2008) Visual object perception and long-term memory. In: Luck S, Hollingworth A (eds), Visual memory. Oxford University Press, pp163–207
Potter MC, Staub A, O’Connor DH (2004) Pictorial and conceptual representation of glimpsed pictures. J Exp Psychol Hum Percept Perform 30(3):478
Rayner K, Pollatsek A (1983) Is visual information integrated across saccades? Percept Psychophys 34(1):39–48
Rock I, Englestein P (1959) A study of memory for visual form. Am J Psychol
Serre T, Oliva A, Poggio T (2007) A feedforward architecture accounts for rapid categorization. Proc Natl Acad Sci USA 104(15):6424–6429
Shams L, Kamitani Y, Shimojo S (2000) Illusions: what you see is what you hear. Nature 408(6814):788
Shechtman E, Irani M (2007) Matching local self-similarities across images and videos. In: Proceedings of the IEEE computer society conference on computer vision and pattern recognition. pp 1–8
Shenton JT, Schwoebel J, Coslett HB (2004) Mental motor imagery and the body schema: evidence for proprioceptive dominance. Neurosci Lett 370(1):19–24
Shepard RN, Metzler J (1971) Mental rotation of three-dimensional objects. Science 171(3972):701–703
Sinha P, Balas B, Ostrovsky Y, Russell R (2006) Face recognition by humans: Nineteen results all computer vision researchers should know about. Proc IEEE 94(11):1948–1962
Spence C (2011) Crossmodal correspondences: A tutorial review. Atten Percept Psychophys 73(4):971–995
Standing L (1973) Learning 10000 pictures. Q J Exp Psychol 25(2):207–222
Suchow JW, Fougnie D, Brady TF, Alvarez GA (2014) Terms of the debate on the format and structure of visual memory. Atten Percept Psychophys 76(7):2071–2079
Sweeny TD, Guzman-Martinez E, Ortega L, Grabowecky M, Suzuki S (2012) Sounds exaggerate visual shape. Cognition 124(2):194–200
Tan C, Goh H, Chandrasekhar V, Li L, Lim J-H (2014) Understanding the nature of first-person videos: characterization and classification using low-level features. In: 2014 IEEE conference on computer vision and pattern recognition workshops. IEEE, pp 549–556
Tan C, Leibo JZ, Poggio T (2013) Throwing down the visual intelligence gauntlet. In: Cipolla R, Battiato S, Farinella GM (eds) Machine learning for computer vision. Springer, pp 1–15
Tatler BW (2014) Eye movements from laboratory to life. In Horsley M, Eliot M, Knight BA, Reilly R (eds) Current trends in eye tracking research. Springer International Publishing, pp 17–35
Torralba A, Oliva A, Castelhano MS, Henderson JM (2006) Contextual guidance of eye movements and attention in real-world scenes: the role of global features in object search. Psychol Rev 113(4):766
Ullman S (1984) Visual routines. Cognition 18(1):97–159
Van Beers RJ, Sittig AC, van der Gon JJD (1999) Integration of proprioceptive and visual position-information : An experimentally supported model. J Neurophysiol 81(3):1355–1364
Vokey JR, Read JD (1992) Familiarity, memorability, and the effect of typicality on the recognition of faces. Mem Cogn 20(3):291–302
Williams CC, Henderson JM, Zacks F (2005) Incidental visual memory for targets and distractors in visual search. Percept Psychophys 67(5):816–827
Wolfe JM (1998) Vis search Attention 1:13–73
Zelinsky GJ, Rao RPN, Hayhoe MM, Ballard DH (1997) Eye movements reveal the spatiotemporal dynamics of visual search. Psycholo Sci 448–453
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Science+Business Media Singapore
About this chapter
Cite this chapter
Tan, C., Lallee, S., Mandal, B. (2017). Vision and Memory: Looking Beyond Immediate Visual Perception. In: Zhao, Q. (eds) Computational and Cognitive Neuroscience of Vision. Cognitive Science and Technology. Springer, Singapore. https://doi.org/10.1007/978-981-10-0213-7_9
Download citation
DOI: https://doi.org/10.1007/978-981-10-0213-7_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0211-3
Online ISBN: 978-981-10-0213-7
eBook Packages: EngineeringEngineering (R0)