Recognition of natural scenes from global properties: Seeing the forest without representing the trees

Cited by: 333
Authors
Greene, Michelle R. [1]
Oliva, Aude [1]
Affiliations
[1] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA
Funding
National Science Foundation (USA)
Keywords
Scene recognition; Basic-level categorization; Global property; Natural images; OBJECT; IMAGE; PERCEPTION; FEATURES; CONTEXT; INFORMATION; CATEGORIZATION; MEMORY; SHAPE; CONNECTEDNESS;
DOI
10.1016/j.cogpsych.2008.06.001
Chinese Library Classification
B84 [Psychology]
Discipline classification codes
04; 0402
Abstract
Human observers are able to rapidly and accurately categorize natural scenes, but the representation mediating this feat is still unknown. Here we propose a framework of rapid scene categorization that does not segment a scene into objects and instead uses a vocabulary of global, ecological properties that describe spatial and functional aspects of scene space (such as navigability or mean depth). In Experiment 1, we obtained ground truth rankings on global properties for use in Experiments 2-4. To what extent do human observers use global property information when rapidly categorizing natural scenes? In Experiment 2, we found that global property resemblance was a strong predictor of both false alarm rates and reaction times in a rapid scene categorization experiment. To what extent is global property information alone a sufficient predictor of rapid natural scene categorization? In Experiment 3, we found that the performance of a classifier representing only these properties is indistinguishable from human performance in a rapid scene categorization task in terms of both accuracy and false alarms. To what extent is this high predictability unique to a global property representation? In Experiment 4, we compared two models that represent scene object information to human categorization performance and found that these models had lower fidelity at representing the patterns of performance than the global property model. These results provide support for the hypothesis that rapid categorization of natural scenes may not be mediated primarily through objects and parts, but also through global properties of structure and affordance. (c) 2008 Elsevier Inc. All rights reserved.
Pages: 137-176
Page count: 40