Building the gist of a scene: the role of global image features in recognition

被引：942

作者：

Oliva, Aude

Torralba, Antonio

机构：

[1] MIT, Dept Brain & Cognit Sci, Cambridge, MA 02139 USA

[2] MIT, Comp Sci & Artificial Intelligence Lab, Cambridge, MA 02139 USA

来源：

VISUAL PERCEPTION, PT 2: FUNDAMENTALS OF AWARENESS: MULTI-SENSORY INTEGRATION AND HIGH-ORDER PERCEPTION | 2006年 / 155卷

关键词：

scene recognition; gist; spatial envelope; global image feature; spatial frequency; natural image;

D O I：

10.1016/S0079-6123(06)55002-2

中图分类号：

Q189 [神经科学];

学科分类号：

071006 ;

摘要：

Humans can recognize the gist of a novel image in a single glance, independent of its complexity. How is this remarkable feat accomplished? On the basis of behavioral and computational evidence, this paper describes a formal approach to the representation and the mechanism of scene gist understanding, based on scene-centered, rather than object-centered primitives. We show that the structure of a scene image can be estimated by the mean of global image features, providing a statistical summary of the spatial layout properties (Spatial Envelope representation) of the scene. Global features are based on configurations of spatial scales and are estimated without invoking segmentation or grouping operations. The scene-centered approach is not an alternative to local image analysis but would serve as a feed-forward and parallel pathway of visual processing, able to quickly constrain local feature analysis and enhance object recognition in cluttered natural scenes.

引用

页码：23 / 36

页数：14

共 72 条

[1] Seeing sets: Representation by statistical properties [J].