Modeling global scene factors in attention

Cited by: 182
Author
Torralba, A [1]
Affiliation
[1] MIT, Artificial Intelligence Lab, Cambridge, MA 02115 USA
DOI
10.1364/JOSAA.20.001407
Chinese Library Classification (CLC) number
O43 [Optics];
Discipline classification code
070207; 0803
Abstract
Models of visual attention have focused predominantly on bottom-up approaches that ignored structured contextual and scene information. I propose a model of contextual cueing for attention guidance based on the global scene configuration. It is shown that the statistics of low-level features across the whole image can be used to prime the presence or absence of objects in the scene and to predict their location, scale, and appearance before exploring the image. In this scheme, visual context information can become available early in the visual processing chain, which allows modulation of the saliency of image regions and provides an efficient shortcut for object detection and recognition. © 2003 Optical Society of America.
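The idea in the abstract can be illustrated with a toy sketch. This is not the author's implementation: the function names (`global_features`, `row_prior`, `modulate`), the grid pooling of gradient magnitude, and the Gaussian prior over image rows are all assumptions chosen for illustration. In the paper the mapping from global features to expected object location is learned from data; here the expected row and its spread are simply passed in by hand.

```python
import math

def global_features(image, grid=2):
    """Hypothetical global descriptor: mean horizontal-gradient magnitude
    pooled over a grid x grid layout covering the whole image."""
    h, w = len(image), len(image[0])
    sums = [[0.0] * grid for _ in range(grid)]
    counts = [[0] * grid for _ in range(grid)]
    for y in range(h):
        for x in range(w - 1):
            gy, gx = y * grid // h, x * grid // w
            sums[gy][gx] += abs(image[y][x + 1] - image[y][x])
            counts[gy][gx] += 1
    # max(..., 1) avoids division by zero for very narrow images
    return [sums[i][j] / max(counts[i][j], 1)
            for i in range(grid) for j in range(grid)]

def row_prior(height, mean_row, sigma):
    """Assumed contextual prior: a normalized Gaussian over image rows.
    mean_row and sigma stand in for values predicted from global features."""
    weights = [math.exp(-0.5 * ((y - mean_row) / sigma) ** 2)
               for y in range(height)]
    total = sum(weights)
    return [wt / total for wt in weights]

def modulate(saliency, prior):
    """Weight a bottom-up saliency map by the row prior and renormalize
    so the result sums to 1."""
    out = [[s * prior[y] for s in row] for y, row in enumerate(saliency)]
    total = sum(sum(row) for row in out)
    return [[v / total for v in row] for row in out]

# Toy 4x4 image: flat top half, textured bottom half.
image = [[0, 0, 0, 0],
         [0, 0, 0, 0],
         [0, 1, 0, 1],
         [1, 0, 1, 0]]
features = global_features(image)

# Uniform bottom-up saliency, with context favoring the bottom row.
saliency = [[1.0] * 4 for _ in range(4)]
contextual = modulate(saliency, row_prior(4, mean_row=3, sigma=1.0))
```

In this toy setting the contextual map concentrates attention on the lower rows even though the bottom-up saliency is uniform, which is the kind of modulation the abstract describes.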
Pages: 1407-1418
Number of pages: 12