SUN: A Bayesian framework for saliency using natural statistics

被引:957
作者
Zhang, Lingyun [1 ]
Tong, Matthew H. [1 ]
Marks, Tim K. [1 ]
Shan, Honghao [1 ]
Cottrell, Garrison W. [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
来源
JOURNAL OF VISION | 2008年 / 8卷 / 07期
关键词
saliency; attention; eye movements; computational modeling;
D O I
10.1167/8.7.32
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
We propose a definition of saliency by considering what the visual system is trying to optimize when directing attention. The resulting model is a Bayesian framework from which bottom-up saliency emerges naturally as the self-information of visual features, and overall saliency ( incorporating top-down information with bottom-up saliency) emerges as the pointwise mutual information between the features and the target when searching for a target. An implementation of our framework demonstrates that our model's bottom-up saliency maps perform as well as or better than existing algorithms in predicting people's fixations in free viewing. Unlike existing saliency measures, which depend on the statistics of the particular image being viewed, our measure of saliency is derived from natural image statistics, obtained in advance from a collection of natural images. For this reason, we call our model SUN ( Saliency Using Natural statistics). A measure of saliency based on natural image statistics, rather than based on a single test image, provides a straightforward explanation for many search asymmetries observed in humans; the statistics of a single test image lead to predictions that are not consistent with these asymmetries. In our model, saliency is computed locally, which is consistent with the neuroanatomy of the early visual system and results in an efficient algorithm with few free parameters.
引用
收藏
页数:20
相关论文
共 65 条
[1]  
[Anonymous], 1995, P 17 ANN C COGN SCI
[2]  
[Anonymous], 2007, ADV NEURAL INFORM PR
[3]  
[Anonymous], ADV NEURAL INF PROCE
[4]  
Barlow H.B., 1994, LARGE SCALE NEURONAL, P1
[5]   The ''independent components'' of natural scenes are edge filters [J].
Bell, AJ ;
Sejnowski, TJ .
VISION RESEARCH, 1997, 37 (23) :3327-3338
[6]  
Bruce N., 2006, ADV NEURAL INFORM PR, P155
[7]   A THEORY OF VISUAL-ATTENTION [J].
BUNDESEN, C .
PSYCHOLOGICAL REVIEW, 1990, 97 (04) :523-547
[8]   The role of memory in guiding attention during natural vision [J].
Carmi, Ran ;
Itti, Laurent .
JOURNAL OF VISION, 2006, 6 (09) :898-914
[9]  
CARON RF, 1968, PSYCHON SCI, V10, P207
[10]  
CHAUVIN A, 2002, CONNECTIONIST MODELS, P236