On the plausibility of the discriminant center-surround hypothesis for visual saliency

Cited by: 202
Authors
Gao, Dashan [1]
Mahadevan, Vijay [1]
Vasconcelos, Nuno [1]
Affiliations
[1] Univ Calif San Diego, Dept Elect & Comp Engn, Stat Visual Comp Lab, La Jolla, CA 92093 USA
Source
JOURNAL OF VISION | 2008 / Vol. 8 / No. 07
Keywords
visual search; computational modeling; attention; eye movement; structure of natural images
DOI
10.1167/8.7.13
Chinese Library Classification (CLC) number
R77 [Ophthalmology]
Discipline classification code
100212
Abstract
It has been suggested that saliency mechanisms play a role in perceptual organization. This work evaluates the plausibility of a recently proposed generic principle for visual saliency: that all saliency decisions are optimal in a decision-theoretic sense. The discriminant saliency hypothesis is combined with the classical assumption that bottom-up saliency is a center-surround process to derive a (decision-theoretic) optimal saliency architecture. Under this architecture, the saliency of each image location is equated to the discriminant power of a set of features with respect to the classification problem that opposes stimuli at center and surround. The optimal saliency detector is derived for various stimulus modalities, including intensity, color, orientation, and motion, and shown to make accurate quantitative predictions of various psychophysics of human saliency for both static and motion stimuli. These include some classical nonlinearities of orientation and motion saliency and a Weber law that governs various types of saliency asymmetries. The discriminant saliency detectors are also applied to various saliency problems of interest in computer vision, including the prediction of human eye fixations on natural scenes, motion-based saliency in the presence of ego-motion, and background subtraction in highly dynamic scenes. In all cases, the discriminant saliency detectors outperform previously proposed methods from both the saliency and the general computer vision literatures.
Pages: 18