A behavioral analysis of computational models of visual attention

被引:30
作者
Shic, Frederick [1 ]
Scassellati, Brian [1 ]
机构
[1] Yale Univ, Dept Comp Sci, New Haven, CT 06520 USA
基金
美国国家科学基金会;
关键词
computational attention; robot attention; visual attention model; behavioral analysis; eye-tracking; human validation; saliency map; dimensionality reduction; gaze metric; classification strategy;
D O I
10.1007/s11263-006-9784-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Robots often incorporate computational models of visual attention to streamline processing. Even though the number of visual attention systems employed on robots has increased dramatically in recent years, the evaluation of these systems has remained primarily qualitative and subjective. We introduce quantitative methods for evaluating computational models of visual attention by direct comparison with gaze trajectories acquired from humans. In particular, we focus on the need for metrics based not on distances within the image plane, but that instead operate at the level of underlying features. We present a framework, based on dimensionality-reduction over the features of human gaze trajectories, that can simultaneously be used for both optimizing a particular computational model of visual attention and for evaluating its performance in terms of similarity to human behavior. We use this framework to evaluate the Itti et al. (1998) model of visual attention, a computational model that serves as the basis for many robotic visual attention systems.
引用
收藏
页码:159 / 177
页数:19
相关论文
共 49 条
[1]  
[Anonymous], 787 MIT ART INT LAB
[2]  
[Anonymous], 1973, PATTERN RECOGNITION
[3]  
BALKENIUS C, 2004, P LAVS 04 ST CATH CO
[4]   The computation of optical flow [J].
Beauchemin, SS ;
Barron, JL .
ACM COMPUTING SURVEYS, 1995, 27 (03) :433-467
[5]  
Breazeal C, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P1146
[6]  
Burgard W, 1998, FIFTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-98) AND TENTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICAL INTELLIGENCE (IAAI-98) - PROCEEDINGS, P11
[7]   THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE [J].
BURT, PJ ;
ADELSON, EH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) :532-540
[8]   Evaluation of selective attention under similarity transformations [J].
Draper, BA ;
Lionelle, A .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2005, 100 (1-2) :152-171
[9]   A survey of socially interactive robots [J].
Fong, T ;
Nourbakhsh, I ;
Dautenhahn, K .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2003, 42 (3-4) :143-166
[10]   AIBO: Toward the era of digital creatures [J].
Fujita, M .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2001, 20 (10) :781-794