A Probabilistic Model of Overt Visual Attention for Cognitive Robots

被引:21
作者
Begum, Momotaz [1 ]
Karray, Fakhri [1 ]
Mann, George K. I. [2 ]
Gosine, Raymond G. [2 ]
机构
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
[2] Mem Univ Newfoundland, Fac Engn & Appl Sci, St John, NF A1B 3X5, Canada
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2010年 / 40卷 / 05期
关键词
Bayes filter; biased competition (BC); overt visual attention; probabilistic modeling; scale invariant feature transform (SIFT); NEURAL BASIS; SEARCH; SHIFTS; MECHANISMS; IMITATION;
D O I
10.1109/TSMCB.2009.2037511
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual attention is one of the major requirements for a robot to serve as a cognitive companion for human. The robotic visual attention is mostly concerned with overt attention which accompanies head and eye movements of a robot. In this case, each movement of the camera head triggers a number of events, namely transformation of the camera and the image coordinate systems, change of content of the visual field, and partial appearance of the stimuli. All of these events contribute to the reduction in probability of meaningful identification of the next focus of attention. These events are specific to overt attention with head movement and, therefore, their effects are not addressed in the classical models of covert visual attention. This paper proposes a Bayesian model as a robot-centric solution for the overt visual attention problem. The proposed model, while taking inspiration from the primates visual attention mechanism, guides a robot to direct its camera toward behaviorally relevant and/or visually demanding stimuli. A particle filter implementation of this model addresses the challenges involved in overt attention with head movement. Experimental results demonstrate the performance of the proposed model.
引用
收藏
页码:1305 / 1318
页数:14
相关论文
共 51 条
[1]  
[Anonymous], 2002, Computational Neuroscience of Vision
[2]  
[Anonymous], 1996, Tools for Statistical Inference
[3]  
[Anonymous], INT J INTELLIGENT CO
[4]  
Baccon JC, 2002, 2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS, P238, DOI 10.1109/IRDS.2002.1041395
[5]   Bottom-up gaze shifts and fixations learning by imitation [J].
Belardinelli, Anna ;
Pirri, Fiora ;
Carbone, Andrea .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :256-271
[6]  
BOLLMANN M, 1999, LNCS, V1542, P392
[7]   Active vision for sociable robots [J].
Breazeal, C ;
Edsinger, A ;
Fitzpatrick, P ;
Scassellati, B .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2001, 31 (05) :443-453
[8]   The FeatureGate model of visual selection [J].
Cave, KR .
PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 1999, 62 (2-3) :182-194
[9]   A NEURAL BASIS FOR VISUAL-SEARCH IN INFERIOR TEMPORAL CORTEX [J].
CHELAZZI, L ;
MILLER, EK ;
DUNCAN, J ;
DESIMONE, R .
NATURE, 1993, 363 (6427) :345-347
[10]   Top-down guided eye movements [J].
Chernyak, DA ;
Stark, LW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2001, 31 (04) :514-522