Familiarity based unified visual attention model for fast and robust object recognition

被引:30
作者
Lee, Seungjin [1 ]
Kim, Kwanho [1 ]
Kim, Joo-Young [1 ]
Kim, Minsu [1 ]
Yoo, Hoi-Jun [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Div Elect Engn, Sch Elect Engn & Comp Sci, Taejon 305701, South Korea
关键词
Visual attention; Object recognition; Scene analysis;
D O I
10.1016/j.patcog.2009.07.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Even though visual attention models using bottom-up saliency can speed up object recognition by predicting object locations, in the presence of multiple salient objects, saliency alone cannot discern target objects from the clutter in a scene. Using a metric named familiarity, we propose a top-down method for guiding attention towards target objects, in addition to bottom-up saliency. To demonstrate the effectiveness of familiarity. the unified visual attention model (UVAM) which combines top-down familiarity and bottom-up saliency is applied to SIFT based object recognition. The UVAM is tested on 3600 artificially generated images containing COIL-100 objects with varying amounts of clutter, and on 126 images of real scenes. The recognition times are reduced by 2.7x and 2x, respectively, with no reduction in recognition accuracy, demonstrating the effectiveness and robustness of the familiarity based UVAM. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1116 / 1128
页数:13
相关论文
共 31 条
[1]  
[Anonymous], 1996, COLUMBIA OBJECT IMAG
[2]  
[Anonymous], 1967, Cognitive Psychology
[3]   An optimal algorithm for approximate nearest neighbor searching in fixed dimensions [J].
Arya, S ;
Mount, DM ;
Netanyahu, NS ;
Silverman, R ;
Wu, AY .
JOURNAL OF THE ACM, 1998, 45 (06) :891-923
[4]   GENERALIZING THE HOUGH TRANSFORM TO DETECT ARBITRARY SHAPES [J].
BALLARD, DH .
PATTERN RECOGNITION, 1981, 13 (02) :111-122
[5]  
BONAIUTO JJ, 2005, IEEE COMP SOC COMP V
[6]  
BROWN M., 2002, BRIT MACHINE VISION, P656, DOI DOI 10.5244/C.16.23
[7]   FAMILIARITY AND ATTENTION - DOES WHAT WE KNOW AFFECT WHAT WE NOTICE [J].
CHRISTIE, J ;
KLEIN, R .
MEMORY & COGNITION, 1995, 23 (05) :547-550
[8]   A hierarchical neural system with attentional top-down enhancement of the spatial resolution for object recognition [J].
Deco, G ;
Schürmann, B .
VISION RESEARCH, 2000, 40 (20) :2845-2859
[9]   Neural Mechanisms of Selective Visual Attention [J].
Moore, Tirin ;
Zirnsak, Marc .
ANNUAL REVIEW OF PSYCHOLOGY, VOL 68, 2017, 68 :47-72
[10]   PARALLEL PROCESSING IN VISUAL SAME-DIFFERENT DECISIONS [J].
DONDERI, DC ;
ZELNICKE.D .
PERCEPTION & PSYCHOPHYSICS, 1969, 5 (04) :197-&