Familiarity based unified visual attention model for fast and robust object recognition

Cited by: 30

Authors:

Lee, Seungjin ^{[1]}

Kim, Kwanho ^{[1]}

Kim, Joo-Young ^{[1]}

Kim, Minsu ^{[1]}

Yoo, Hoi-Jun ^{[1]}

Affiliation:

[1] Korea Adv Inst Sci & Technol, Div Elect Engn, Sch Elect Engn & Comp Sci, Taejon 305701, South Korea

Source:

PATTERN RECOGNITION | 2010 / Vol. 43 / Issue 03

Keywords:

Visual attention; Object recognition; Scene analysis;

DOI:

10.1016/j.patcog.2009.07.014

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Even though visual attention models using bottom-up saliency can speed up object recognition by predicting object locations, in the presence of multiple salient objects, saliency alone cannot discern target objects from the clutter in a scene. Using a metric named familiarity, we propose a top-down method for guiding attention towards target objects, in addition to bottom-up saliency. To demonstrate the effectiveness of familiarity, the unified visual attention model (UVAM), which combines top-down familiarity and bottom-up saliency, is applied to SIFT based object recognition. The UVAM is tested on 3600 artificially generated images containing COIL-100 objects with varying amounts of clutter, and on 126 images of real scenes. The recognition times are reduced by 2.7x and 2x, respectively, with no reduction in recognition accuracy, demonstrating the effectiveness and robustness of the familiarity based UVAM. © 2009 Elsevier Ltd. All rights reserved.


Pages: 1116-1128

Page count: 13
