Attention-driven image interpretation with application to image retrieval

被引:66
作者
Fu, Hong
Chi, Zheru
Feng, Dagan
机构
[1] Hong Kong Polytech Univ, Dept Elect & Informat Engn, Ctr Multimedia Signal Proc, Kowloon, Hong Kong, Peoples R China
[2] Univ Sydney, Sch Informat Technol, Sydney, NSW 2006, Australia
关键词
D O I
10.1016/j.patcog.2005.12.015
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Visual attention, a selective procedure of human's early vision, plays a very important role for humans to understand a scene by intuitively emphasizing some focused regions/objects. Being aware of this, we propose an attention-driven image interpretation method that pops out visual attentive objects from an image iteratively by maximizing a global attention function. In this method, an image can be interpreted as containing several perceptually attended objects as well as a background, where each object has an attention value. The attention values of attentive objectives are then mapped to importance factors so as to facilitate the subsequent image retrieval. An attention-driven matching algorithm is proposed in this paper based on a retrieval strategy emphasizing attended objects. Experiments on 7376 Hemera color images annotated by keywords show that the retrieval results from our attention-driven approach compare favorably with conventional methods, especially when the important objects are seriously concealed by the irrelevant background. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1604 / 1621
页数:18
相关论文
共 34 条
[1]  
[Anonymous], 1999, 7 ACM INT C MULT, DOI DOI 10.1145/319878.319896
[2]   Edge based features for content based image retrieval [J].
Banerjee, M ;
Kundu, MK .
PATTERN RECOGNITION, 2003, 36 (11) :2649-2661
[3]   Blobworld: Image segmentation using expectation-maximization and its application to image querying [J].
Carson, C ;
Belongie, S ;
Greenspan, H ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (08) :1026-1038
[4]  
CASTLEMAN K. R., 1996, Digital image processing
[5]   A hierarchical neural system with attentional top-down enhancement of the spatial resolution for object recognition [J].
Deco, G ;
Schürmann, B .
VISION RESEARCH, 2000, 40 (20) :2845-2859
[6]   Unsupervised segmentation of color-texture regions in images and video [J].
Deng, YN ;
Manjunath, BS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (08) :800-810
[7]   Active object recognition integrating attention and viewpoint control [J].
Dickinson, SJ ;
Christensen, HI ;
Tsotsos, JK ;
Olofsson, G .
COMPUTER VISION AND IMAGE UNDERSTANDING, 1997, 67 (03) :239-260
[8]   Statistical modeling and conceptualization of natural images [J].
Fan, JP ;
Gao, YL ;
Luo, HZ ;
Xu, GY .
PATTERN RECOGNITION, 2005, 38 (06) :865-885
[9]  
Feng HM, 2004, 10TH INTERNATIONAL MULTIMEDIA MODELLING CONFERENCE, PROCEEDINGS, P249
[10]   Bayesian relevance feedback for content-based image retrieval [J].
Giacinto, G ;
Roli, F .
PATTERN RECOGNITION, 2004, 37 (07) :1499-1508