A survey of content-based image retrieval with high-level semantics

Cited by: 928
Authors
Liu, Ying [1]
Zhang, Dengsheng
Lu, Guojun
Ma, Wei-Ying
Affiliations
[1] Monash Univ, Gippsland Sch Comp & Informat Technol, Clayton, Vic 3842, Australia
[2] Microsoft Res Asia, Beijing 100080, Peoples R China
Keywords
content-based image retrieval; semantic gap; high-level semantics; survey
DOI
10.1016/j.patcog.2006.04.045
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In order to improve the retrieval accuracy of content-based image retrieval systems, research focus has been shifted from designing sophisticated low-level feature extraction algorithms to reducing the 'semantic gap' between the visual features and the richness of human semantics. This paper attempts to provide a comprehensive survey of the recent technical achievements in high-level semantic-based image retrieval. Major recent publications are included in this survey covering different aspects of the research in this area, including low-level image feature extraction, similarity measurement, and deriving high-level semantic features. We identify five major categories of the state-of-the-art techniques in narrowing down the 'semantic gap': (1) using object ontology to define high-level concepts; (2) using machine learning methods to associate low-level features with query concepts; (3) using relevance feedback to learn users' intention; (4) generating semantic template to support high-level image retrieval; (5) fusing the evidences from HTML text and the visual content of images for WWW image retrieval. In addition, some other related issues such as image test bed and retrieval performance evaluation are also discussed. Finally, based on existing technology and the demand from real-world applications, a few promising future research directions are suggested. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
Pages: 262-282
Page count: 21