Semantic interactive image retrieval combining visual and conceptual content description

被引:2
作者
Marin Ferecatu
Nozha Boujemaa
Michel Crucianu
机构
[1] INRIA Rocquencourt,
[2] IMEDIA Team,undefined
[3] CNAM Paris,undefined
来源
Multimedia Systems | 2008年 / 13卷
关键词
Cross-modal image retrieval; Relevance feedback; Active learning; Semantic indexing;
D O I
暂无
中图分类号
学科分类号
摘要
We address the challenge of semantic gap reduction for image retrieval through an improved support vector machines (SVM)-based active relevance feedback framework, together with a hybrid visual and conceptual content representation and retrieval. We introduce a new feature vector based on projecting the keywords associated to an image on a set of “key concepts” with the help of an external lexical database. We then put forward two improvements of SVM-based relevance feedback method. First, to optimize the transfer of information between the user and the system, we introduce a new active learning selection criterion that minimizes redundancy between the candidate images shown to the user. Second, as most image classes span a wide range of scales in the description space, we argue that the insensitivity of the SVM to the scale of the data is desirable in this context and we show how to obtain it by using specific kernel functions. Experimental evaluations show that the joint use of the new concept-based feature vector and the visual features with our relevance feedback scheme can significantly improve the quality of the results.
引用
收藏
页码:309 / 322
页数:13
相关论文
共 41 条
[1]  
Adams W.H.(2003)Semantic indexing of multimedia content using visual, audio and text cues EURASIP J. Appl. Signal Process. 3 170-185
[2]  
Iyengar G.(1999)Support-vector machines for histogram-based image classification IEEE Trans. Neural Netw. 10 1055-1064
[3]  
Lin C.Y.(1996)Active learning with statistical models J. Artif. Intell. Res. 4 129-145
[4]  
Naphade M.R.(2000)The Bayesian image retrieval system, PicHunter: theory, implementation and psychophysical experiments IEEE Trans. Image Process. 9 20-37
[5]  
Neti C.(2001)Bayes point machines J. Mach. Learning Res. 1 245-279
[6]  
Nock H.J.(1998)Using corpus statistics and WordNet relations for sense identification Comput. Linguist. 24 147-165
[7]  
Smith J.R.(1995)Cyc: a large-scale investment in knowledge infrastructure Commun. ACM 38 33-38
[8]  
Chapelle O.(2004)Conceptnet: a practical commonsense reasoning tool-kit BT Technol. J. 22 211-226
[9]  
Haffner P.(2000)The kernel trick for distances Adv. Neural Inf. Process. Systems 12 301-307
[10]  
Vapnik V.N.(2000)Content-based image retrieval at the end of the early years IEEE Trans. Pattern Anal. Mach. Intell. 22 1349-1380