Effective Semantic Annotation by Image-to-Concept Distribution Model

被引:33
作者
Su, Ja-Hwung [1 ]
Chou, Chien-Li [2 ]
Lin, Ching-Yung [3 ]
Tseng, Vincent S. [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 701, Taiwan
[2] Natl Chiao Tung Univ, Dept Comp Sci & Informat Engn, Hsinchu 30050, Taiwan
[3] IBM TJ Watson Res Ctr, Hawthorne, NY 10532 USA
关键词
Entropy; image annotation; image-to-concept distribution; tf-idf;
D O I
10.1109/TMM.2011.2129502
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Image annotation based on visual features has been a difficult problem due to the diverse associations that exist between visual features and human concepts. In this paper, we propose a novel approach called Annotation by Image-to-Concept Distribution Model (AICDM) for image annotation by discovering the associations between visual features and human concepts from image-to-concept distribution. Through the proposed image-to-concept distribution model, visual features and concepts can be bridged to achieve high-quality image annotation. In this paper, we propose to use "visual features", "models", and "visual genes" which represent analogous functions to the biological chromosome, DNA, and gene. Based on the proposed models using entropy, tf-idf, rules, and SVM, the goal of high-quality image annotation can be achieved effectively. Our empirical evaluation results reveal that the AICDM method can effectively alleviate the problem of visual-to-concept diversity and achieve better annotation results than many existing state-of-the-art approaches in terms of precision and recall.
引用
收藏
页码:530 / 538
页数:9
相关论文
共 25 条
[1]
[Anonymous], P ACM WORKSH LSMRM
[2]
[Anonymous], P CVPR
[3]
[Anonymous], P IEEE INT C COMP VI
[4]
[Anonymous], 2004, MULTIMEDIA'04, DOI [10.1145/1027527.1027748, DOI 10.1145/1027527.1027748]
[5]
[Anonymous], 2007, P INT WORKSHOP WORKS
[6]
[Anonymous], 2004, P 10 ACM SIGKDD INT, DOI [DOI 10.1145/1014052.1014135, 10.1145/1014052]
[7]
LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[8]
EVERINGHAM M, 2009, PASCAL VISUAL CLASSE
[9]
Structured Max-Margin Learning for Inter-Related Classifier Training and Multilabel Image Annotation [J].
Fan, Jianping ;
Shen, Yi ;
Yang, Chunlei ;
Zhou, Ning .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (03) :837-854
[10]
LAVRENKO V, 2004, P IEEE INT C AC SPEE, P17