Relaxational metric adaptation and its application to semi-supervised clustering and content-based image retrieval

Cited by: 19
Authors
Chang, Hong
Yeung, Dit-Yan
Cheung, William K.
Affiliations
[1] Hong Kong Univ Sci & Technol, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
[2] Hong Kong Baptist Univ, Dept Comp Sci, Kowloon, Hong Kong, Peoples R China
Keywords
distance metric; nonparametric method; semi-supervised clustering; constrained k-means; side information; pairwise similarity and dissimilarity; content-based image retrieval
DOI
10.1016/j.patcog.2006.04.006
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The performance of many supervised and unsupervised learning algorithms is very sensitive to the choice of an appropriate distance metric. Previous work in metric learning and adaptation has mostly been focused on classification tasks by making use of class label information. In standard clustering tasks, however, class label information is not available. In order to adapt the metric to improve the clustering results, some background knowledge or side information is needed. One useful type of side information is in the form of pairwise similarity or dissimilarity information. Recently, some novel methods (e.g., the parametric method proposed by Xing et al.) for learning global metrics based on pairwise side information have been shown to give promising results. In this paper, we propose a nonparametric method, called relaxational metric adaptation (RMA), for the same metric adaptation problem. While RMA is local in the sense that it allows locally adaptive metrics, it is also global because even patterns not in the vicinity can have long-range effects on the metric adaptation process. Experimental results for semi-supervised clustering based on both simulated and real-world data sets show that RMA outperforms Xing et al.'s method under most situations. Besides applying RMA to semi-supervised learning, we have also used it to improve the performance of content-based image retrieval systems through metric adaptation. Experimental results based on two real-world image databases show that RMA significantly outperforms other methods in improving the image retrieval performance. (c) 2006 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
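Illustrative note: the idea of adapting a metric from pairwise similarity information can be shown with a minimal sketch. This is not RMA and not Xing et al.'s method; it is a much simpler diagonal feature-weighting baseline, chosen only to show how similar-pair side information can change which points count as close. The function names, the toy data, and the `eps` parameter are all assumptions made for illustration.

```python
from statistics import fmean

def diagonal_weights(points, similar_pairs, eps=1e-9):
    """Hypothetical baseline: weight each feature by the inverse of its
    mean squared difference over the pairs marked as similar."""
    dim = len(points[0])
    weights = []
    for d in range(dim):
        msd = fmean((points[i][d] - points[j][d]) ** 2 for i, j in similar_pairs)
        weights.append(1.0 / (msd + eps))  # eps avoids division by zero
    return weights

def weighted_distance(x, y, weights):
    """Weighted Euclidean distance; all-ones weights give the plain metric."""
    return sum(w * (a - b) ** 2 for w, a, b in zip(weights, x, y)) ** 0.5

# Toy data (assumed): the second feature is noisy, the first separates groups.
points = [(0.0, 0.0), (0.1, 5.0), (3.0, 0.2), (3.1, 5.1)]
similar_pairs = [(0, 1), (2, 3)]      # side information: "these belong together"

plain = [1.0, 1.0]
adapted = diagonal_weights(points, similar_pairs)

# Under the plain metric, point 0 is closer to point 2 than to its similar
# partner (point 1); under the adapted metric the order is reversed.
plain_ok = weighted_distance(points[0], points[1], plain) < weighted_distance(points[0], points[2], plain)
adapted_ok = weighted_distance(points[0], points[1], adapted) < weighted_distance(points[0], points[2], adapted)
print(plain_ok, adapted_ok)
```

A single global weighting like this cannot express the locally adaptive behavior the abstract attributes to RMA; it only illustrates the role of pairwise side information.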
Pages: 1905-1917
Page count: 13