A Multiobjective Simultaneous Learning Framework for Clustering and Classification

Cited by: 42
Authors
Cai, Weiling [1, 2]
Chen, Songcan [1 ]
Zhang, Daoqiang [1 ]
Affiliations
[1] Nanjing Univ Aeronaut & Astronaut, Dept Comp Sci & Engn, Nanjing 210016, Peoples R China
[2] Nanjing Normal Univ, Dept Comp Sci & Engn, Nanjing 210097, Peoples R China
Source
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2010 / Vol. 21 / No. 02
Funding
National Science Foundation (USA);
Keywords
Bayesian theory; classification learning; clustering learning; multiobjective optimization; pattern recognition; VECTOR QUANTIZATION; VALIDITY INDEX; ALGORITHM; RECOGNITION;
DOI
10.1109/TNN.2009.2034741
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Traditional pattern recognition involves two tasks: clustering learning and classification learning. Clustering results can enhance the generalization ability of classification learning, while class information can improve the accuracy of clustering learning. Hence, the two learning methods complement each other. To combine their advantages, many existing algorithms fuse them sequentially, first optimizing the clustering criterion and then optimizing the classification criterion associated with the obtained clustering results. However, such algorithms inherently fail to optimize the two criteria simultaneously, and thus have to sacrifice either clustering performance or classification performance. To overcome this problem, in this paper we present a multiobjective simultaneous learning framework (MSCC) for both clustering and classification learning. MSCC uses multiple objective functions to formulate the clustering and classification problems, respectively, and, more importantly, it employs Bayesian theory to make all of these functions depend only on the same set of parameters, i.e., the clustering centers, which act as a bridge connecting clustering and classification learning. By simultaneously optimizing the clustering centers embedded in these functions, both effective clustering performance and promising classification performance can be attained. Furthermore, from the multiple Pareto-optimal solutions obtained by MSCC, we observe that the clustering and classification learning processes are complementary to a great extent. Empirical results on both synthetic and real data sets demonstrate the effectiveness and potential of MSCC.
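The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the fuzzy c-means style memberships, the particular cost functions, and all function names (`memberships`, `objectives`, `pareto_front`) are assumptions chosen for illustration. The sketch shows two objectives that depend only on the cluster centers, a class posterior obtained from cluster memberships by the law of total probability, and a filter that keeps the Pareto-optimal (non-dominated) candidates.

```python
import numpy as np

def memberships(X, centers, m=2.0):
    """Fuzzy c-means style membership u[i, j] of sample i in cluster j (assumed form)."""
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2) + 1e-12
    inv = d2 ** (-1.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)

def objectives(X, y, centers, n_classes, m=2.0):
    """Return (clustering cost, classification error); both are to be minimized."""
    y = np.asarray(y)
    u = memberships(X, centers, m)
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    cluster_cost = float(((u ** m) * d2).sum())
    # P(class | cluster), estimated from membership-weighted label counts.
    onehot = np.eye(n_classes)[y]
    p_class_given_cluster = u.T @ onehot
    p_class_given_cluster /= p_class_given_cluster.sum(axis=1, keepdims=True)
    # Total probability: P(class | x) = sum_j P(class | cluster j) * P(cluster j | x).
    posterior = u @ p_class_given_cluster
    error = float((posterior.argmax(axis=1) != y).mean())
    return cluster_cost, error

def pareto_front(points):
    """Indices of non-dominated points, assuming every objective is minimized."""
    pts = np.asarray(points, dtype=float)
    keep = []
    for i, p in enumerate(pts):
        # p is dominated if some point is no worse everywhere and better somewhere.
        dominated = np.any(np.all(pts <= p, axis=1) & np.any(pts < p, axis=1))
        if not dominated:
            keep.append(i)
    return keep
```

In use, one would evaluate `objectives` for many candidate sets of centers (produced, for example, by a multiobjective search procedure) and pass the resulting pairs to `pareto_front` to obtain the trade-off solutions between clustering quality and classification accuracy.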
Pages: 185-200
Number of pages: 16