A Kernel Fuzzy c-Means Clustering-Based Fuzzy Support Vector Machine Algorithm for Classification Problems With Outliers or Noises

被引:214
作者
Yang, Xiaowei [1 ,2 ]
Zhang, Guangquan [2 ]
Lu, Jie [2 ]
Ma, Jun [2 ]
机构
[1] S China Univ Technol, Sch Sci, Dept Math, Guangzhou 510641, Guangdong, Peoples R China
[2] Univ Technol Sydney, Fac Engn & Informat Technol, Ctr Quantum Computat & Intelligent Syst, Sydney, NSW 2007, Australia
基金
澳大利亚研究理事会;
关键词
Classification; fuzzy c-means (FCM); fuzzy support vector machine (FSVM); high-dimensional feature space; kernel clustering; outliers or noises; IMAGE SEGMENTATION; VALIDITY INDEX; SVM; CATEGORIZATION; IMPROVEMENTS; MODELS;
D O I
10.1109/TFUZZ.2010.2087382
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The support vector machine (SVM) has provided higher performance than traditional learning machines and has been widely applied in real-world classification problems and nonlinear function estimation problems. Unfortunately, the training process of the SVM is sensitive to the outliers or noises in the training set. In this paper, a common misunderstanding of Gaussian-function-based kernel fuzzy clustering is corrected, and a kernel fuzzy c-means clustering-based fuzzy SVM algorithm (KFCM-FSVM) is developed to deal with the classification problems with outliers or noises. In the KFCM-FSVM algorithm, we first use the FCM clustering to cluster each of two classes from the training set in the high-dimensional feature space. The farthest pair of clusters, where one cluster comes from the positive class and the other from the negative class, is then searched and forms one new training set with membership degrees. Finally, we adopt FSVM to induce the final classification results on this new training set. The computational complexity of the KFCM-FSVM algorithm is analyzed. A set of experiments is conducted on six benchmarking datasets and four artificial datasets for testing the generalization performance of the KFCM-FSVM algorithm. The results indicate that the KFCM-FSVM algorithm is robust for classification problems with outliers or noises.
引用
收藏
页码:105 / 115
页数:11
相关论文
共 64 条
[1]  
Amo A, 2004, EUR J OPER RES, V156, P495, DOI [10.1016/S0377-2217(03)00002-X, 10.1016/s0377-2217(03)00002-x]
[2]  
[Anonymous], INT C INT AG WEB TEC
[3]  
[Anonymous], Pattern Recognition with Fuzzy Objective Function Algorithms
[4]  
Bezdek J. C., 1973, Journal of Cybernetics, V3, P58, DOI 10.1080/01969727308546047
[5]   Some new indexes of cluster validity [J].
Bezdek, JC ;
Pal, NR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (03) :301-315
[6]   Support vector machine with adaptive parameters in financial time series forecasting [J].
Cao, LJ ;
Tay, FEH .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (06) :1506-1518
[7]   Support vector machines for histogram-based image classification [J].
Chapelle, O ;
Haffner, P ;
Vapnik, VN .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1055-1064
[8]   Robust image segmentation using FCM with spatial constraints based on new kernel-induced distance measure [J].
Chen, SC ;
Zhang, DQ .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (04) :1907-1916
[9]  
Chen YX, 2004, J MACH LEARN RES, V5, P913
[10]   In Silico Prediction of Human Protein Interactions Using Fuzzy-SVM Mixture Models and Its Application to Cancer Research [J].
Chiang, Jung-Hsien ;
Lee, Tsung-Lu Michael .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2008, 16 (04) :1087-1095