A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis

被引:255
作者
Chen, Hui-Ling [1 ,2 ]
Yang, Bo [1 ,2 ]
Liu, Jie [1 ,2 ]
Liu, Da-You [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
基金
中国国家自然科学基金;
关键词
Breast cancer diagnosis; Rough set theory; Support vector machines; Feature selection; SYSTEM; RULES;
D O I
10.1016/j.eswa.2011.01.120
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Breast cancer is becoming a leading cause of death among women in the whole world, meanwhile, it is confirmed that the early detection and accurate diagnosis of this disease can ensure a long survival of the patients. Expert systems and machine learning techniques are gaining popularity in this field because of the effective classification and high diagnostic capability. In this paper, a rough set (RS) based supporting vector machine classifier (RS_SVM) is proposed for breast cancer diagnosis. In the proposed method (RS_SVM), RS reduction algorithm is employed as a feature selection tool to remove the redundant features and further improve the diagnostic accuracy by SVM. The effectiveness of the RS_SVM is examined on Wisconsin Breast Cancer Dataset (WBCD) using classification accuracy, sensitivity, specificity, confusion matrix and receiver operating characteristic (ROC) curves. Experimental results demonstrate the proposed RS_SVM can not only achieve very high classification accuracy but also detect a combination of five informative features, which can give an important clue to the physicians for breast diagnosis. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:9014 / 9022
页数:9
相关论文
共 28 条
[1]   Supervised fuzzy clustering for the identification of fuzzy classifiers [J].
Abonyi, J ;
Szeifert, F .
PATTERN RECOGNITION LETTERS, 2003, 24 (14) :2195-2207
[2]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[3]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[4]  
Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
[5]   SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[6]  
Cristianini Nello, 2000, An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods, DOI DOI 10.1017/CB09780511801389
[7]   Feature selection for support vector machines by means of genetic algorithms [J].
Fröhlich, H ;
Chapelle, O ;
Schölkopf, B .
15TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2003, :142-148
[8]  
GOODMAN D, 2002, INTELLIGENT ENG SYST, V12, P179
[9]   Rough set approach for attribute reduction and rule generation: A case of patients with suspected breast cancer [J].
Hassanien, AE .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2004, 55 (11) :954-962
[10]  
Hsu C., 2003, A practical guide to support vector classification