Nonlinear support vector machine visualization for risk factor analysis using nomograms and localized radial basis function kernels

被引:44
作者
Cho, Baek Hwan [1 ]
Yu, Hwanjo [2 ]
Lee, Jongshill [1 ]
Chee, Young Joon [1 ]
Kim, In Young [1 ]
Kim, Sun I. [1 ]
机构
[1] Hanyang Univ, Dept Biomed Engn, Seoul 133605, South Korea
[2] Univ Iowa, Dept Comp Sci, Iowa City, IA 52242 USA
来源
IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE | 2008年 / 12卷 / 02期
关键词
decision support systems; feature selection; localized radial basis function (LRBF) kernel; nomograms; support vector machines (SVMs); visualization;
D O I
10.1109/TITB.2007.902300
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Nonlinear classifiers, e.g., support vector machines (SVMs) with radial basis function (RBF) kernels, have been used widely for automatic diagnosis of diseases because of their high accuracies. However, it is difficult to visualize the classifiers, and thus difficult to provide intuitive interpretation of results to physicians. We developed a new nonlinear kernel, the localized radial basis function (LRBF) kernel, and new visualization system visualization for risk factor analysis (VRIFA) that applies a nomogram and LRBF kernel to visualize the results of nonlinear SVMs and improve the interpretability of results while maintaining high prediction accuracy. Three representative medical datasets from the University of California, Irvine repository and Statlog dataset-breast cancer, diabetes, and heart disease datasets-were used to evaluate the system. The results showed that the classification performance of the LRBF is comparable with that of the RBF, and the LRBF is easy to visualize via a nomogram. Our study also showed that the LRBF kernel is less sensitive to noise features than the RBF kernel, whereas the LRBF kernel degrades the prediction accuracy more when important features are eliminated. We demonstrated the VRIFA system, which visualizes the results of linear and nonlinear SVMs with LRBF kernels, on the three datasets.
引用
收藏
页码:247 / 256
页数:10
相关论文
共 22 条
  • [1] Pattern recognition techniques for automatic detection of suspicious-looking anomalies in mammograms
    Arodz, T
    Kurdziel, M
    Sevre, EOD
    Yuen, DA
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2005, 79 (02) : 135 - 149
  • [2] A tutorial on Support Vector Machines for pattern recognition
    Burges, CJC
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167
  • [3] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [4] Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
  • [5] Learning from imbalanced data in surveillance of nosocomial infection
    Cohen, Gilles
    Hilario, Melanie
    Sax, Hugo
    Hugonnet, Stephane
    Geissbuhler, Antoine
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 2006, 37 (01) : 7 - 18
  • [6] CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
  • [7] Embrechts M. J., 2003, INT J SMART ENG SYST, V5, P225, DOI DOI 10.1080/10255810390245555
  • [8] Support vector machine classification and validation of cancer tissue samples using microarray expression data
    Furey, TS
    Cristianini, N
    Duffy, N
    Bednarski, DW
    Schummer, M
    Haussler, D
    [J]. BIOINFORMATICS, 2000, 16 (10) : 906 - 914
  • [9] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    [J]. MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [10] Guyon I, 2003, J MACH LEARN RES, P1157, DOI [10.1016/j.aca.2011.07.027, DOI 10.1016/J.ACA.2011.07.027]