Classification of multiple cancer types by tip multicategory support vector machines using gene expression data

被引:191
作者
Lee, Y [1 ]
Lee, CK
机构
[1] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
[2] Univ Wisconsin, Mol & Environm Toxicol Ctr, Madison, WI 53706 USA
关键词
D O I
10.1093/bioinformatics/btg102
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: High-density DNA microarray measures the activities of several thousand genes simultaneously and the gene expression profiles have been used for the cancer classification recently. This new approach promises to give better therapeutic measurements to cancer patients by diagnosing cancer types with improved accuracy. The Support Vector Machine (SVM) is one of the classification methods successfully applied to the cancer diagnosis problems. However, its optimal extension to more than two classes was not obvious, which might impose limitations in its application to multiple tumor types. We briefly introduce the Multicategory SVM, which is a recently proposed extension of the binary SVM, and apply it to multiclass cancer diagnosis problems. Results: Its applicability is demonstrated on the leukemia data (Golub et al., 1999) and the small round blue cell tumors of childhood data (Khan et al., 2001). Comparable classification accuracy shown in the applications and its flexibility render the MSVM a viable alternative to other classification methods.
引用
收藏
页码:1132 / 1139
页数:8
相关论文
共 19 条
[1]  
[Anonymous], 1987, P CBMS NSF REG C SER
[2]   Multicategory classification by support vector machines [J].
Bredensteiner, EJ ;
Bennett, KP .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 1999, 12 (1-3) :53-79
[3]  
DeRisi J, 1996, NAT GENET, V14, P457
[4]   Comparison of discrimination methods for the classification of tumors using gene expression data [J].
Dudoit, S ;
Fridlyand, J ;
Speed, TP .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) :77-87
[5]   Interfaces to PATH 3.0: Design, implementation and usage [J].
Ferris, MC ;
Munson, TS .
COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 1999, 12 (1-3) :207-227
[6]   Support vector machine classification and validation of cancer tissue samples using microarray expression data [J].
Furey, TS ;
Cristianini, N ;
Duffy, N ;
Bednarski, DW ;
Schummer, M ;
Haussler, D .
BIOINFORMATICS, 2000, 16 (10) :906-914
[7]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[8]   A comparison of methods for multiclass support vector machines [J].
Hsu, CW ;
Lin, CJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (02) :415-425
[9]   Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks [J].
Khan, J ;
Wei, JS ;
Ringnér, M ;
Saal, LH ;
Ladanyi, M ;
Westermann, F ;
Berthold, F ;
Schwab, M ;
Antonescu, CR ;
Peterson, C ;
Meltzer, PS .
NATURE MEDICINE, 2001, 7 (06) :673-679
[10]  
LEE Y, 2002, 1064 U WISC DEP STAT