Network-based Prediction of Cancer under Genetic Storm

被引:3
作者
Ay, Ahmet [1 ,2 ]
Gong, Dihong [3 ]
Kahveci, Tamer [3 ]
机构
[1] Colgate Univ, Dept Math, Hamilton, NY 13346 USA
[2] Colgate Univ, Dept Biol, Hamilton, NY 13346 USA
[3] Univ Florida, Dept Comp & Informat Sci & Engn, Gainesville, FL 32611 USA
关键词
network-based cancer prediction; cancer classification; feature selection; comparison of classification techniques; comparison of feature selection techniques;
D O I
10.4137/CIN.S14025
中图分类号
R73 [肿瘤学];
学科分类号
100214 [肿瘤学];
摘要
Classification of cancer patients using traditional methods is a challenging task in the medical practice. Owing to rapid advances in microarray technologies, currently expression levels of thousands of genes from individual cancer patients can be measured. The classification of cancer patients by supervised statistical learning algorithms using the gene expression datasets provides an alternative to the traditional methods. Here we present a new network-based supervised classification technique, namely the NBC method. We compare NBC to five traditional classification techniques (support vector machines (SVM), k-nearest neighbor (kNN), naive Bayes (NB), C4.5, and random forest (RF)) using 50-300 genes selected by five feature selection methods. Our results on five large cancer datasets demonstrate that NBC method outperforms traditional classification techniques. Our analysis suggests that using symmetrical uncertainty (SU) feature selection method with NBC method provides the most accurate classification strategy. Finally, in-depth analysis of the correlation-based co-expression networks chosen by our network-based classifier in different cancer classes shows that there are drastic changes in the network models of different cancer types.
引用
收藏
页码:15 / 31
页数:17
相关论文
共 48 条
[1]
INSTANCE-BASED LEARNING ALGORITHMS [J].
AHA, DW ;
KIBLER, D ;
ALBERT, MK .
MACHINE LEARNING, 1991, 6 (01) :37-66
[2]
Graph-based methods for analysing networks in cell biology [J].
Aittokallio, Tero ;
Schwikowski, Benno .
BRIEFINGS IN BIOINFORMATICS, 2006, 7 (03) :243-255
[3]
Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[4]
Knowledge-based analysis of microarray gene expression data by using support vector machines [J].
Brown, MPS ;
Grundy, WN ;
Lin, D ;
Cristianini, N ;
Sugnet, CW ;
Furey, TS ;
Ares, M ;
Haussler, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) :262-267
[5]
Chowdhury SA, 2010, BIOCOMPUT-PAC SYM, P133
[6]
Network-based classification of breast cancer metastasis [J].
Chuang, Han-Yu ;
Lee, Eunjung ;
Liu, Yu-Tsueng ;
Lee, Doheon ;
Ideker, Trey .
MOLECULAR SYSTEMS BIOLOGY, 2007, 3 (1)
[7]
SUPPORT-VECTOR NETWORKS [J].
CORTES, C ;
VAPNIK, V .
MACHINE LEARNING, 1995, 20 (03) :273-297
[8]
Cristianini N., 2000, INTRO SUPPORT VECTOR, P189
[9]
Ding C., 2003, J BIOINF COMPUT BIOL, V3, P185, DOI DOI 10.1109/CSB.2003.1227396]
[10]
Research on collaborative negotiation for e-commerce. [J].
Feng, YQ ;
Lei, Y ;
Li, Y ;
Cao, RZ .
2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, :2085-2088