Multi-platform, multi-site, microarray-based human tumor classification

被引:142
作者
Bloom, G
Yang, IV
Boulware, D
Kwong, KY
Coppola, D
Eschrich, S
Quackenbush, J
Yeatman, TJ
机构
[1] Univ S Florida, H Lee Moffitt Canc Ctr, Dept Interdisciplinary Oncol, Tampa, FL 33612 USA
[2] Inst Genom Res, Rockville, MD USA
关键词
D O I
10.1016/S0002-9440(10)63090-8
中图分类号
R36 [病理学];
学科分类号
100104 ;
摘要
The introduction of gene expression profiling has resulted in the production of rich human data sets with potential for deciphering tumor diagnosis, prognosis, and therapy. Here we demonstrate how artificial neural networks (ANNs) can be applied to two completely different microarray platforms (cDNA and oligonucleotide), or a combination of both, to build tumor classifiers capable of deciphering the identity of most human cancers. First, 78 tumors representing eight different types of histologically similar adenocarcinoma, were evaluated with a 32k cDNA microarray and correctly classified by a cDNA-based ANN, using independent training and test sets, with a mean accuracy of 83%. To expand our approach, oligonucleotide data derived from six independent performance sites, representing 463 tumors and 21 tumor types, were assembled, normalized, and scaled. An oligonucleotide-based ANN, trained on a random fraction of the tumors (n = 343), was 88% accurate in predicting known pathological origin of the remaining fraction of tumors (n = 120) not exposed to the training algorithm. Finally, a mixed-platform classifier using a combination of both cDNA and of oligonucleotide microarray data from seven performance sites, normalized and scaled from a large and diverse tumor set (n = 539), produced similar results (85% accuracy) on independent test sets. Further validation of our classifiers was achieved by accurately (84%) predicting the known primary site of origin for an independent set of metastatic lesions (n = 50), resected from brain, lung, and liver, potentially addressing the vexing classification problems imposed by unknown primary cancers. These cDNA- and oligonucleotide-based classifiers provide a first proof of principle that data derived from multiple platforms and performance sites can be exploited to build multi-tissue tumor classifiers.
引用
收藏
页码:9 / 16
页数:8
相关论文
共 25 条
[1]   PATTERN-RECOGNITION BY AN ARTIFICIAL NETWORK DERIVED FROM BIOLOGIC NEURONAL SYSTEMS [J].
ALKON, DL ;
BLACKWELL, KT ;
BARBOUR, GS ;
RIGLER, AK ;
VOGL, TP .
BIOLOGICAL CYBERNETICS, 1990, 62 (05) :363-376
[2]  
[Anonymous], P 1988 CONN MOD SUMM
[3]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[4]   Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring [J].
Golub, TR ;
Slonim, DK ;
Tamayo, P ;
Huard, C ;
Gaasenbeek, M ;
Mesirov, JP ;
Coller, H ;
Loh, ML ;
Downing, JR ;
Caligiuri, MA ;
Bloomfield, CD ;
Lander, ES .
SCIENCE, 1999, 286 (5439) :531-537
[5]   Gene-expression profiles in hereditary breast cancer. [J].
Hedenfalk, I ;
Duggan, D ;
Chen, YD ;
Radmacher, M ;
Bittner, M ;
Simon, R ;
Meltzer, P ;
Gusterson, B ;
Esteller, M ;
Kallioniemi, OP ;
Wilfond, B ;
Borg, Å ;
Trent, J ;
Raffeld, M ;
Yakhini, Z ;
Ben-Dor, A ;
Dougherty, E ;
Kononen, J ;
Bubendorf, L ;
Fehrle, W ;
Pittaluga, S ;
Gruvberger, S ;
Loman, N ;
Johannsoson, O ;
Olsson, H ;
Sauter, G .
NEW ENGLAND JOURNAL OF MEDICINE, 2001, 344 (08) :539-548
[6]   A concise guide to cDNA microarray analysis [J].
Hegde, P ;
Qi, R ;
Abernathy, K ;
Gay, C ;
Dharap, S ;
Gaspard, R ;
Hughes, JE ;
Snesrud, E ;
Lee, N ;
Quackenbush, J .
BIOTECHNIQUES, 2000, 29 (03) :548-+
[7]   Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks [J].
Khan, J ;
Wei, JS ;
Ringnér, M ;
Saal, LH ;
Ladanyi, M ;
Westermann, F ;
Berthold, F ;
Schwab, M ;
Antonescu, CR ;
Peterson, C ;
Meltzer, PS .
NATURE MEDICINE, 2001, 7 (06) :673-679
[8]   Microalbuminuria identifies overall cardiovascular risk in essential hypertension: an artificial neural network-based approach [J].
Leoncini, G ;
Sacchi, G ;
Viazzi, F ;
Ravera, M ;
Parodi, D ;
Ratto, E ;
Vettoretti, S ;
Tomolillo, C ;
Deferrari, G ;
Pontremoli, R .
JOURNAL OF HYPERTENSION, 2002, 20 (07) :1315-1321
[9]  
MULSANT BH, 1990, M D COMPUT, V7, P25
[10]  
Nakhleh RE, 1998, ARCH PATHOL LAB MED, V122, P303