Multiclass cancer diagnosis using tumor gene expression signatures

被引:1418
作者
Ramaswamy, S
Tamayo, P
Rifkin, R
Mukherjee, S
Yeang, CH
Angelo, M
Ladd, C
Reich, M
Latulippe, E
Mesirov, JP
Poggio, T
Gerald, W
Loda, M
Lander, ES
Golub, TR
机构
[1] Harvard Univ, Sch Med, Dana Farber Canc Inst, Dept Adult Oncol, Boston, MA 02115 USA
[2] Harvard Univ, Sch Med, Dana Farber Canc Inst, Dept Pediat Oncol, Boston, MA 02115 USA
[3] MIT, Whitehead Inst, Ctr Genome Res, Cambridge, MA 02138 USA
[4] Brigham & Womens Hosp, Dept Pathol, Boston, MA 02115 USA
[5] Mem Sloan Kettering Canc Ctr, Dept Pathol, New York, NY 10021 USA
[6] MIT, Dept Biol, Cambridge, MA 02139 USA
[7] MIT, McGovern Inst, Ctr Brain & Computat Learning, Cambridge, MA 02139 USA
[8] MIT, Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
D O I
10.1073/pnas.211566398
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The optimal treatment of patients with cancer depends on establishing accurate diagnoses by using a complex combination of clinical and histopathological data. In some instances, this task is difficult or impossible because of atypical clinical presentation or histopathology. To determine whether the diagnosis of multiple common adult malignancies could be achieved purely by molecular classification, we subjected 218 tumor samples, spanning 14 common tumor types, and 90 normal tissue samples to oligonucleotide microarray gene expression analysis. The expression levels of 16,063 genes and expressed sequence tags were used to evaluate the accuracy of a multiclass classifier based on a support vector machine algorithm. Overall classification accuracy was 78%, far exceeding the accuracy of random classification (91%). Poorly differentiated cancers resulted in low-confidence predictions and could not be accurately classified according to their tissue of origin, indicating that they are molecularly distinct entities with dramatically different gene expression patterns compared with their well differentiated counterparts. Taken together, these results demonstrate the feasibility of accurate, multiclass molecular cancer classification and suggest a strategy for future clinical implementation of molecular cancer diagnostics.
引用
收藏
页码:15149 / 15154
页数:6
相关论文
共 31 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] [Anonymous], GENOME BIOL
  • [3] Linking colorectal cancer to Wnt signaling
    Bienz, M
    Clevers, H
    [J]. CELL, 2000, 103 (02) : 311 - 320
  • [4] Molecular classification of cutaneous malignant melanoma by gene expression profiling
    Bittner, M
    Meitzer, P
    Chen, Y
    Jiang, Y
    Seftor, E
    Hendrix, M
    Radmacher, M
    Simon, R
    Yakhini, Z
    Ben-Dor, A
    Sampas, N
    Dougherty, E
    Wang, E
    Marincola, F
    Gooden, C
    Lueders, J
    Glatfelter, A
    Pollock, P
    Carpten, J
    Gillanders, E
    Leja, D
    Dietrich, K
    Beaudry, C
    Berens, M
    Alberts, D
    Sondak, V
    Hayward, N
    Trent, J
    [J]. NATURE, 2000, 406 (6795) : 536 - 540
  • [5] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [6] CHAPELLE O, 2002, IN PRESS MACH LEARN
  • [7] CONNOLLY JL, 1997, CANC MED, P533
  • [8] DASARATHY VB, 1991, NN PATTERN CLASSIFIC
  • [9] Delineation of prognostic biomarkers in prostate cancer
    Dhanasekaran, SM
    Barrette, TR
    Ghosh, D
    Shah, R
    Varambally, S
    Kurachi, K
    Pienta, KJ
    Rubin, MA
    Chinnaiyan, AM
    [J]. NATURE, 2001, 412 (6849) : 822 - 826
  • [10] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868