Bio-molecular cancer prediction with random subspace ensembles of support vector machines

被引:52
作者
Bertoni, A [1 ]
Folgieri, R [1 ]
Valentini, G [1 ]
机构
[1] Univ Milan, Dipartimento Sci Informaz, Milan, Italy
关键词
molecular classification of tumors; DNA microarray; ensemble of learning machines; random subspace; support vector machines;
D O I
10.1016/j.neucom.2004.07.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Support vector machines (SVMs), and other supervised learning techniques have been experimented for the bio-molecular diagnosis of malignancies, using also feature selection methods. The classification task is particularly difficult because of the high dimensionality and low cardinality of gene expression data. In this paper we investigate a different approach based on random subspace ensembles of SVMs: a set of base learners is trained and aggregated using subsets of features randomly drawn from the available DNA microarray data. Experimental results on the colon adenocarcinoma diagnosis and medulloblastoma clinical outcome prediction show the effectiveness of the proposed approach. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:535 / 539
页数:5
相关论文
共 12 条
  • [1] Towards a novel classification of human malignancies based on gene expression patterns
    Alizadeh, AA
    Ross, DT
    Perou, CM
    van de Rijn, M
    [J]. JOURNAL OF PATHOLOGY, 2001, 195 (01) : 41 - 52
  • [2] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [3] On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems
    Amaldi, E
    Kann, V
    [J]. THEORETICAL COMPUTER SCIENCE, 1998, 209 (1-2) : 237 - 260
  • [4] [Anonymous], 1961, Adaptive Control Processes: a Guided Tour, DOI DOI 10.1515/9781400874668
  • [5] Comparison of discrimination methods for the classification of tumors using gene expression data
    Dudoit, S
    Fridlyand, J
    Speed, TP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
  • [6] Support vector machine classification and validation of cancer tissue samples using microarray expression data
    Furey, TS
    Cristianini, N
    Duffy, N
    Bednarski, DW
    Schummer, M
    Haussler, D
    [J]. BIOINFORMATICS, 2000, 16 (10) : 906 - 914
  • [7] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    [J]. MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [8] Ho TK, 1998, IEEE T PATTERN ANAL, V20, P832, DOI 10.1109/34.709601
  • [9] Prediction of central nervous system embryonal tumour outcome based on gene expression
    Pomeroy, SL
    Tamayo, P
    Gaasenbeek, M
    Sturla, LM
    Angelo, M
    McLaughlin, ME
    Kim, JYH
    Goumnerova, LC
    Black, PM
    Lau, C
    Allen, JC
    Zagzag, D
    Olson, JM
    Curran, T
    Wetmore, C
    Biegel, JA
    Poggio, T
    Mukherjee, S
    Rifkin, R
    Califano, A
    Stolovitzky, G
    Louis, DN
    Mesirov, JP
    Lander, ES
    Golub, TR
    [J]. NATURE, 2002, 415 (6870) : 436 - 442
  • [10] Bagging, boosting and the random subspace method for linear classifiers
    Skurichina, M
    Duin, RPW
    [J]. PATTERN ANALYSIS AND APPLICATIONS, 2002, 5 (02) : 121 - 135