Feature selection combined with random subspace ensemble for gene expression based diagnosis of malignancies

被引:8
作者
Bertoni, Alberto [1 ]
Folgieri, Raffaella [1 ]
Valentini, Giorgio [1 ]
机构
[1] Univ Milan, DSI, Dipartimento Sci Informaz, I-20135 Milan, Italy
来源
Biological and Artificial Intelligence Environments | 2005年
关键词
molecular diagnosis; ensemble methods; support vector machine; random subspace; DNA microarray;
D O I
10.1007/1-4020-3432-6_4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The bio-molecular diagnosis of malignancies represents a difficult learning task, because of the high dimensionality and low cardinality of the data. Many supervised learning techniques, among them support vector machines, have been experimented, using also feature selection methods to reduce the dimensionality of the data. In alternative to feature selection methods, we proposed to apply random subspace ensembles, reducing the dimensionality of the data by randomly sampling subsets of features and improving accuracy by aggregating the resulting base classifiers. In this paper we experiment the combination of random subspace with feature selection methods, showing preliminary experimental results that seem to confirm the effectiveness of the proposed approach.
引用
收藏
页码:29 / 35
页数:7
相关论文
共 13 条
  • [1] Towards a novel classification of human malignancies based on gene expression patterns
    Alizadeh, AA
    Ross, DT
    Perou, CM
    van de Rijn, M
    [J]. JOURNAL OF PATHOLOGY, 2001, 195 (01) : 41 - 52
  • [2] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
    Alon, U
    Barkai, N
    Notterman, DA
    Gish, K
    Ybarra, S
    Mack, D
    Levine, AJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
  • [3] Selection bias in gene extraction on the basis of microarray gene-expression data
    Ambroise, C
    McLachlan, GJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) : 6562 - 6566
  • [4] ARLANDINI C, 2004, B CILEA, V91
  • [5] BERTONI A, 2004, RANDOM SUBSPACE ENSE
  • [6] Approximate statistical tests for comparing supervised classification learning algorithms
    Dietterich, TG
    [J]. NEURAL COMPUTATION, 1998, 10 (07) : 1895 - 1923
  • [7] Comparison of discrimination methods for the classification of tumors using gene expression data
    Dudoit, S
    Fridlyand, J
    Speed, TP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
  • [8] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [9] Gene selection for cancer classification using support vector machines
    Guyon, I
    Weston, J
    Barnhill, S
    Vapnik, V
    [J]. MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
  • [10] Guyon I., 2003, J MACH LEARN RES, V3, P1157, DOI [DOI 10.1162/153244303322753616, 10.1016/j.aca.2011.07.027, DOI 10.1016/J.ACA.2011.07.027]