Feature selection combined with random subspace ensemble for gene expression based diagnosis of malignancies

被引：8

作者：

Bertoni, Alberto ^{[1
]}

Folgieri, Raffaella ^{[1
]}

Valentini, Giorgio ^{[1
]}

机构：

[1] Univ Milan, DSI, Dipartimento Sci Informaz, I-20135 Milan, Italy

来源：

Biological and Artificial Intelligence Environments | 2005年

关键词：

molecular diagnosis; ensemble methods; support vector machine; random subspace; DNA microarray;

D O I：

10.1007/1-4020-3432-6_4

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The bio-molecular diagnosis of malignancies represents a difficult learning task, because of the high dimensionality and low cardinality of the data. Many supervised learning techniques, among them support vector machines, have been experimented, using also feature selection methods to reduce the dimensionality of the data. In alternative to feature selection methods, we proposed to apply random subspace ensembles, reducing the dimensionality of the data by randomly sampling subsets of features and improving accuracy by aggregating the resulting base classifiers. In this paper we experiment the combination of random subspace with feature selection methods, showing preliminary experimental results that seem to confirm the effectiveness of the proposed approach.

引用

页码：29 / 35

页数：7

共 13 条

[1] Towards a novel classification of human malignancies based on gene expression patterns
Alizadeh, AA
Ross, DT
Perou, CM
van de Rijn, M
[J]. JOURNAL OF PATHOLOGY, 2001, 195 (01) : 41 - 52
[2] Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays
Alon, U
Barkai, N
Notterman, DA
Gish, K
Ybarra, S
Mack, D
Levine, AJ
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) : 6745 - 6750
[3] Selection bias in gene extraction on the basis of microarray gene-expression data
Ambroise, C
McLachlan, GJ
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) : 6562 - 6566
[4] ARLANDINI C, 2004, B CILEA, V91
[5] BERTONI A, 2004, RANDOM SUBSPACE ENSE
[6] Approximate statistical tests for comparing supervised classification learning algorithms
Dietterich, TG
[J]. NEURAL COMPUTATION, 1998, 10 (07) : 1895 - 1923
[7] Comparison of discrimination methods for the classification of tumors using gene expression data
Dudoit, S
Fridlyand, J
Speed, TP
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
[8] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
Golub, TR
Slonim, DK
Tamayo, P
Huard, C
Gaasenbeek, M
Mesirov, JP
Coller, H
Loh, ML
Downing, JR
Caligiuri, MA
Bloomfield, CD
Lander, ES
[J]. SCIENCE, 1999, 286 (5439) : 531 - 537
[9] Gene selection for cancer classification using support vector machines
Guyon, I
Weston, J
Barnhill, S
Vapnik, V
[J]. MACHINE LEARNING, 2002, 46 (1-3) : 389 - 422
[10] Guyon I., 2003, J MACH LEARN RES, V3, P1157, DOI [DOI 10.1162/153244303322753616, 10.1016/j.aca.2011.07.027, DOI 10.1016/J.ACA.2011.07.027]

← 1 2 →