Minimum Number of Genes for Microarray Feature Selection

被引:3
作者
Baralis, Elena [1 ]
Bruno, Giulia [1 ]
Fiori, Alessandro [1 ]
机构
[1] Politecn Torino, Turin, Italy
来源
2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8 | 2008年
关键词
D O I
10.1109/IEMBS.2008.4650506
中图分类号
R318 [生物医学工程];
学科分类号
0831 [生物医学工程];
摘要
A fundamental problem in microarray analysis is to identify relevant genes from large amounts of expression data. Feature selection aims at identifying a subset of features for building robust learning models. However, finding the optimal number of features is a challenging problem, as it is a trade off between information loss when pruning excessively and noise increase when pruning is too weak. This paper presents a novel representation of genes as strings of bits and a method which automatically selects the minimum number of genes to reach a good classification accuracy on the training set. Our method first eliminates redundant features, which do not add further information for classification, then it exploits a set covering algorithm. Preliminary experimental results on public datasets confirm the intuition of the proposed method leading to high classification accuracy.
引用
收藏
页码:5692 / 5695
页数:4
相关论文
共 16 条
[1]
APILETTI D, 2007, P IEEE C ENG MED BIO
[2]
LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[3]
Monte Carlo feature selection for supervised classification [J].
Draminski, Michal ;
Rada-Iglesias, Alvaro ;
Enroth, Stefan ;
Wadelius, Claes ;
Koronacki, Jacek ;
Komorowski, Jan .
BIOINFORMATICS, 2008, 24 (01) :110-117
[4]
Semisupervised learning for molecular profiling [J].
Furlanello, C ;
Serafini, M ;
Merler, S ;
Jurman, G .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2005, 2 (02) :110-118
[5]
HALL MA, 2003, IEEE T KNOWLEDGE DAT, V15
[6]
Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data [J].
Jeffery, Ian B. ;
Higgins, Desmond G. ;
Culhane, Aedin C. .
BMC BIOINFORMATICS, 2006, 7 (1)
[7]
Lê Cao KA, 2007, STAT APPL GENET MOL, V6
[8]
Li W., 2002, Methods of Microarray Data Analysis, P137
[9]
Prediction of cancer outcome with microarrays: a multiple random validation strategy [J].
Michiels, S ;
Koscielny, S ;
Hill, C .
LANCET, 2005, 365 (9458) :488-492
[10]
Ni B, 2004, PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, P2537