ICGA-PSO-ELM Approach for Accurate Multiclass Cancer Classification Resulting in Reduced Gene Sets in Which Genes Encoding Secreted Proteins Are Highly Represented

被引:90
作者
Saraswathi, Saras [1 ]
Sundaram, Suresh [2 ]
Sundararajan, Narasimhan [3 ]
Zimmermann, Michael [4 ]
Nilsen-Hamilton, Marit [5 ]
机构
[1] Iowa State Univ, Laurence H Baker Ctr Bioinformat & Biol Stat, Ames, IA 50011 USA
[2] Indian Inst Technol, Dept Elect Engn, New Delhi 110016, India
[3] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[4] Iowa State Univ, Bioinformat & Computat Biol Lab, Ames, IA 50011 USA
[5] Iowa State Univ, Dept Biochem Biophys & Mol Biol, Ames, IA 50011 USA
关键词
Biology and genetics; classifier design and evaluation; feature evaluation and selection; neural nets; EXTREME LEARNING-MACHINE; MICROARRAY DATA; SVM-RFE; SELECTION; PREDICTION; ALGORITHMS; STRATEGY; SYSTEM;
D O I
10.1109/TCBB.2010.13
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
A combination of Integer-Coded Genetic Algorithm (ICGA) and Particle Swarm Optimization (PSO), coupled with the neural-network-based Extreme Learning Machine (ELM), is used for gene selection and cancer classification. ICGA is used with PSO-ELM to select an optimal set of genes, which is then used to build a classifier to develop an algorithm (ICGA_PSO_ELM) that can handle sparse data and sample imbalance. We evaluate the performance of ICGA-PSO-ELM and compare our results with existing methods in the literature. An investigation into the functions of the selected genes, using a systems biology approach, revealed that many of the identified genes are involved in cell signaling and proliferation. An analysis of these gene sets shows a larger representation of genes that encode secreted proteins than found in randomly selected gene sets. Secreted proteins constitute a major means by which cells interact with their surroundings. Mounting biological evidence has identified the tumor microenvironment as a critical factor that determines tumor survival and growth. Thus, the genes identified by this study that encode secreted proteins might provide important insights to the nature of the critical biological features in the microenvironment of each tumor type that allow these cells to thrive and proliferate.
引用
收藏
页码:452 / 463
页数:12
相关论文
共 46 条
[31]   A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis [J].
Statnikov, A ;
Aliferis, CF ;
Tsamardinos, I ;
Hardin, D ;
Levy, S .
BIOINFORMATICS, 2005, 21 (05) :631-643
[32]   Gene selection in microarray data: the elephant, the blind men and our algorithms [J].
Stolovitzky, G .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2003, 13 (03) :370-376
[33]   Risk-sensitive loss functions for sparse multi-category classification problems [J].
Suresh, S. ;
Sundararajan, N. ;
Saratchandran, P. .
INFORMATION SCIENCES, 2008, 178 (12) :2621-2638
[34]   A sequential multi-category classifier using radial basis function networks [J].
Suresh, S. ;
Sundararajan, N. ;
Saratchandran, P. .
NEUROCOMPUTING, 2008, 71 (7-9) :1345-1358
[35]   Performance enhancement of extreme learning machine for multi-category sparse data classification problems [J].
Suresh, S. ;
Saraswathi, S. ;
Sundararajan, N. .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2010, 23 (07) :1149-1157
[36]   17 Divisible load scheduling in distributed system with buffer constraints: genetic algorithm and linear programming approach [J].
Suresh, S. ;
Mani, V. ;
Omkar, S. N. ;
Kim, H. J. .
INTERNATIONAL JOURNAL OF PARALLEL EMERGENT AND DISTRIBUTED SYSTEMS, 2006, 21 (05) :303-321
[37]   No-reference image quality assessment using modified extreme learning machine classifier [J].
Suresh, S. ;
Babu, R. Venkatesh ;
Kim, H. J. .
APPLIED SOFT COMPUTING, 2009, 9 (02) :541-552
[38]   Development of two-stage SVM-RFE gene selection strategy for microarray expression data analysis [J].
Tang, Yuchun ;
Zhang, Yan-Qing ;
Huang, Zhen .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (03) :365-381
[39]   Reactome: a knowledge base of biologic pathways and processes [J].
Vastrik, Imre ;
D'Eustachio, Peter ;
Schmidt, Esther ;
Joshi-Tope, Geeta ;
Gopinath, Gopal ;
Croft, David ;
de Bono, Bernard ;
Gillespie, Marc ;
Jassal, Bijay ;
Lewis, Suzanna ;
Matthews, Lisa ;
Wu, Guanming ;
Birney, Ewan ;
Stein, Lincoln .
GENOME BIOLOGY, 2007, 8 (03)
[40]   Accurate cancer classification using expressions of very few genes [J].
Wang, Lipo ;
Chu, Feng ;
Xie, Wei .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2007, 4 (01) :40-53