ICGA-PSO-ELM Approach for Accurate Multiclass Cancer Classification Resulting in Reduced Gene Sets in Which Genes Encoding Secreted Proteins Are Highly Represented

Cited by: 90
Authors
Saraswathi, Saras [1]
Sundaram, Suresh [2]
Sundararajan, Narasimhan [3]
Zimmermann, Michael [4]
Nilsen-Hamilton, Marit [5]
Affiliations
[1] Iowa State Univ, Laurence H Baker Ctr Bioinformat & Biol Stat, Ames, IA 50011 USA
[2] Indian Inst Technol, Dept Elect Engn, New Delhi 110016, India
[3] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[4] Iowa State Univ, Bioinformat & Computat Biol Lab, Ames, IA 50011 USA
[5] Iowa State Univ, Dept Biochem Biophys & Mol Biol, Ames, IA 50011 USA
Keywords
Biology and genetics; classifier design and evaluation; feature evaluation and selection; neural nets; EXTREME LEARNING-MACHINE; MICROARRAY DATA; SVM-RFE; SELECTION; PREDICTION; ALGORITHMS; STRATEGY; SYSTEM
DOI
10.1109/TCBB.2010.13
Chinese Library Classification number
Q5 [Biochemistry]
Subject classification codes
071010; 081704
Abstract
A combination of Integer-Coded Genetic Algorithm (ICGA) and Particle Swarm Optimization (PSO), coupled with the neural-network-based Extreme Learning Machine (ELM), is used for gene selection and cancer classification. ICGA is used with PSO-ELM to select an optimal set of genes, which is then used to build a classifier, yielding an algorithm (ICGA-PSO-ELM) that can handle sparse data and sample imbalance. We evaluate the performance of ICGA-PSO-ELM and compare our results with existing methods in the literature. An investigation into the functions of the selected genes, using a systems biology approach, revealed that many of the identified genes are involved in cell signaling and proliferation. An analysis of these gene sets shows a larger representation of genes that encode secreted proteins than found in randomly selected gene sets. Secreted proteins constitute a major means by which cells interact with their surroundings. Mounting biological evidence has identified the tumor microenvironment as a critical factor that determines tumor survival and growth. Thus, the genes identified by this study that encode secreted proteins might provide important insights into the nature of the critical biological features in the microenvironment of each tumor type that allow these cells to thrive and proliferate.
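The abstract's comparison of selected gene sets against randomly selected ones can be illustrated with a hypergeometric tail probability. This is only a sketch of one common way to quantify over-representation; the abstract does not say which statistical test the authors used, and the function name, parameter names, and the toy counts below are hypothetical, not taken from the paper.

```python
from math import comb


def hypergeom_enrichment_pvalue(population, annotated, drawn, hits):
    """Return P(X >= hits) for a hypergeometric draw.

    population: total number of genes considered (hypothetical background)
    annotated:  genes in the background carrying the annotation,
                e.g. "encodes a secreted protein"
    drawn:      size of the selected gene set
    hits:       annotated genes observed in the selected set
    """
    if not (0 <= annotated <= population and 0 <= drawn <= population):
        raise ValueError("inconsistent counts")
    upper = min(annotated, drawn)       # largest possible number of hits
    total = comb(population, drawn)     # all equally likely gene sets
    tail = sum(
        comb(annotated, k) * comb(population - annotated, drawn - k)
        for k in range(max(hits, 0), upper + 1)
    )
    return tail / total


# Toy counts chosen purely for illustration: 10 genes, 3 annotated,
# a selected set of 2 genes containing at least 1 annotated gene.
p_value = hypergeom_enrichment_pvalue(10, 3, 2, 1)
print(p_value)
```

A small probability would indicate that a randomly drawn gene set of the same size rarely contains that many annotated genes, which is the sense in which a set "shows a larger representation" than random selection.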
Pages: 452-463
Page count: 12
Related papers
46 in total
[21]  
MICHALEWICZ Z, 1994, GENETIC ALGORITHM DA, P18
[22]   Stroma-epithelium crosstalk in prostate cancer [J].
Niu, Yi-Nong ;
Xia, Shu-Jie .
ASIAN JOURNAL OF ANDROLOGY, 2009, 11 (01) :28-35
[23]   Genetic algorithms applied to multi-class prediction for the analysis of gene expression data [J].
Ooi, CH ;
Tan, P .
BIOINFORMATICS, 2003, 19 (01) :37-44
[24]   Molecular classification of cancer types from microarray data using the combination of genetic algorithms and support vector machines [J].
Peng, SH ;
Xu, QH ;
Ling, XB ;
Peng, XN ;
Du, W ;
Chen, LB .
FEBS LETTERS, 2003, 555 (02) :358-362
[25]  
Piatetsky-Shapiro G., 2003, ACM SIGKDD Explor. Newsl., V5, P1, DOI 10.1145/980972.980974
[26]   Prediction of central nervous system embryonal tumour outcome based on gene expression [J].
Pomeroy, SL ;
Tamayo, P ;
Gaasenbeek, M ;
Sturla, LM ;
Angelo, M ;
McLaughlin, ME ;
Kim, JYH ;
Goumnerova, LC ;
Black, PM ;
Lau, C ;
Allen, JC ;
Zagzag, D ;
Olson, JM ;
Curran, T ;
Wetmore, C ;
Biegel, JA ;
Poggio, T ;
Mukherjee, S ;
Rifkin, R ;
Califano, A ;
Stolovitzky, G ;
Louis, DN ;
Mesirov, JP ;
Lander, ES ;
Golub, TR .
NATURE, 2002, 415 (6870) :436-442
[27]   Multiclass cancer diagnosis using tumor gene expression signatures [J].
Ramaswamy, S ;
Tamayo, P ;
Rifkin, R ;
Mukherjee, S ;
Yeang, CH ;
Angelo, M ;
Ladd, C ;
Reich, M ;
Latulippe, E ;
Mesirov, JP ;
Poggio, T ;
Gerald, W ;
Loda, M ;
Lander, ES ;
Golub, TR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (26) :15149-15154
[28]   Potential Novel Targets in Breast Cancer [J].
Rameshwar, Pranela .
CURRENT PHARMACEUTICAL BIOTECHNOLOGY, 2009, 10 (02) :148-153
[29]   Translation initiation site prediction on a genomic scale: beauty in simplicity [J].
Saeys, Yvan ;
Abeel, Thomas ;
Degroeve, Sven ;
Van de Peer, Yves .
BIOINFORMATICS, 2007, 23 (13) :I418-I423
[30]  
Schaffer J. D., 1992, COGANN-92. International Workshop on Combinations of Genetic Algorithms and Neural Networks (Cat. No.92TH0435-8), P1, DOI 10.1109/COGANN.1992.273950