Identification of informative genes and pathways using an improved penalized support vector machine with a weighting scheme

被引:12
作者
Chan, Weng Howe [1 ]
Mohamad, Mohd Saberi [1 ]
Deris, Safaai [2 ]
Zaki, Nazar [3 ]
Kasim, Shahreen [4 ]
Omatu, Sigeru [5 ]
Manuel Corchado, Juan [6 ]
Al Ashwal, Hany [3 ]
机构
[1] Univ Teknol Malaysia, Artificial Intelligence & Bioinformat Res Grp, Fac Comp, Skudai 81310, Johor, Malaysia
[2] Univ Malaysia Kelantan, Fac Creat Technol & Heritage, Locked Bag 01, Kota Baharu, Kelantan, Malaysia
[3] United Arab Emirate Univ, Coll Informat Technol, Al Ain 15551, U Arab Emirates
[4] Univ Tun Hussein Onn Malaysia, Fac Comp Sci & Informat Technol, Batu Pahat 86400, Malaysia
[5] Osaka Inst Technol, Dept Elect Informat & Commun Engn, Osaka 5358585, Japan
[6] Univ Salamanca, Biomed Res Inst Salamanca, BISITE Res Grp, Salamanca, Spain
关键词
Artificial intelligence; Bioinformatics; Informative genes; Pathway-based microarray analysis; Penalized support vector machine; Weighting scheme; Penalty function; CELL LUNG-CANCER; INCORPORATING PRIOR KNOWLEDGE; PLATINUM-BASED CHEMOTHERAPY; TUMOR-SUPPRESSOR P53; SET ENRICHMENT; FEATURE-SELECTION; MICROARRAY DATA; DOWN-REGULATION; MESSENGER-RNA; CYCLE ARREST;
D O I
10.1016/j.compbiomed.2016.08.004
中图分类号
Q [生物科学];
学科分类号
090105 [作物生产系统与生态工程];
摘要
Incorporation of pathway knowledge into microarray analysis has brought better biological interpretation of the analysis outcome. However, most pathway data are manually curated without specific biological context. Non-informative genes could be included when the pathway data is used for analysis of context specific data like cancer microarray data. Therefore, efficient identification of informative genes is inevitable. Embedded methods like penalized classifiers have been used for microarray analysis due to their embedded gene selection. This paper proposes an improved penalized support vector machine with absolute t-test weighting scheme to identify informative genes and pathways. Experiments are done on four microarray data sets. The results are compared with previous methods using 10-fold cross validation in terms of accuracy, sensitivity, specificity and F-score. Our method shows consistent improvement over the previous methods and biological validation has been done to elucidate the relation of the selected genes and pathway with the phenotype under study. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:102 / 115
页数:14
相关论文
共 137 条
[1]
The public road to high-quality curated biological pathways [J].
Adriaens, Michiel E. ;
Jaillard, Magali ;
Waagmeester, Andra ;
Coort, Susan L. M. ;
Pico, Alex R. ;
Evelo, Chris T. A. .
DRUG DISCOVERY TODAY, 2008, 13 (19-20) :856-862
[2]
Expression of glutathione S-transferase π and glutathione synthase correlates with survival in early stage non-small cell carcinomas of the lung [J].
Allen, Timothy C. ;
Granville, Laura A. ;
Cagle, Philip T. ;
Haque, Abida ;
Zander, Dani S. ;
Barrios, Roberto .
HUMAN PATHOLOGY, 2007, 38 (02) :220-227
[3]
RPS4Y gene family evolution in primates [J].
Andres, Olga ;
Kellermann, Thomas ;
Lopez-Giraldez, Francesc ;
Rozas, Julio ;
Domingo-Roura, Xavier ;
Bosch, Montserrat .
BMC EVOLUTIONARY BIOLOGY, 2008, 8 (1)
[4]
Tumor suppressor p53 is required to modulate BRCA1 expression [J].
Arizti, P ;
Fang, L ;
Park, I ;
Yin, YX ;
Solomon, E ;
Ouchi, T ;
Aaronson, SA ;
Lee, SW .
MOLECULAR AND CELLULAR BIOLOGY, 2000, 20 (20) :7450-7459
[5]
GeneTrail -: advanced gene set enrichment analysis [J].
Backes, Christina ;
Keller, Andreas ;
Kuentzer, Jan ;
Kneissl, Benny ;
Comtesse, Nicole ;
Elnakady, Yasser A. ;
Mueller, Rolf ;
Meese, Eckart ;
Lenhof, Hans-Peter .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W186-W192
[6]
Gene-expression profiles predict survival of patients with lung adenocarcinoma [J].
Beer, DG ;
Kardia, SLR ;
Huang, CC ;
Giordano, TJ ;
Levin, AM ;
Misek, DE ;
Lin, L ;
Chen, GA ;
Gharib, TG ;
Thomas, DG ;
Lizyness, ML ;
Kuick, R ;
Hayasaka, S ;
Taylor, JMG ;
Iannettoni, MD ;
Orringer, MB ;
Hanash, S .
NATURE MEDICINE, 2002, 8 (08) :816-824
[7]
Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[8]
A tumor suppressor function of Smurf2 associated with controlling chromatin landscape and genome stability through RNF20 [J].
Blank, Michael ;
Tang, Yi ;
Yamashita, Motozo ;
Burkett, Sandra S. ;
Cheng, Steven Y. ;
Zhang, Ying E. .
NATURE MEDICINE, 2012, 18 (02) :227-234
[9]
Mutations in DDX3X Are a Common Cause of Unexplained Intellectual Disability with Gender-Specific Effects on Wnt Signaling [J].
Blok, Lot Snijders ;
Madsen, Erik ;
Juusola, Jane ;
Gilissen, Christian ;
Baralle, Diana ;
Reijnders, Margot R. F. ;
Venselaar, Hanka ;
Helsmoorte, Celine ;
Cho, Megan T. ;
Hoischen, Alexander ;
Vissers, Lisenka E. L. M. ;
Koemans, Tom S. ;
Wissink-Lindhout, Willemijn ;
Eichler, Evan E. ;
Romano, Corrado ;
Van Esch, Hilde ;
Stumpel, Connie ;
Vreeburg, Maaike ;
Smeets, Eric ;
Obemdorff, Karin ;
van Bon, Bregje W. M. ;
Shaw, Marie ;
Gecz, Jozef ;
Haan, Eric ;
Bienek, Melanie ;
Jensen, Corinna ;
Loeys, Bart L. ;
Van Diick, Anke ;
Innes, A. Micheil ;
Racher, Hilary ;
Vermeer, Sascha ;
Di Donato, Nataliya ;
Rump, Andreas ;
Tatton-Brown, Katrina ;
Parker, Michael J. ;
Henderson, Alex ;
Lynch, Sally A. ;
Fryer, Alan ;
Ross, Alison ;
Vasudevan, Pradeep ;
Kini, Usha ;
Newbury-Ecob, Ruth ;
Chandler, Kate ;
Male, Alison ;
Dijkstra, Sybe ;
Schieving, Jolanda ;
Giltay, Jacques ;
Van Gassen, Koen L. I. ;
Schuurs-Hoeijmakers, Janneke ;
Tan, Perciliz L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2015, 97 (02) :343-352
[10]
Rac1 targeting suppresses p53 deficiency-mediated lymphomagenesis [J].
Bosco, Emily E. ;
Ni, Wenjun ;
Wang, Lei ;
Guo, Fukun ;
Johnson, James F. ;
Zheng, Yi .
BLOOD, 2010, 115 (16) :3320-3328