Using Chou's pseudo amino acid composition to predict subcellular localization of apoptosis proteins: An approach with immune genetic algorithm-based ensemble classifier

被引:212
作者
Ding, Yong-Sheng [1 ,2 ]
Zhang, Tong-Liang [1 ]
机构
[1] Donghua Univ, Coll Informat Sci & Technol, Shanghai 201620, Peoples R China
[2] Minist Educ China, Engn Res Ctr Digitized Text & Fash Technol, Shanghai 201620, Peoples R China
关键词
apoptosis protein subcellular location; pseudo amino acid composition; approximate entropy; ensemble classifier; fuzzy K-nearest neighbor classifier;
D O I
10.1016/j.patrec.2008.06.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
It is crucial to develop powerful tools to predict apoptosis protein locations for rapidly increasing gap between the number of known structural proteins and the number of known sequences in protein data-bank. In this study, based on the concept of pseudo amino acid (PseAA) composition originally introduced by Chou, a novel approximate entropy (ApEn) based PseAA composition is proposed to represent apoptosis protein sequences. An ensemble classifier is introduced, of which the basic classifier is the FKNN (fuzzy K-nearest neighbor) one, as prediction engine. Each basic classifier is trained in different dimensions of PseAA composition of protein sequences. The immune genetic algorithm (IGA) is used to search the optimal weight factors in generating the PseAA composition for crucial of weight factors in PseAA composition. The results obtained by jackknife test are quite encouraging, indicating that the proposed method might become a potentially useful tool for protein function, or at least can play a complimentary role to the existing methods in the relevant areas. (C) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:1887 / 1892
页数:6
相关论文
共 84 条
[1]
The Bcl-2 protein family: Arbiters of cell survival [J].
Adams, JM ;
Cory, S .
SCIENCE, 1998, 281 (5381) :1322-1326
[2]
ARGOS P, 1982, EUR J BIOCHEM, V128, P565
[3]
Predicting protein subcellular locations using hierarchical ensemble of Bayesian classifiers based on Markov chains [J].
Bulashevska, Alla ;
Eils, Roland .
BMC BIOINFORMATICS, 2006, 7 (1)
[4]
Nearest neighbour algorithm for predicting protein subcellular location by combining functional domain composition and pseudo-amino acid composition [J].
Cai, YD ;
Chou, KC .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 305 (02) :407-411
[5]
Support vector machines for predicting membrane protein types by using functional domain composition [J].
Cai, YD ;
Zhou, GP ;
Chou, KC .
BIOPHYSICAL JOURNAL, 2003, 84 (05) :3257-3263
[6]
Using pseudo-amino acid composition and support vector machine to predict protein structural class [J].
Chen, Chao ;
Tian, Yuan-Xin ;
Zou, Xiao-Yong ;
Cai, Pei-Xiang ;
Mo, Jin-Yuan .
JOURNAL OF THEORETICAL BIOLOGY, 2006, 243 (03) :444-448
[7]
Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network [J].
Chen, Chao ;
Zhou, Xibin ;
Tian, Yuanxin ;
Zou, Xiaoyong ;
Cai, Peixiang .
ANALYTICAL BIOCHEMISTRY, 2006, 357 (01) :116-121
[8]
Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition [J].
Chen, Ying-Li ;
Li, Qian-Zhong .
JOURNAL OF THEORETICAL BIOLOGY, 2007, 248 (02) :377-381
[9]
Prediction of the subcellular location of apoptosis proteins [J].
Chen, Ying-Li ;
Li, Qian-Zhong .
JOURNAL OF THEORETICAL BIOLOGY, 2007, 245 (04) :775-783
[10]
Solution structure of the RAIDD CARD and model for CARD/CARD interaction in caspase-2 and caspase-9 recruitment [J].
Chou, JJ ;
Matsuo, H ;
Duan, H ;
Wagner, G .
CELL, 1998, 94 (02) :171-180