Using LogitBoost classifier to predict protein structural classes

被引:158
作者
Cai, YD
Feng, KY
Lu, WC
Chou, KC
机构
[1] Gordon Life Sci Inst, San Diego, CA 92130 USA
[2] Shanghai Univ, Dept Chem, Coll Sci, Shanghai 200436, Peoples R China
[3] Shanghai Ctr Bioinformat Technol, Shanghai 200235, Peoples R China
[4] Univ Manchester, Sch Med, Manchester M13 9PT, Lancs, England
关键词
protein structure classification; LogitBoost; support vector machines; amino acid composition;
D O I
10.1016/j.jtbi.2005.05.034
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Prediction of protein classification is an important topic in molecular biology. This is because it is able to not only provide useful information from the viewpoint of structure itself, but also greatly stimulate the characterization of many other features of proteins that may be closely correlated with their biological functions. In this paper, the LogitBoost, one of the boosting algorithms developed recently, is introduced for predicting protein structural classes. It performs classification using a regression scheme as the base learner, which can handle multi-class problems and is particularly superior in coping with noisy data. It was demonstrated that the LogitBoost outperformed the support vector machines in predicting the structural classes for a given dataset, indicating that the new classifier is very promising. It is anticipated that the power in predicting protein structural classes as well as many other biomacromolecular attributes will be further strengthened if the LogitBoost and some other existing algorithms can be effectively complemented with each other. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:172 / 176
页数:5
相关论文
共 60 条
[1]  
Bahar I, 1997, PROTEINS, V29, P172, DOI 10.1002/(SICI)1097-0134(199710)29:2<172::AID-PROT5>3.3.CO
[2]  
2-D
[3]  
Breiman L, 1998, ANN STAT, V26, P801
[4]   Knowledge-based analysis of microarray gene expression data by using support vector machines [J].
Brown, MPS ;
Grundy, WN ;
Lin, D ;
Cristianini, N ;
Sugnet, CW ;
Furey, TS ;
Ares, M ;
Haussler, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) :262-267
[5]   Prediction of protein structural classes by neural network [J].
Cai, YD ;
Zhou, GP .
BIOCHIMIE, 2000, 82 (08) :783-785
[6]   Identify catalytic triads of serine hydrolases by support vector machines [J].
Cai, YD ;
Zhou, GP ;
Jen, CH ;
Lin, SL ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2004, 228 (04) :551-557
[7]   Application of SVM to predict membrane protein types [J].
Cai, YD ;
Ricardo, PW ;
Jen, CH ;
Chou, KC .
JOURNAL OF THEORETICAL BIOLOGY, 2004, 226 (04) :373-376
[8]   Support vector machines for predicting membrane protein types by using functional domain composition [J].
Cai, YD ;
Zhou, GP ;
Chou, KC .
BIOPHYSICAL JOURNAL, 2003, 84 (05) :3257-3263
[9]   Using neural networks for prediction of domain structural classes [J].
Cai, YD ;
Li, YX ;
Chou, KC .
BIOCHIMICA ET BIOPHYSICA ACTA-PROTEIN STRUCTURE AND MOLECULAR ENZYMOLOGY, 2000, 1476 (01) :1-2
[10]   Prediction of membrane protein types by incorporating amphipathic effects [J].
Chou, KC ;
Cai, YD .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2005, 45 (02) :407-413