Using LogitBoost classifier to predict protein structural classes

被引:158
作者
Cai, YD
Feng, KY
Lu, WC
Chou, KC
机构
[1] Gordon Life Sci Inst, San Diego, CA 92130 USA
[2] Shanghai Univ, Dept Chem, Coll Sci, Shanghai 200436, Peoples R China
[3] Shanghai Ctr Bioinformat Technol, Shanghai 200235, Peoples R China
[4] Univ Manchester, Sch Med, Manchester M13 9PT, Lancs, England
关键词
protein structure classification; LogitBoost; support vector machines; amino acid composition;
D O I
10.1016/j.jtbi.2005.05.034
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Prediction of protein classification is an important topic in molecular biology. This is because it is able to not only provide useful information from the viewpoint of structure itself, but also greatly stimulate the characterization of many other features of proteins that may be closely correlated with their biological functions. In this paper, the LogitBoost, one of the boosting algorithms developed recently, is introduced for predicting protein structural classes. It performs classification using a regression scheme as the base learner, which can handle multi-class problems and is particularly superior in coping with noisy data. It was demonstrated that the LogitBoost outperformed the support vector machines in predicting the structural classes for a given dataset, indicating that the new classifier is very promising. It is anticipated that the power in predicting protein structural classes as well as many other biomacromolecular attributes will be further strengthened if the LogitBoost and some other existing algorithms can be effectively complemented with each other. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:172 / 176
页数:5
相关论文
共 60 条
[21]   Predicting protein quaternary structure by pseudo amino acid composition [J].
Chou, KC ;
Cai, YD .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2003, 53 (02) :282-289
[22]   A new hybrid approach to predict subcellular localization of proteins by incorporating gene ontology [J].
Chou, KC ;
Cai, YD .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 311 (03) :743-747
[23]  
Chou KC, 1999, PROTEINS, V34, P137, DOI 10.1002/(SICI)1097-0134(19990101)34:1<137::AID-PROT11>3.0.CO
[24]  
2-O
[25]   Prediction of enzyme family classes [J].
Chou, KC ;
Elrod, DW .
JOURNAL OF PROTEOME RESEARCH, 2003, 2 (02) :183-190
[26]   Using functional domain composition and support vector machines for prediction of protein subcellular location [J].
Chou, KC ;
Cai, YD .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2002, 277 (48) :45765-45769
[27]   Bioinformatical analysis of G-protein-coupled receptors [J].
Chou, KC ;
Elrod, DW .
JOURNAL OF PROTEOME RESEARCH, 2002, 1 (05) :429-433
[28]   Protein subcellular location prediction [J].
Chou, KC ;
Elrod, DW .
PROTEIN ENGINEERING, 1999, 12 (02) :107-118
[29]   PREDICTION OF PROTEIN STRUCTURAL CLASSES [J].
CHOU, KC ;
ZHANG, CT .
CRITICAL REVIEWS IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, 1995, 30 (04) :275-349
[30]   A NOVEL-APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE [J].
CHOU, KC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 21 (04) :319-344