Progress in protein structural class prediction and its impact to bioinformatics and proteomics

被引:133
作者
Chou, KC [1 ]
机构
[1] Gordon Life Sci Inst, San Diego, CA 92130 USA
关键词
all-alpha; all-beta; alpha/beta; mu (multi-domain); sigma (small protein); p (peptide); pseudo amino acid composition; functional domain composition; FunD-PseAA predictor; GO-PseAA predictor; GO-FunD-PseAA predictor;
D O I
10.2174/138920305774329368
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The structural class is an important attribute used to characterize the overall folding type of a protein or its domain. Since the concept of protein structural class was developed about 3 decades ago based on a visual inspection of polypeptide chain topologies in a dataset of only 31 gloular proteins, the number of structure-known proteins has been increased rapidly. For example, as of 12-July-2005, the entries deposited into RCSB PDB Protein Data Bank for proteins, peptides, and viruses whose 3-dimensional structures were determined by X-ray and NMR techniques have been increased to 28,920. To properly cover more and more structure-known proteins, some modification and expansion from the original structural classification scheme have been developed. Meanwhile, many different approaches have been proposed for predicting the structural class of proteins. In this review, the new classification schemes are briefly introduced. The attention is focused on the progress in structural class prediction and its impact in stimulating the development of identifying the other attributes of proteins. It is interesting to point out that the development of the latter has actually in turn greatly enriched the power of the former. Also, some promising approaches for the further development of protein structural class prediction are also addressed.
引用
收藏
页码:423 / 436
页数:14
相关论文
共 105 条
[1]   The InterPro database, an integrated documentation resource for protein families, domains and functional sites [J].
Apweiler, R ;
Attwood, TK ;
Bairoch, A ;
Bateman, A ;
Birney, E ;
Biswas, M ;
Bucher, P ;
Cerutti, T ;
Corpet, F ;
Croning, MDR ;
Durbin, R ;
Falquet, L ;
Fleischmann, W ;
Gouzy, J ;
Hermjakob, H ;
Hulo, N ;
Jonassen, I ;
Kahn, D ;
Kanapin, A ;
Karavidopoulou, Y ;
Lopez, R ;
Marx, B ;
Mulder, NJ ;
Oinn, TM ;
Pagni, M ;
Servant, F ;
Sigrist, CJA ;
Zdobnov, EM .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :37-40
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]  
Bahar I, 1997, PROTEINS, V29, P172, DOI 10.1002/(SICI)1097-0134(199710)29:2<172::AID-PROT5>3.3.CO
[4]  
2-D
[5]   The SWISS-PROT protein sequence data bank and its supplement TrEMBL [J].
Bairoch, A ;
Apweller, R .
NUCLEIC ACIDS RESEARCH, 1997, 25 (01) :31-36
[6]   Predicting 22 protein localizations in budding yeast [J].
Cai, YD ;
Chou, KC .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2004, 323 (02) :425-428
[7]   Predicting subcellular localization of proteins in a hybridization space [J].
Cai, YD ;
Chou, KC .
BIOINFORMATICS, 2004, 20 (07) :1151-1156
[8]   Nearest neighbour algorithm for predicting protein subcellular location by combining functional domain composition and pseudo-amino acid composition [J].
Cai, YD ;
Chou, KC .
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2003, 305 (02) :407-411
[9]   Support vector machines for predicting membrane protein types by using functional domain composition [J].
Cai, YD ;
Zhou, GP ;
Chou, KC .
BIOPHYSICAL JOURNAL, 2003, 84 (05) :3257-3263
[10]   Prediction of protein structural classes by support vector machines [J].
Cai, YD ;
Liu, XJ ;
Xu, XB ;
Chou, KC .
COMPUTERS & CHEMISTRY, 2002, 26 (03) :293-296