A NOVEL APPROACH TO PREDICTING PROTEIN STRUCTURAL CLASSES IN A (20-1)-D AMINO-ACID-COMPOSITION SPACE

Cited by: 428
Author
CHOU, KC
Affiliation
[1] Upjohn Laboratories, Kalamazoo, Michigan
Source
PROTEINS-STRUCTURE FUNCTION AND GENETICS | 1995 / Vol. 21 / No. 04
Keywords
ALPHA PROTEIN; BETA PROTEIN; ALPHA+BETA PROTEIN; ALPHA/BETA PROTEIN; MAHALANOBIS DISTANCE; SEED-PROPAGATED SAMPLING; JACKKNIFE ANALYSIS
DOI
10.1002/prot.340210406
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The development of prediction methods based on statistical theory generally consists of two parts: one focuses on the exploration of new algorithms, and the other on the improvement of the training database. The current study is devoted to improving the prediction of protein structural classes in both respects. To explore a new algorithm, a method has been developed that accounts for the coupling effect among the different amino acid components of a protein by means of a covariance matrix. To improve the training database, proteins are selected so that they have (1) as many nonhomologous structures as possible, and (2) good structural quality. Thus, 129 representative proteins are selected. They are classified into 30 alpha, 30 beta, 30 alpha + beta, 30 alpha/beta, and 9 zeta (irregular) proteins according to a new criterion that better reflects the features of the structural classes concerned. The average accuracy of prediction by the current method for the 4 × 30 regular proteins is 99.2%, and that for 64 independent testing proteins not included in the training database is 95.3%. To further validate its efficiency, a jackknife analysis has been performed for the current method as well as the previous ones, and the results also strongly favor the current method. To complete the mathematical basis, a theorem is presented and proved in Appendix A that is instructive for understanding the novel method at a deeper level. © 1995 Wiley-Liss, Inc.
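The abstract and keywords name three ingredients: a reduced (20-1)-dimensional composition space, a covariance matrix combined with the Mahalanobis distance, and a jackknife analysis. The sketch below shows one way these pieces could fit together. It is an illustration under stated assumptions, not the paper's exact procedure: the nearest-class decision rule, the choice of which component to drop, the small `ridge` term added to keep the covariance matrix invertible, and all function names are hypothetical choices made here.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def composition(seq):
    """Return 19 amino acid frequencies of a sequence.

    The 20 frequencies sum to 1, so one is redundant; dropping the last
    one (an arbitrary choice here) gives a (20-1)-D vector.
    """
    counts = np.array([seq.count(a) for a in AMINO_ACIDS], dtype=float)
    return (counts / counts.sum())[:-1]


def fit_class(vectors, ridge=1e-6):
    """Estimate the mean and inverse covariance of one structural class.

    `ridge` is an assumption of this sketch, added for numerical stability.
    """
    x = np.asarray(vectors)
    mean = x.mean(axis=0)
    cov = np.cov(x, rowvar=False) + ridge * np.eye(x.shape[1])
    return mean, np.linalg.inv(cov)


def mahalanobis_sq(x, mean, cov_inv):
    """Squared Mahalanobis distance from x to a class centre."""
    d = x - mean
    return float(d @ cov_inv @ d)


def predict(x, models):
    """Assign x to the class with the smallest Mahalanobis distance."""
    return min(models, key=lambda c: mahalanobis_sq(x, *models[c]))


def jackknife_accuracy(data):
    """Leave-one-out test: refit without each protein, then predict it.

    `data` maps a class label to a list of composition vectors.
    """
    hits = total = 0
    for label, vectors in data.items():
        for i, x in enumerate(vectors):
            reduced = dict(data)
            reduced[label] = vectors[:i] + vectors[i + 1:]
            models = {c: fit_class(v) for c, v in reduced.items()}
            hits += predict(x, models) == label
            total += 1
    return hits / total
```

With real data, each class would be fitted from the composition vectors of its training proteins, and `predict(composition(sequence), models)` would return the predicted class label for a query sequence.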
Pages: 319-344
Page count: 26