PREDICTION OF PROTEIN-FOLDING CLASS USING GLOBAL DESCRIPTION OF AMINO-ACID-SEQUENCE

被引:478
作者
DUBCHAK, I
MUCHNIK, I
HOLBROOK, SR
KIM, SH
机构
[1] UNIV CALIF BERKELEY,LAWRENCE BERKELEY LAB,BERKELEY,CA 94720
[2] RUTGERS STATE UNIV,CTR OPERAT RES,NEW BRUNSWICK,NJ 08903
关键词
D O I
10.1073/pnas.92.19.8700
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We present a method for predicting protein folding class based on global protein chain description and a voting process. Selection of the best descriptors was achieved by a computer-simulated neural network trained on a data base consisting of 83 folding classes, Protein-chain descriptors include overall composition, transition, and distribution of amino acid attributes, such as relative hydrophobicity, predicted secondary structure, and predicted solvent exposure, Cross-validation testing was performed on 15 of the largest classes. The test shows that proteins were assigned to the correct class (correct positive prediction) with an average accuracy of 71.7%, whereas the inverse prediction of proteins as not belonging to a particular class (correct negative prediction) was 90-95% accurate. When tested on 254 structures used in this study, the top two predictions contained the correct class in 91% of the cases.
引用
收藏
页码:8700 / 8704
页数:5
相关论文
共 28 条
[1]   THE CLASSIFICATION AND ORIGINS OF PROTEIN FOLDING PATTERNS [J].
CHOTHIA, C ;
FINKELSTEIN, AV .
ANNUAL REVIEW OF BIOCHEMISTRY, 1990, 59 :1007-1039
[2]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544
[3]   A CORRELATION-COEFFICIENT METHOD TO PREDICTING PROTEIN-STRUCTURAL CLASSES FROM AMINO-ACID COMPOSITIONS [J].
CHOU, KC ;
ZHANG, CT .
EUROPEAN JOURNAL OF BIOCHEMISTRY, 1992, 207 (02) :429-433
[4]   A NEW APPROACH TO PREDICTING PROTEIN FOLDING TYPES [J].
CHOU, KC ;
ZHANG, CT .
JOURNAL OF PROTEIN CHEMISTRY, 1993, 12 (02) :169-178
[5]  
CHOU PY, 1989, PREDICTION PROTEIN S, P549
[6]   PREDICTION OF PROTEIN FOLDING CLASS FROM AMINO-ACID-COMPOSITION [J].
DUBCHAK, I ;
HOLBROOK, SR ;
KIM, SH .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1993, 16 (01) :79-91
[7]  
DUBCHAK J, 1993, 1ST P INT C INT SYST, P118
[8]   WHY DO GLOBULAR-PROTEINS FIT THE LIMITED SET OF FOLDING PATTERNS [J].
FINKELSTEIN, AV ;
PTITSYN, OB .
PROGRESS IN BIOPHYSICS & MOLECULAR BIOLOGY, 1987, 50 (03) :171-190
[9]  
HERTZ J, 1992, INTRO THEORY NEURAL, P147
[10]   PREDICTING SURFACE EXPOSURE OF AMINO-ACIDS FROM PROTEIN-SEQUENCE [J].
HOLBROOK, SR ;
MUSKAL, SM ;
KIM, SH .
PROTEIN ENGINEERING, 1990, 3 (08) :659-665