PROTEIN CLASSIFICATION BY STOCHASTIC MODELING AND OPTIMAL FILTERING OF AMINO-ACID-SEQUENCES

Cited by: 117
Authors
WHITE, JV
STULTZ, CM
SMITH, TF
Affiliations
[1] HARVARD UNIV, COMM HIGHER DEGREES BIOPHYS, CAMBRIDGE, MA 02115
[2] BOSTON UNIV, BMERC, BOSTON, MA 02215
DOI
10.1016/0025-5564(94)90004-3
Chinese Library Classification (CLC)
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
The prediction of a protein's tertiary structural class from its amino-acid sequence is formulated as a signal-processing problem. The amino-acid sequence is treated as a "time series" of symbols containing signals that determine the protein's structural class. A methodology is described for building detailed stochastic signal models for recognized structural classes of single-domain proteins. We solve the problem of determining that model, from a set of candidates, which is the most probable generator of a protein's entire amino-acid sequence. The solution employs a nonlinear, optimal filtering algorithm, which is suited for implementation on parallel computer architectures. Previous approaches have only been able to classify correctly 80% of single-domain proteins within three very broad structural types, while our approach achieves this level across twelve much more detailed classes.
Pages: 35-75
Page count: 41