MODELING PROTEIN CORES WITH MARKOV RANDOM-FIELDS

被引:13
作者
WHITE, JV [1 ]
MUCHNIK, I [1 ]
SMITH, TF [1 ]
机构
[1] BOSTON UNIV,BMERC,BOSTON,MA
基金
美国国家科学基金会;
关键词
D O I
10.1016/0025-5564(94)90041-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
A mathematical formalism is introduced that has general applicability to many protein structure models used in the various approaches to the ''inverse protein folding problem.'' The inverse nature of the problem arises from the fact that one begins with a set of assumed tertiary structures and searches for those most compatible with a new sequence, rather than attempting to predict the structure directly from the new sequence. The formalism is based on the well-known theory of Markov random fields (MRFs). Our MRF formulation provides explicit representations for the relevant amino acid position environments and the physical topologies of the structural contacts. In particular, MRF models can readily be constructed for the secondary structure packing topologies found in protein domain cores, or other structural motifs, that are anticipated to be common among large sets of both homologous and nonhomologous proteins. MRF models are probabilistic and can exploit the statistical data from the limited number of proteins having known domain structures. The MRF approach leads to a new scoring function for comparing different threadings (placements) of a sequence through different structure models. The scoring function is very important, because comparing alternative structure models with each other is a key step in the inverse folding problem. Unlike previously published scoring functions, the one derived in this paper is based on a comprehensive probabilistic formulation of the threading problem.
引用
收藏
页码:149 / 179
页数:31
相关论文
共 33 条
  • [1] ACZEL J, 1975, MEASURES INFORMATION
  • [2] PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES
    BERNSTEIN, FC
    KOETZLE, TF
    WILLIAMS, GJB
    MEYER, EF
    BRICE, MD
    RODGERS, JR
    KENNARD, O
    SHIMANOUCHI, T
    TASUMI, M
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) : 535 - 542
  • [3] A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE
    BOWIE, JU
    LUTHY, R
    EISENBERG, D
    [J]. SCIENCE, 1991, 253 (5016) : 164 - 170
  • [4] INVERTED PROTEIN-STRUCTURE PREDICTION
    BOWIE, JU
    EISENBERG, D
    [J]. CURRENT OPINION IN STRUCTURAL BIOLOGY, 1993, 3 (03) : 437 - 444
  • [5] AN EMPIRICAL ENERGY FUNCTION FOR THREADING PROTEIN-SEQUENCE THROUGH THE FOLDING MOTIF
    BRYANT, SH
    LAWRENCE, CE
    [J]. PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1993, 16 (01) : 92 - 112
  • [6] PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST
    CHOTHIA, C
    [J]. NATURE, 1992, 357 (6379) : 543 - 544
  • [7] PREDICTION OF PROTEIN CONFORMATION
    CHOU, PY
    FASMAN, GD
    [J]. BIOCHEMISTRY, 1974, 13 (02) : 222 - 245
  • [8] PREDICTION OF PROTEIN FOLDING FROM AMINO-ACID-SEQUENCE OVER DISCRETE CONFORMATION SPACES
    CRIPPEN, GM
    [J]. BIOCHEMISTRY, 1991, 30 (17) : 4232 - 4237
  • [9] NEW PROGRAMS FOR PROTEIN TERTIARY STRUCTURE PREDICTION
    FETROW, JS
    BRYANT, SH
    [J]. BIO-TECHNOLOGY, 1993, 11 (04): : 479 - 484
  • [10] TOWARD PROTEIN TERTIARY STRUCTURE RECOGNITION BY MEANS OF ASSOCIATIVE MEMORY HAMILTONIANS
    FRIEDRICHS, MS
    WOLYNES, PG
    [J]. SCIENCE, 1989, 246 (4928) : 371 - 373