MODELING PROTEIN CORES WITH MARKOV RANDOM-FIELDS

Cited by: 13
Authors
WHITE, JV [1]
MUCHNIK, I [1]
SMITH, TF [1]
Affiliation
[1] BOSTON UNIV, BMERC, BOSTON, MA
Funding
US National Science Foundation
DOI
10.1016/0025-5564(94)90041-8
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
A mathematical formalism is introduced that has general applicability to many protein structure models used in the various approaches to the "inverse protein folding problem." The inverse nature of the problem arises from the fact that one begins with a set of assumed tertiary structures and searches for those most compatible with a new sequence, rather than attempting to predict the structure directly from the new sequence. The formalism is based on the well-known theory of Markov random fields (MRFs). Our MRF formulation provides explicit representations for the relevant amino acid position environments and the physical topologies of the structural contacts. In particular, MRF models can readily be constructed for the secondary structure packing topologies found in protein domain cores, or other structural motifs, that are anticipated to be common among large sets of both homologous and nonhomologous proteins. MRF models are probabilistic and can exploit the statistical data from the limited number of proteins having known domain structures. The MRF approach leads to a new scoring function for comparing different threadings (placements) of a sequence through different structure models. The scoring function is very important, because comparing alternative structure models with each other is a key step in the inverse folding problem. Unlike previously published scoring functions, the one derived in this paper is based on a comprehensive probabilistic formulation of the threading problem.
Pages: 149-179
Page count: 31