A generative, probabilistic model of local protein structure

被引:109
作者
Boomsma, Wouter [1 ]
Mardia, Kanti V. [2 ]
Taylor, Charles C. [2 ]
Ferkinghoff-Borg, Jesper [3 ]
Krogh, Anders [1 ]
Hamelryck, Thomas [1 ]
机构
[1] Univ Copenhagen, Dept Biol, Bioinformat Ctr, DK-2200 Copenhagen N, Denmark
[2] Univ Leeds, Dept Stat, Leeds LS2 9JT, W Yorkshire, England
[3] Tech Univ Denmark, DTU Elektro, DK-2800 Lyngby, Denmark
关键词
conformational sampling; directional statistics; probabilistic model; TorusDBN; Bayesian network;
D O I
10.1073/pnas.0801715105
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Despite significant progress in recent years, protein structure prediction maintains its status as one of the prime unsolved problems in computational biology. One of the key remaining challenges is an efficient probabilistic exploration of the structural space that correctly reflects the relative conformational stabilities. Here, we present a fully probabilistic, continuous model of local protein structure in atomic detail. The generative model makes efficient conformational sampling possible and provides a framework for the rigorous analysis of local sequence-structure correlations in the native state. Our method represents a significant theoretical and practical improvement over the widely used fragment assembly technique by avoiding the drawbacks associated with a discrete and nonprobabilistic approach.
引用
收藏
页码:8932 / 8937
页数:6
相关论文
共 29 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Helix capping [J].
Aurora, R ;
Rose, GD .
PROTEIN SCIENCE, 1998, 7 (01) :21-38
[3]   Prediction of local structure in proteins using a library of sequence-structure motifs [J].
Bystroff, C ;
Baker, D .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 281 (03) :565-577
[4]   HMMSTR: a hidden Markov model for local sequence-structure correlations in proteins [J].
Bystroff, C ;
Thorsson, V ;
Baker, D .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 301 (01) :173-190
[5]   Hidden Markov model approach for identifying the modular framework of the protein backbone [J].
Camproux, AC ;
Tuffery, P ;
Chevrolat, JP ;
Boisvieux, JF ;
Hazout, S .
PROTEIN ENGINEERING, 1999, 12 (12) :1063-1073
[6]   HMM sampling and applications to gene finding and alternative splicing [J].
Cawley, Simon L. ;
Pachter, Lior .
BIOINFORMATICS, 2003, 19 :II36-II41
[7]   Shaping up the protein folding funnel by local interaction: Lesson from a structure prediction study [J].
Chikenji, G ;
Fujitsuka, Y ;
Takada, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (09) :3141-3146
[8]  
Delano WL, 2002, PYMOL USERS MANUAL
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]  
Diebolt J., 1996, Markov Chain Monte Carlo in practice, P259