A Probabilistic Model of RNA Conformational Space

被引:76
作者
Frellsen, Jes [1 ]
Moltke, Ida [1 ]
Thiim, Martin [1 ]
Mardia, Kanti V. [2 ]
Ferkinghoff-Borg, Jesper [3 ]
Hamelryck, Thomas [1 ]
机构
[1] Univ Copenhagen, Dept Biol, Bioinformat Ctr, Copenhagen, Denmark
[2] Univ Leeds, Dept Stat, Leeds LS2 9JT, W Yorkshire, England
[3] Tech Univ Denmark, DTU Elektro, DK-2800 Lyngby, Denmark
基金
新加坡国家研究基金会;
关键词
NUCLEIC-ACIDS; ALGORITHM; SEQUENCE;
D O I
10.1371/journal.pcbi.1000406
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The increasing importance of non-coding RNA in biology and medicine has led to a growing interest in the problem of RNA 3-D structure prediction. As is the case for proteins, RNA 3-D structure prediction methods require two key ingredients: an accurate energy function and a conformational sampling procedure. Both are only partly solved problems. Here, we focus on the problem of conformational sampling. The current state of the art solution is based on fragment assembly methods, which construct plausible conformations by stringing together short fragments obtained from experimental structures. However, the discrete nature of the fragments necessitates the use of carefully tuned, unphysical energy functions, and their non-probabilistic nature impairs unbiased sampling. We offer a solution to the sampling problem that removes these important limitations: a probabilistic model of RNA structure that allows efficient sampling of RNA conformations in continuous space, and with associated probabilities. We show that the model captures several key features of RNA structure, such as its rotameric nature and the distribution of the helix lengths. Furthermore, the model readily generates native-like 3-D conformations for 9 out of 10 test structures, solely using coarse-grained base-pairing information. In conclusion, the method provides a theoretical and practical solution for a major bottleneck on the way to routine prediction and simulation of RNA structure and dynamics in atomic detail.
引用
收藏
页数:11
相关论文
共 43 条
[1]   RNA dynamics: it is about time [J].
Al-Hashimi, Hashim M. ;
Walter, Nils G. .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2008, 18 (03) :321-329
[2]   The eukaryotic genome as an RNA machine [J].
Amaral, Paulo P. ;
Dinger, Marcel E. ;
Mercer, Tim R. ;
Mattick, John S. .
SCIENCE, 2008, 319 (5871) :1787-1789
[3]   THE NUCLEIC-ACID DATABASE - A COMPREHENSIVE RELATIONAL DATABASE OF 3-DIMENSIONAL STRUCTURES OF NUCLEIC-ACIDS [J].
BERMAN, HM ;
OLSON, WK ;
BEVERIDGE, DL ;
WESTBROOK, J ;
GELBIN, A ;
DEMENY, T ;
HSIEH, SH ;
SRINIVASAN, AR ;
SCHNEIDER, B .
BIOPHYSICAL JOURNAL, 1992, 63 (03) :751-759
[4]   Assessing a mixture model for clustering with the integrated completed likelihood [J].
Biernacki, C ;
Celeux, G ;
Govaert, G .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (07) :719-725
[5]  
Bishop C. M., 2009, Pattern Recognition and Machine Learning
[6]   A generative, probabilistic model of local protein structure [J].
Boomsma, Wouter ;
Mardia, Kanti V. ;
Taylor, Charles C. ;
Ferkinghoff-Borg, Jesper ;
Krogh, Anders ;
Hamelryck, Thomas .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (26) :8932-8937
[7]  
Burnham K. P., 2002, Model selection and multimodel inference
[8]   HMM sampling and applications to gene finding and alternative splicing [J].
Cawley, Simon L. ;
Pachter, Lior .
BIOINFORMATICS, 2003, 19 :II36-II41
[9]   Automated de novo prediction of native-like RNA tertiary structures [J].
Das, Rhiju ;
Baker, David .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (37) :14664-14669
[10]   MolProbity: all-atom contacts and structure validation for proteins and nucleic acids [J].
Davis, Ian W. ;
Leaver-Fay, Andrew ;
Chen, Vincent B. ;
Block, Jeremy N. ;
Kapral, Gary J. ;
Wang, Xueyi ;
Murray, Laura W. ;
Arendall, W. Bryan, III ;
Snoeyink, Jack ;
Richardson, Jane S. ;
Richardson, David C. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :W375-W383