RNAalifold: improved consensus structure prediction for RNA alignments

被引:418
作者
Bernhart, Stephan H. [1 ,2 ]
Hofacker, Ivo L. [3 ]
Will, Sebastian [4 ]
Gruber, Andreas R. [3 ]
Stadler, Peter F. [1 ,2 ,3 ,5 ,6 ]
机构
[1] Univ Leipzig, Bioinformat Grp, Dept Comp Sci, D-04107 Leipzig, Germany
[2] Univ Leipzig, Interdisciplinary Ctr Bioinformat, D-04107 Leipzig, Germany
[3] Univ Vienna, Inst Theoret Chem, A-1090 Vienna, Austria
[4] Univ Freiburg, Bioinformat Grp, Dept Comp Sci, D-79110 Freiburg, Germany
[5] Fraunhofer Inst Cell Therapy & Immunol IZI, RNom Grp, D-04103 Leipzig, Germany
[6] Santa Fe Inst, Santa Fe, NM 87501 USA
基金
奥地利科学基金会;
关键词
D O I
10.1186/1471-2105-9-474
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The prediction of a consensus structure for a set of related RNAs is an important first step for subsequent analyses. RNAalifold, which computes the minimum energy structure that is simultaneously formed by a set of aligned sequences, is one of the oldest and most widely used tools for this task. In recent years, several alternative approaches have been advocated, pointing to several shortcomings of the original RNAalifold approach. Results: We show that the accuracy of RNAalifold predictions can be improved substantially by introducing a different, more rational handling of alignment gaps, and by replacing the rather simplistic model of covariance scoring with more sophisticated RIBOSUM-like scoring matrices. These improvements are achieved without compromising the computational efficiency of the algorithm. We show here that the new version of RNAalifold not only outperforms the old one, but also several other tools recently developed, on different datasets. Conclusion: The new version of RNAalifold not only can replace the old one for almost any application but it is also competitive with other approaches including those based on SCFGs, maximum expected accuracy, or hierarchical nearest neighbor classifiers.
引用
收藏
页数:13
相关论文
共 42 条
[1]  
*AF BOMPF RNA CONS, 2007, J EXP ZOOL PART B, V308, P1
[2]   Semiautomated improvement of RNA alignments [J].
Andersen, Ebbe S. ;
Lind-Thomsen, Allan ;
Knudsen, Bjarne ;
Kristensen, Susie E. ;
Havgaard, Jakob H. ;
Torarinsson, Elfar ;
Larsen, Niels ;
Zwieb, Christian ;
Sestoft, Peter ;
Kjems, Jorgen ;
Gorodkin, Jan .
RNA, 2007, 13 (11) :1850-1859
[3]   Efficient parameter estimation for RNA secondary structure prediction [J].
Andronescu, Mirela ;
Condon, Anne ;
Hoos, Holger H. ;
Mathews, David H. ;
Murphy, Kevin P. .
BIOINFORMATICS, 2007, 23 (13) :I19-I28
[4]   RNA STRAND: The RNA secondary structure and statistical analysis database [J].
Andronescu, Mirela ;
Bereg, Vera ;
Hoos, Holger H. ;
Condon, Anne .
BMC BIOINFORMATICS, 2008, 9 (1)
[5]   RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers [J].
Bindewald, E ;
Shapiro, BA .
RNA, 2006, 12 (03) :342-352
[6]   Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project [J].
Birney, Ewan ;
Stamatoyannopoulos, John A. ;
Dutta, Anindya ;
Guigo, Roderic ;
Gingeras, Thomas R. ;
Margulies, Elliott H. ;
Weng, Zhiping ;
Snyder, Michael ;
Dermitzakis, Emmanouil T. ;
Stamatoyannopoulos, John A. ;
Thurman, Robert E. ;
Kuehn, Michael S. ;
Taylor, Christopher M. ;
Neph, Shane ;
Koch, Christoph M. ;
Asthana, Saurabh ;
Malhotra, Ankit ;
Adzhubei, Ivan ;
Greenbaum, Jason A. ;
Andrews, Robert M. ;
Flicek, Paul ;
Boyle, Patrick J. ;
Cao, Hua ;
Carter, Nigel P. ;
Clelland, Gayle K. ;
Davis, Sean ;
Day, Nathan ;
Dhami, Pawandeep ;
Dillon, Shane C. ;
Dorschner, Michael O. ;
Fiegler, Heike ;
Giresi, Paul G. ;
Goldy, Jeff ;
Hawrylycz, Michael ;
Haydock, Andrew ;
Humbert, Richard ;
James, Keith D. ;
Johnson, Brett E. ;
Johnson, Ericka M. ;
Frum, Tristan T. ;
Rosenzweig, Elizabeth R. ;
Karnani, Neerja ;
Lee, Kirsten ;
Lefebvre, Gregory C. ;
Navas, Patrick A. ;
Neri, Fidencio ;
Parker, Stephen C. J. ;
Sabo, Peter J. ;
Sandstrom, Richard ;
Shafer, Anthony .
NATURE, 2007, 447 (7146) :799-816
[7]   Centroid estimation in discrete high-dimensional spaces with applications in biology [J].
Carvalho, Luis E. ;
Lawrence, Charles E. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2008, 105 (09) :3209-3214
[8]   Multiple sequence alignment with the Clustal series of programs [J].
Chenna, R ;
Sugawara, H ;
Koike, T ;
Lopez, R ;
Gibson, TJ ;
Higgins, DG ;
Thompson, JD .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3497-3500
[9]   Efficient pairwise RNA structure prediction and alignment using sequence alignment constraints [J].
D Dowell, Robin ;
Eddy, Sean R. .
BMC BIOINFORMATICS, 2006, 7 (1)
[10]   STRAL: progressive alignment of non-coding RNA using base pairing probability vectors in quadratic time [J].
Dalli, Deniz ;
Wilm, Andreas ;
Mainz, Indra ;
Steger, Gerhard .
BIOINFORMATICS, 2006, 22 (13) :1593-1599