Measuring covariation in RNA alignments: physical realism improves information measures

被引:37
作者
Lindgreen, S.
Gardner, P. P.
Krogh, A.
机构
[1] Univ Copenhagen, Inst Mol Biol, Bioinformat Ctr, DK-2100 Copenhagen O, Denmark
[2] Univ Copenhagen, Inst Mol Biol, Mol Evolut Grp, DK-2100 Copenhagen O, Denmark
关键词
D O I
10.1093/bioinformatics/btl514
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The importance of non-coding RNAs is becoming increasingly evident, and often the function of these molecules depends on the structure. It is common to use alignments of related RNA sequences to deduce the consensus secondary structure by detecting patterns of co-evolution. A central part of such an analysis is to measure covariation between two positions in an alignment. Here, we rank various measures ranging from simple mutual information to more advanced covariation measures. Results: Mutual information is still used for secondary structure prediction, but the results of this study indicate which measures are useful. Incorporating more structural information by considering e.g. indels and stacking improves accuracy, suggesting that physically realistic measures yield improved predictions. This can be used to improve both current and future programs for secondary structure prediction. The best measure tested is the RNAalifold covariation measure modified to include stacking. Availability: Scripts, data and supplementary material can be found at http://www.binf.ku.dk/Stinus_covariation Contact: stinus@binf.ku.dk Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:2988 / 2995
页数:8
相关论文
共 42 条
[31]   RNA folding and unfolding [J].
Onoa, B ;
Tinoco, I .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2004, 14 (03) :374-379
[32]   Identification and classification of conserved RNA secondary structures in the human genome [J].
Pedersen, Jakob Skou ;
Bejerano, Gill ;
Siepel, Adam ;
Rosenbloom, Kate ;
Lindblad-Toh, Kerstin ;
Lander, Eric S. ;
Kent, Jim ;
Miller, Webb ;
Haussler, David .
PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (04) :251-262
[33]   Noncoding RNA gene detection using comparative sequence analysis [J].
Rivas, Elena ;
Eddy, Sean R. .
BMC BIOINFORMATICS, 2001, 2 (1)
[34]   An Iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots [J].
Ruan, JH ;
Stormo, GD ;
Zhang, WX .
BIOINFORMATICS, 2004, 20 (01) :58-66
[36]   A MATHEMATICAL THEORY OF COMMUNICATION [J].
SHANNON, CE .
BELL SYSTEM TECHNICAL JOURNAL, 1948, 27 (03) :379-423
[37]   A MATHEMATICAL THEORY OF COMMUNICATION [J].
SHANNON, CE .
BELL SYSTEM TECHNICAL JOURNAL, 1948, 27 (04) :623-656
[38]   5S ribosomal RNA database [J].
Szymanski, M ;
Barciszewska, MZ ;
Erdmann, VA ;
Barciszewski, J .
NUCLEIC ACIDS RESEARCH, 2002, 30 (01) :176-178
[39]   Fast and reliable prediction of noncoding RNAs [J].
Washietl, S ;
Hofacker, IL ;
Stadler, PF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (07) :2454-2459
[40]   OPTIMAL COMPUTER FOLDING OF LARGE RNA SEQUENCES USING THERMODYNAMICS AND AUXILIARY INFORMATION [J].
ZUKER, M ;
STIEGLER, P .
NUCLEIC ACIDS RESEARCH, 1981, 9 (01) :133-148