Measuring covariation in RNA alignments: physical realism improves information measures

被引:37
作者
Lindgreen, S.
Gardner, P. P.
Krogh, A.
机构
[1] Univ Copenhagen, Inst Mol Biol, Bioinformat Ctr, DK-2100 Copenhagen O, Denmark
[2] Univ Copenhagen, Inst Mol Biol, Mol Evolut Grp, DK-2100 Copenhagen O, Denmark
关键词
D O I
10.1093/bioinformatics/btl514
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The importance of non-coding RNAs is becoming increasingly evident, and often the function of these molecules depends on the structure. It is common to use alignments of related RNA sequences to deduce the consensus secondary structure by detecting patterns of co-evolution. A central part of such an analysis is to measure covariation between two positions in an alignment. Here, we rank various measures ranging from simple mutual information to more advanced covariation measures. Results: Mutual information is still used for secondary structure prediction, but the results of this study indicate which measures are useful. Incorporating more structural information by considering e.g. indels and stacking improves accuracy, suggesting that physically realistic measures yield improved predictions. This can be used to improve both current and future programs for secondary structure prediction. The best measure tested is the RNAalifold covariation measure modified to include stacking. Availability: Scripts, data and supplementary material can be found at http://www.binf.ku.dk/Stinus_covariation Contact: stinus@binf.ku.dk Supplementary information: Supplementary data are available at Bioinformatics online.
引用
收藏
页码:2988 / 2995
页数:8
相关论文
共 42 条
[1]   Phylogenetically enhanced statistical tools for RNA structure prediction [J].
Akmaev, VR ;
Kelley, ST ;
Stormo, GD .
BIOINFORMATICS, 2000, 16 (06) :501-512
[2]   RNA secondary structure prediction from sequence alignments using a network of k-nearest neighbor classifiers [J].
Bindewald, E ;
Shapiro, BA .
RNA, 2006, 12 (03) :342-352
[3]   STABILITY OF RIBONUCLEIC-ACID DOUBLE-STRANDED HELICES [J].
BORER, PN ;
DENGLER, B ;
TINOCO, I ;
UHLENBECK, OC .
JOURNAL OF MOLECULAR BIOLOGY, 1974, 86 (04) :843-853
[4]  
CHIU DKY, 1991, COMPUT APPL BIOSCI, V7, P347
[5]   MSARI: Multiple sequence alignments for statistical detection of RNA secondary structure [J].
Coventry, A ;
Kleitman, DJ ;
Berger, B .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (33) :12102-12107
[6]  
Cover TM, 2006, Elements of Information Theory
[7]  
Durbin R., 1998, Biological sequence analysis: Probabilistic models of proteins and nucleic acids
[8]   RNA SEQUENCE-ANALYSIS USING COVARIANCE-MODELS [J].
EDDY, SR ;
DURBIN, R .
NUCLEIC ACIDS RESEARCH, 1994, 22 (11) :2079-2088
[9]   A benchmark of multiple sequence alignment programs upon structural RNAs [J].
Gardner, PP ;
Wilm, A ;
Washietl, S .
NUCLEIC ACIDS RESEARCH, 2005, 33 (08) :2433-2439
[10]   A comprehensive comparison of comparative RNA structure prediction approaches [J].
Gardner, PP ;
Giegerich, R .
BMC BIOINFORMATICS, 2004, 5 (1)