MARNA: multiple alignment and consensus structure prediction of RNAs based on sequence structure comparisons

被引:113
作者
Siebert, S [1 ]
Backofen, R [1 ]
机构
[1] Univ Jena, Inst Comp Sci, Dept Bioinformat, D-07743 Jena, Germany
关键词
D O I
10.1093/bioinformatics/bti550
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Due to the importance of considering secondary structures in aligning functional RNAs, several pairwise sequence-structure alignment methods have been developed. They use extended alignment scores that evaluate secondary structure information in addition to sequence information. However, two problems for the multiple alignment step remain. First, how to combine pairwise sequence-structure alignments into a multiple alignment and second, how to generate secondary structure information for sequences whose explicit structural information is missing. Results: We describe a novel approach for multiple alignment of RNAs (MARNA) taking into consideration both the primary and the secondary structures. It is based on pairwise sequence-structure comparisons of RNAs. From these sequence-structure alignments, libraries of weighted alignment edges are generated. The weights reflect the sequential and structural conservation. For sequences whose secondary structures are missing, the libraries are generated by sampling low energy conformations. The libraries are then processed by the T-Coffee system, which is a consistency based multiple alignment method. Furthermore, we are able to extract a consensus-sequence and -structure from a multiple alignment. We have successfully tested MARNA on several datasets taken from the Rfam database.
引用
收藏
页码:3352 / 3359
页数:8
相关论文
共 33 条
[1]  
Backofen Rolf, 2004, J Bioinform Comput Biol, V2, P681, DOI 10.1142/S0219720004000818
[2]  
BROWN MP, 1999, THESIS U CALIFORNIA
[3]  
CHIU DKY, 1991, COMPUT APPL BIOSCI, V7, P347
[4]  
CORPET F, 1994, COMPUT APPL BIOSCI, V10, P389
[5]   Small RNAs make big splash [J].
Couzin, J .
SCIENCE, 2002, 298 (5602) :2296-2297
[6]   Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction [J].
Dowell, RD ;
Eddy, SR .
BMC BIOINFORMATICS, 2004, 5 (1)
[7]   RNA SEQUENCE-ANALYSIS USING COVARIANCE-MODELS [J].
EDDY, SR ;
DURBIN, R .
NUCLEIC ACIDS RESEARCH, 1994, 22 (11) :2079-2088
[8]   Abstract shapes of RNA [J].
Giegerich, R ;
Voss, B ;
Rehmsmeier, M .
NUCLEIC ACIDS RESEARCH, 2004, 32 (16) :4843-4851
[9]   Rfam: an RNA family database [J].
Griffiths-Jones, S ;
Bateman, A ;
Marshall, M ;
Khanna, A ;
Eddy, SR .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :439-441
[10]   HIGHER-ORDER STRUCTURAL ELEMENTS IN RIBOSOMAL-RNAS - PSEUDO-KNOTS AND THE USE OF NONCANONICAL PAIRS [J].
GUTELL, RR ;
WOESE, CR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1990, 87 (02) :663-667