Consensus shapes: an alternative to the Sankoff algorithm for RNA consensus structure prediction

被引:66
作者
Reeder, J [1 ]
Giegerich, R [1 ]
机构
[1] Univ Bielefeld, Fac Technol, D-33615 Bielefeld, Germany
关键词
D O I
10.1093/bioinformatics/bti577
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The well-known Sankoff algorithm for simultaneous RNA sequence alignment and folding is currently considered an ideal, but computationally over-expensive method. Available tools implement this algorithm under various pragmatic restrictions. They are still expensive to use, and it is difficult to judge if the moderate quality of results is because of the underlying model or to its imperfect implementation. Results: We propose to redefine the consensus structure prediction problem in a way that does not imply a multiple sequence alignment step. For a family of RNA sequences, our method explicitly and independently enumerates the near-optimal abstract shape space, and predicts as the consensus an abstract shape common to all sequences. For each sequence, it delivers the thermodynamically best structure which has this common shape. Since the shape space is much smaller than the structure space, and identification of common shapes can be done in linear time (in the number of shapes considered), the method is essentially linear in the number of sequences. Our evaluation shows that the new method compares favorably with available alternatives.
引用
收藏
页码:3516 / 3523
页数:8
相关论文
共 27 条
  • [1] Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction
    Doshi, KJ
    Cannone, JJ
    Cobaugh, CW
    Gutell, RR
    [J]. BMC BIOINFORMATICS, 2004, 5 (1)
  • [2] A comprehensive comparison of comparative RNA structure prediction approaches
    Gardner, PP
    Giegerich, R
    [J]. BMC BIOINFORMATICS, 2004, 5 (1)
  • [3] Abstract shapes of RNA
    Giegerich, R
    Voss, B
    Rehmsmeier, M
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 (16) : 4843 - 4851
  • [4] Finding the most significant common sequence and structure motifs in a set of RNA sequences
    Gorodkin, J
    Heyer, LJ
    Stormo, GD
    [J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (18) : 3724 - 3732
  • [5] Rfam: an RNA family database
    Griffiths-Jones, S
    Bateman, A
    Marshall, M
    Khanna, A
    Eddy, SR
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 439 - 441
  • [6] IDENTIFYING CONSTRAINTS ON THE HIGHER-ORDER STRUCTURE OF RNA - CONTINUED DEVELOPMENT AND APPLICATION OF COMPARATIVE SEQUENCE-ANALYSIS METHODS
    GUTELL, RR
    POWER, A
    HERTZ, GZ
    PUTZ, EJ
    STORMO, GD
    [J]. NUCLEIC ACIDS RESEARCH, 1992, 20 (21) : 5785 - 5795
  • [7] Pure multiple RNA secondary structure alignments:: A progressive profile approach
    Höchsmann, M
    Voss, B
    Giegerich, R
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2004, 1 (01) : 53 - 62
  • [8] FAST FOLDING AND COMPARISON OF RNA SECONDARY STRUCTURES
    HOFACKER, IL
    FONTANA, W
    STADLER, PF
    BONHOEFFER, LS
    TACKER, M
    SCHUSTER, P
    [J]. MONATSHEFTE FUR CHEMIE, 1994, 125 (02): : 167 - 188
  • [9] Secondary structure prediction for aligned RNA sequences
    Hofacker, IL
    Fekete, M
    Stadler, PF
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 2002, 319 (05) : 1059 - 1066
  • [10] An extensive class of small RNAs in Caenorhabditis elegans
    Lee, RC
    Ambros, V
    [J]. SCIENCE, 2001, 294 (5543) : 862 - 864