The language of RNA: a formal grammar that includes pseudoknots

被引:102
作者
Rivas, E [1 ]
Eddy, SR [1 ]
机构
[1] Washington Univ, Dept Genet, St Louis, MO 63110 USA
关键词
D O I
10.1093/bioinformatics/16.4.334
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In a previous paper we presented a polynomial time dynamic programming algorithm for predicting optimal RNA secondary structure including pseudoknots. However a formal grammatical representation for RNA secondary structure with pseudoknots was still lacking. Results: Here we show a one-to-one correspondence between that algorithm and a formal transformational grammar This grammar class encompasses the context-free grammars and goes beyond to generate pseudoknotted structures. The pseudoknot grammar avoids the use of general context-sensitive rules by introducing a small number of auxiliary symbols used to reorder the strings generated by an otherwise context-free grammar. This formal representation of the residue correlations in RNA structure is important because it means we can build full probabilistic models of RNA secondary structure, including pseudoknots, and use them to optimally parse sequences in polynomial time.
引用
收藏
页码:334 / 340
页数:7
相关论文
共 27 条
  • [1] PREDICTION OF RNA SECONDARY STRUCTURE, INCLUDING PSEUDOKNOTTING, BY COMPUTER-SIMULATION
    ABRAHAMS, JP
    VANDENBERG, M
    VANBATENBURG, E
    PLEIJ, C
    [J]. NUCLEIC ACIDS RESEARCH, 1990, 18 (10) : 3035 - 3044
  • [2] INDEXED GRAMMARS - AN EXTENSION OF CONTEXT-FREE GRAMMARS
    AHO, AV
    [J]. JOURNAL OF THE ACM, 1968, 15 (04) : 647 - &
  • [3] [Anonymous], 1991, FDN ISSUES NATURAL L
  • [4] BROWN M, 1996, PAC S BIOC 1996
  • [5] Bent pseudoknots and novel RNA inhibitors of type 1 human immunodeficiency virus (HIV-1) reverse transcriptase
    Burke, DH
    Scates, L
    Andrews, K
    Gold, L
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1996, 264 (04) : 650 - 666
  • [6] Cary R B, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P75
  • [7] CECH TR, 1993, RNA WORLD, P239
  • [8] Chomsky Noam, 1959, Infromation and Control, V2, P137, DOI 10.1016/S0019-9958(59)90362-6
  • [9] Durbin R., 1998, BIOL SEQUENCE ANAL P
  • [10] RNA SEQUENCE-ANALYSIS USING COVARIANCE-MODELS
    EDDY, SR
    DURBIN, R
    [J]. NUCLEIC ACIDS RESEARCH, 1994, 22 (11) : 2079 - 2088