Stochastic modeling of RNA pseudoknotted structures: a grammatical approach

被引:45
作者
Cai, Liming [1 ]
Malmberg, Russell L. [2 ]
Wu, Yunzhou [1 ]
机构
[1] Univ Georgia, Dept Comp Sci, Athens, GA 30602 USA
[2] Univ Georgia, Dept Plant Biol, Athens, GA 30602 USA
关键词
D O I
10.1093/bioinformatics/btg1007
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Modeling RNA pseudoknotted structures remains challenging. Methods have previously been developed to model RNA stem-loops successfully using stochastic context-free grammars (SCFG) adapted from computational linguistics; however, the additional complexity of pseudoknots has made modeling them more difficult. Formally a context-sensitive grammar is required, which would impose a large increase in complexity. Results: We introduce a new grammar modeling approach for RNA pseudoknotted structures based on parallel communicating grammar systems (PCGS). Our new approach can specify pseudoknotted structures, while avoiding context-sensitive rules, using a single CFG synchronized with a number of regular grammars. Technically, the stochastic version of the grammar model can be as simple as an SCFG. As with SCFG, the new approach permits automatic generation of a single-RNA structure prediction algorithm for each specified pseudoknotted structure model. This approach also makes it possible to develop full probabilistic models of pseudoknotted structures to allow the prediction of consensus structures by comparative analysis and structural homology recognition in database searches.
引用
收藏
页码:i66 / i73
页数:8
相关论文
共 27 条
[1]   Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots [J].
Akutsu, T .
DISCRETE APPLIED MATHEMATICS, 2000, 104 (1-3) :45-62
[2]  
Brown M., 1995, Pacific Symposium on Biocomputing '96, P109
[3]  
BROWN MP, 2000, P INT C INTEL SYST M, V56, P57
[4]  
CAI L, 1995, P 2 INT C DEV LANG T, P209
[5]  
Cai LM, 1996, COMPUT ARTIF INTELL, V15, P199
[6]  
CARY RB, 1995, P 3 INT C INT SYST M, P75
[7]  
CHOMSKY N, 1956, IRE T INFORM THEOR, V2, P113
[8]  
Durbin R., 1998, BIOL SEQUENCE ANAL P
[9]   RNA SEQUENCE-ANALYSIS USING COVARIANCE-MODELS [J].
EDDY, SR ;
DURBIN, R .
NUCLEIC ACIDS RESEARCH, 1994, 22 (11) :2079-2088
[10]   Phylogenetic analysis of tmRNA genes within a bacterial subgroup reveals a specific structural signature [J].
Felden, B ;
Massire, C ;
Westhof, E ;
Atkins, JF ;
Gesteland, RF .
NUCLEIC ACIDS RESEARCH, 2001, 29 (07) :1602-1607