An Iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots

被引:148
作者
Ruan, JH [1 ]
Stormo, GD
Zhang, WX
机构
[1] Washington Univ, Dept Comp Sci, St Louis, MO 63130 USA
[2] Washington Univ, Dept Genet, St Louis, MO 63130 USA
关键词
D O I
10.1093/bioinformatics/btg373
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Pseudoknots have generally been excluded from the prediction of RNA secondary structures due to its difficulty in modeling. Although, several dynamic programming algorithms exist for the prediction of pseudoknots using thermodynamic approaches, they are neither reliable nor efficient. On the other hand, comparative methods are more reliable, but are often done in an ad hoc manner and require expert intervention. Maximum weighted matching, an algorithm for pseudoknot prediction with comparative analysis, suffers from low-prediction accuracy in many cases. Results: Here we present an algorithm, iterated loop matching, for reliably and efficiently predicting RNA secondary structures including pseudoknots. The method can utilize either thermodynamic or comparative information or both, thus is able to predict pseudoknots for both aligned and individual sequences. We have tested the algorithm on a number of RNA families. Using 8-12 homologous sequences, the algorithm correctly identifies more than 90% of base-pairs for short sequences and 80% overall. It correctly predicts nearly all pseudoknots and produces very few spurious base-pairs for sequences without pseudoknots. Comparisons show that our algorithm is both more sensitive and more specific than the maximum weighted matching method. In addition, our algorithm has high-prediction accuracy on individual sequences, comparable with the PKNOTS algorithm, while using much less computational resources.
引用
收藏
页码:58 / 66
页数:9
相关论文
共 41 条
[1]   PREDICTION OF RNA SECONDARY STRUCTURE, INCLUDING PSEUDOKNOTTING, BY COMPUTER-SIMULATION [J].
ABRAHAMS, JP ;
VANDENBERG, M ;
VANBATENBURG, E ;
PLEIJ, C .
NUCLEIC ACIDS RESEARCH, 1990, 18 (10) :3035-3044
[2]  
AKMAEV V, 1999, P INT C INT SYST MOL, V7, P10
[3]   Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots [J].
Akutsu, T .
DISCRETE APPLIED MATHEMATICS, 2000, 104 (1-3) :45-62
[4]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[5]   Assessing the accuracy of prediction algorithms for classification: an overview [J].
Baldi, P ;
Brunak, S ;
Chauvin, Y ;
Andersen, CAF ;
Nielsen, H .
BIOINFORMATICS, 2000, 16 (05) :412-424
[6]   Pseudoknots in prion protein mRNAs confirmed by comparative sequence analysis and pattern searching [J].
Barrette, I ;
Poisson, G ;
Gendron, P ;
Major, F .
NUCLEIC ACIDS RESEARCH, 2001, 29 (03) :753-758
[7]   The Comparative RNA Web (CRW) Site:: an online database of comparative sequence and structure information for ribosomal, intron, and other RNAs -: art. no. 2 [J].
Cannone, JJ ;
Subramanian, S ;
Schnare, MN ;
Collett, JR ;
D'Souza, LM ;
Du, YS ;
Feng, B ;
Lin, N ;
Madabusi, LV ;
Müller, KM ;
Pande, N ;
Shang, ZD ;
Yu, N ;
Gutell, RR .
BMC BIOINFORMATICS, 2002, 3 (1)
[8]  
Cary R B, 1995, Proc Int Conf Intell Syst Mol Biol, V3, P75
[9]   Secondary structure of vertebrate telomerase RNA [J].
Chen, JL ;
Blasco, MA ;
Greider, CW .
CELL, 2000, 100 (05) :503-514
[10]  
CHIU DKY, 1991, COMPUT APPL BIOSCI, V7, P347