Incorporating chemical modification constraints into a dynamic programming algorithm for prediction of RNA secondary structure

被引:1137
作者
Mathews, DH
Disney, MD
Childs, JL
Schroeder, SJ
Zuker, M
Turner, DH
机构
[1] Univ Rochester, Sch Med & Dent, Ctr Human Genet & Mol Pediat Dis, Aab Inst Biomed Sci, Rochester, NY 14642 USA
[2] Univ Rochester, Dept Chem, Rochester, NY 14627 USA
[3] Rensselaer Polytech Inst, Dept Math, Troy, NY 12180 USA
关键词
D O I
10.1073/pnas.0401799101
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A dynamic programming algorithm for prediction of RNA secondary structure has been revised to accommodate folding constraints determined by chemical modification and to include free energy increments for coaxial stacking of helices when they are either adjacent or separated by a single mismatch. Furthermore, free energy parameters are revised to account for recent experimental results for terminal mismatches and hairpin, bulge, internal, and multibranch loops. To demonstrate the applicability of this method, in vivo modification was performed on 5S rRNA in both Escherichia coli and Candida albicans with 1-cyclohexyl-3-(2-morpholinoethyl) carbodiimide metho-p-toluene sulfonate, dimethyl sulfate, and kethoxal. The percentage of known base pairs in the predicted structure increased from 26.3% to 86.8% for the E. coli sequence by using modification constraints. For C albicans, the accuracy remained 87.5% both with and without modification data. On average, for these sequences and a set of 14 sequences with known secondary structure and chemical modification data taken from the literature, accuracy improves from 67% to 76%. This enhancement primarily reflects improvement for three sequences that are predicted with <40% accuracy on the basis of energetics alone. For these sequences, inclusion of chemical modification constraints improves the average accuracy from 28% to 78%. For the 11 sequences with <6% pseudoknotted base pairs, structures predicted with constraints from chemical modification contain on average 84% of known canonical base pairs.
引用
收藏
页码:7287 / 7292
页数:6
相关论文
共 92 条