Analysis of internal loops within the RNA secondary structure in almost quadratic time

被引:28
作者
Ogurtsov, Aleksey Y.
Shabalina, Syetlana A.
Kondrashov, Alexey S.
Roytberg, Mikhail A. [1 ]
机构
[1] Russian Acad Sci, Inst Math Problems Biol, Pushchino 142290, Moscow Region, Russia
[2] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
基金
俄罗斯基础研究基金会; 美国国家卫生研究院;
关键词
D O I
10.1093/bioinformatics/btl083
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Evaluating all possible internal loops is one of the key steps in predicting the optimal secondary structure of an RNA molecule. The best algorithm available runs in time O(L-3), L is the length of the RNA. Results: We propose a new algorithm for evaluating internal loops, its run-time is O(M*log(2)L), M < L-2 is a number of possible nucleotide pairings. We created a software tool Afold which predicts the optimal secondary structure of RNA molecules of lengths up to 28 000 nt, using a computer with 2 Gb RAM. We also propose algorithms constructing sets of conditionally optimal multi-branch loop free (MLF) structures, e.g. the set that for every possible pairing (x, y) contains an optimal MLF structure in which nucleotides x and y form a pair. All the algorithms have run-time O(M*log(2)L).
引用
收藏
页码:1317 / 1324
页数:8
相关论文
共 28 条
[1]  
Aho A.V., 1974, The Design and Analysis of Computer Algorithms
[2]   The efficacy of small interfering RNAs targeted to the type 1 insulin-like growth factor receptor (IGF1R) is influenced by secondary structure in the IGF1R transcript [J].
Bohula, EA ;
Salisbury, AJ ;
Sohail, M ;
Playford, MP ;
Riedemann, J ;
Southern, EM ;
Macaulay, VM .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2003, 278 (18) :15991-15997
[3]   SPARSE DYNAMIC-PROGRAMMING .2. CONVEX AND CONCAVE COST-FUNCTIONS [J].
EPPSTEIN, D ;
GALIL, Z ;
GIANCARLO, R ;
ITALIANO, GF .
JOURNAL OF THE ACM, 1992, 39 (03) :546-567
[4]   The activity of siRNA in mammalian cells is related to structural target accessibility: a comparison with antisense oligonucleotides [J].
Far, RKK ;
Sczakiel, G .
NUCLEIC ACIDS RESEARCH, 2003, 31 (15) :4417-4424
[5]   COMPUTATION OF BIOPOLYMERS - A GENERAL-APPROACH TO DIFFERENT PROBLEMS [J].
FINKELSTEIN, AV ;
ROYTBERG, MA .
BIOSYSTEMS, 1993, 30 (1-3) :1-19
[6]   LINEAR SPACE ALGORITHM FOR COMPUTING MAXIMAL COMMON SUBSEQUENCES [J].
HIRSCHBERG, DS .
COMMUNICATIONS OF THE ACM, 1975, 18 (06) :341-343
[7]   FAST FOLDING AND COMPARISON OF RNA SECONDARY STRUCTURES [J].
HOFACKER, IL ;
FONTANA, W ;
STADLER, PF ;
BONHOEFFER, LS ;
TACKER, M ;
SCHUSTER, P .
MONATSHEFTE FUR CHEMIE, 1994, 125 (02) :167-188
[8]   Conserved RNA secondary structures in viral genomes: a survey [J].
Hofacker, IL ;
Stadler, PF ;
Stocsits, RR .
BIOINFORMATICS, 2004, 20 (10) :1495-1499
[9]   IMPROVED PREDICTIONS OF SECONDARY STRUCTURES FOR RNA [J].
JAEGER, JA ;
TURNER, DH ;
ZUKER, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1989, 86 (20) :7706-7710
[10]   ONLINE DYNAMIC-PROGRAMMING WITH APPLICATIONS TO THE PREDICTION OF RNA SECONDARY STRUCTURE [J].
LARMORE, LL ;
SCHIEBER, B .
JOURNAL OF ALGORITHMS, 1991, 12 (03) :490-515