Abstract shapes of RNA

被引:145
作者
Giegerich, R
Voss, B
Rehmsmeier, M
机构
[1] Univ Bielefeld, Inst Bioinformat, D-33501 Bielefeld, Germany
[2] Univ Bielefeld, Int NRW Grad Sch Bioinformat & Genome Res, D-33501 Bielefeld, Germany
关键词
D O I
10.1093/nar/gkh779
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The function of a non-protein-coding RNA is often determined by its structure. Since experimental determination of RNA structure is time-consuming and expensive, its computational prediction is of great interest, and efficient solutions based on thermodynamic parameters are known. Frequently, however, the predicted minimum free energy structures are not the native ones, leading to the necessity of generating suboptimal solutions. While this can be accomplished by a number of programs, the user is often confronted with large outputs of similar structures, although he or she is interested in structures with more fundamental differences, or, in other words, with different abstract shapes. Here, we formalize the concept of abstract shapes and introduce their efficient computation. Each shape of an RNA molecule comprises a class of similar structures and has a representative structure of minimal free energy within the class. Shape analysis is implemented in the program RNAshapes. We applied RNAshapes to the prediction of optimal and suboptimal abstract shapes of several RNAs. For a given energy range, the number of shapes is considerably smaller than the number of structures, and in all cases, the native structures were among the top shape representatives. This demonstrates that the researcher can quickly focus on the structures of interest, without processing up to thousands of near-optimal solutions. We complement this study with a large-scale analysis of the growth behaviour of structure and shape spaces. RNAshapes is available for download and as an online version on the Bielefeld Bioinformatics Server.
引用
收藏
页码:4843 / 4851
页数:9
相关论文
共 25 条
[1]  
[Anonymous], MATH METHODS DNA SEQ
[2]  
BURGE CB, 1999, RNA WORLD
[3]   A statistical sampling algorithm for RNA secondary structure prediction [J].
Ding, Y ;
Lawrence, CE .
NUCLEIC ACIDS RESEARCH, 2003, 31 (24) :7280-7301
[4]  
EVERS DJ, 2001, P GERM C BIOINF, P118
[5]   Barrier trees of degenerate landscapes [J].
Flamm, C ;
Hofacker, IL ;
Stadler, PF ;
Wolfinger, MT .
ZEITSCHRIFT FUR PHYSIKALISCHE CHEMIE-INTERNATIONAL JOURNAL OF RESEARCH IN PHYSICAL CHEMISTRY & CHEMICAL PHYSICS, 2002, 216 (02) :155-173
[6]   A discipline of dynamic programming over sequence data [J].
Giegerich, R ;
Meyer, C ;
Steffen, P .
SCIENCE OF COMPUTER PROGRAMMING, 2004, 51 (03) :215-263
[7]  
Giegerich R, 2000, LECT NOTES COMPUT SC, V1848, P46
[8]   Rfam: an RNA family database [J].
Griffiths-Jones, S ;
Bateman, A ;
Marshall, M ;
Khanna, A ;
Eddy, SR .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :439-441
[9]   FAST FOLDING AND COMPARISON OF RNA SECONDARY STRUCTURES [J].
HOFACKER, IL ;
FONTANA, W ;
STADLER, PF ;
BONHOEFFER, LS ;
TACKER, M ;
SCHUSTER, P .
MONATSHEFTE FUR CHEMIE, 1994, 125 (02) :167-188
[10]  
HOFACKER IL, 2000, RNAFOLD MANUAL V1 4