Topological classification of RNA structures

被引:98
作者
Bon, Michael [2 ,3 ]
Vernizzi, Graziano [1 ]
Orland, Henri [2 ]
Zee, A. [4 ,5 ]
机构
[1] Northwestern Univ, Dept Mat Sci, Evanston, IL 60208 USA
[2] CEA Saclay, Serv Phys Theor, F-91191 Gif Sur Yvette, France
[3] Ecole Natl Super Mines, F-75006 Paris, France
[4] Univ Calif Santa Barbara, Dept Phys, Santa Barbara, CA 93106 USA
[5] Univ Calif Santa Barbara, Kavli Inst Theoret Phys, Santa Barbara, CA 93106 USA
基金
美国国家科学基金会;
关键词
secondary structure; pseudoknot; RNA folding; structure classification; topology;
D O I
10.1016/j.jmb.2008.04.033
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We present a novel topological classification of RNA secondary structures with pseudoknots. It is based on the topological genus of the circular diagram associated to the RNA base-pair structure. The genus is a positive integer number whose value quantifies the topological complexity of the folded RNA structure. In such a representation, planar diagrams correspond to pure RNA secondary structures and have zero genus, whereas non-planar diagrams correspond to pseudoknotted structures and have higher genus. The topological genus allows for the definition of topological folding motifs, similar in spirit to those introduced and commonly used in protein folding. We analyze real RNA structures from the databases Worldwide Protein Data Bank and Pseudobase and classify them according to their topological genus. For simplicity, we limit our analysis by considering only Watson-Crick complementary base pairs and G-U wobble base pairs. We compare the results of our statistical survey with existing theoretical and numerical models. We also discuss possible applications of this classification and show how it can be used for identifying new RNA structural motifs. (C) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:900 / 911
页数:12
相关论文
共 51 条
[1]   A curved RNA helix incorporating an internal loop with G center dot A and A center dot A non-Watson-Crick base pairing [J].
Baeyens, KJ ;
DeBondt, HL ;
Pardi, A ;
Holbrook, SR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (23) :12851-12855
[2]   Announcing the worldwide Protein Data Bank [J].
Berman, H ;
Henrick, K ;
Nakamura, H .
NATURE STRUCTURAL BIOLOGY, 2003, 10 (12) :980-980
[3]  
Bleher P., 2001, RANDOM MATRIX MODELS, V40
[4]   Predicting RNA pseudoknot folding thermodynamics [J].
Cao, Song ;
Chen, Shi-Jie .
NUCLEIC ACIDS RESEARCH, 2006, 34 (09) :2634-2652
[5]   EMBL Nucleotide Sequence Database: developments in 2005 [J].
Cochrane, Guy ;
Aldebert, Philippe ;
Althorpe, Nicola ;
Andersson, Mikael ;
Baker, Wendy ;
Baldwin, Alastair ;
Bates, Kirsty ;
Bhattacharyya, Sumit ;
Browne, Paul ;
van den Broek, Alexandra ;
Castro, Matias ;
Duggan, Karyn ;
Eberhardt, Ruth ;
Faruque, Nadeem ;
Gamble, John ;
Kanz, Carola ;
Kulikova, Tamara ;
Lee, Charles ;
Leinonen, Rasko ;
Lin, Quan ;
Lombard, Vincent ;
Lopez, Rodrigo ;
McHale, Michelle ;
McWilliam, Hamish ;
Mukherjee, Gaurab ;
Nardone, Francesco ;
Pastor, Maria Pilar Garcia ;
Sobhany, Siamak ;
Stoehr, Peter ;
Tzouvara, Katerina ;
Vaughan, Robert ;
Wu, Dan ;
Zhu, Weimin ;
Apweiler, Rolf .
NUCLEIC ACIDS RESEARCH, 2006, 34 :D10-D15
[6]   Classifying RNA pseudoknotted structures [J].
Condon, A ;
Davy, B ;
Rastegari, B ;
Zhao, S ;
Tarrant, F .
THEORETICAL COMPUTER SCIENCE, 2004, 320 (01) :35-50
[7]   The loop E loop D region of Escherichia coli 5S rRNA:: the solution structure reveals an unusual loop that may be important for binding ribosomal proteins [J].
Dallas, A ;
Moore, PB .
STRUCTURE, 1997, 5 (12) :1639-1653
[8]   An algorithm for computing nucleic acid base-pairing probabilities including pseudoknots [J].
Dirks, RM ;
Pierce, NA .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2004, 25 (10) :1295-1304
[9]   NMR structures of loop B RNAs from the stem-loop IV domain of the Enterovirus internal ribosome entry site:: A single C to U substitution drastically changes the shape and flexibility of RNA [J].
Du, ZH ;
Ulyanov, NB ;
Yu, JH ;
Andino, R ;
James, TL .
BIOCHEMISTRY, 2004, 43 (19) :5757-5771
[10]   Exploring the repertoire of RNA secondary motifs using graph theory; implications for RNA design [J].
Gan, HH ;
Pasquali, S ;
Schlick, T .
NUCLEIC ACIDS RESEARCH, 2003, 31 (11) :2926-2943