Using RNA secondary structures to guide sequence motif finding towards single-stranded regions

被引:118
作者
Hiller, Michael [1 ]
Pudimat, Rainer [1 ]
Busch, Anke [1 ]
Backofen, Rolf [1 ]
机构
[1] Univ Freiburg, Chair Bioinformat, Inst Comp Sci, D-79110 Freiburg, Germany
关键词
D O I
10.1093/nar/gkl544
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
RNA binding proteins recognize RNA targets in a sequence specific manner. Apart from the sequence, the secondary structure context of the binding site also affects the binding affinity. Binding sites are often located in single-stranded RNA regions and it was shown that the sequestration of a binding motif in a double-strand abolishes protein binding. Thus, it is desirable to include knowledge about RNA secondary structures when searching for the binding motif of a protein. We present the approach MEMERIS for searching sequence motifs in a set of RNA sequences and simultaneously integrating information about secondary structures. To abstract from specific structural elements, we precompute position-specific values measuring the single-strandedness of all substrings of an RNA sequence. These values are used as prior knowledge about the motif starts to guide the motif search. Extensive tests with artificial and biological data demonstrate that MEMERIS is able to identify motifs in single-stranded regions even if a stronger motif located in double-strand parts exists. The discovered motif occurrences in biological datasets mostly coincide with known protein-binding sites. This algorithm can be used for finding the binding motif of single-stranded RNA-binding proteins in SELEX or other biological sequence data.
引用
收藏
页数:10
相关论文
共 34 条
[1]  
Backofen Rolf, 2004, J Bioinform Comput Biol, V2, P681, DOI 10.1142/S0219720004000818
[2]  
Bailey T L, 1994, Proc Int Conf Intell Syst Mol Biol, V2, P28
[3]  
BAILEY TL, 1995, MACH LEARN, V21, P51, DOI 10.1007/BF00993379
[4]  
BAILEY TL, 1995, THESIS U CALIFORNIA
[5]   The neuronal RNA binding protein Nova-1 recognizes specific RNA targets in vitro and in vivo [J].
Buckanovich, RJ ;
Darnell, RB .
MOLECULAR AND CELLULAR BIOLOGY, 1997, 17 (06) :3194-3201
[6]   RNA sequence and secondary structure participate in high-affinity CsrA-RNA interaction [J].
Dubey, AK ;
Baker, CS ;
Romeo, T ;
Babitzke, P .
RNA, 2005, 11 (10) :1579-1587
[7]   Nucleolin is a sequence-specific RNA-binding protein: Characterization of targets on pre-ribosomal RNA [J].
GhisolfiNieto, L ;
Joseph, G ;
PuvionDutilleul, F ;
Amalric, F ;
Bouvet, P .
JOURNAL OF MOLECULAR BIOLOGY, 1996, 260 (01) :34-53
[8]   Rfam: annotating non-coding RNAs in complete genomes [J].
Griffiths-Jones, S ;
Moxon, S ;
Marshall, M ;
Khanna, A ;
Eddy, SR ;
Bateman, A .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D121-D124
[9]   The effect of RNA secondary structures on RNA-ligand binding and the modifier RNA mechanism:: a quantitative model [J].
Hackermüller, J ;
Meisner, NC ;
Auer, M ;
Jaritz, M ;
Stadler, PF .
GENE, 2005, 345 (01) :3-12
[10]   RNA-protein interactions [J].
Hall, KB .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (03) :283-288