Secondary structure alone is generally not statistically significant for the detection of noncoding RNAs

被引:198
作者
Rivas, E [1 ]
Eddy, SR [1 ]
机构
[1] Washington Univ, Dept Genet, St Louis, MO 63110 USA
关键词
D O I
10.1093/bioinformatics/16.7.583
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Several results in the literature suggest that biologically interesting RNAs have secondary structures that are more stable than expected by chance. Based on these observations, we developed a scanning algorithm for detecting noncoding RNA genes in genome sequences, using a fully probabilistic version of the Zuker minimum-energy folding algorithm. Results: Preliminary results were encouraging, but certain anomalies led us to do a carefully controlled investigation of this class of methods. Ultimately, our results argue that for the probabilistic model there is indeed a statistical effect, but it comes mostly from local base-composition bias and not from RNA secondary structure. For the thermodynamic implementation (which evaluates statistical significance by doing Monte Carlo shuffling in fixed-length sequence windows, thus eliminating the base-composition effect) the signals for noncoding RNAs are still usually indistinguishable from noise, especially when certain statistical artifacts resulting from local base-composition inhomogeneity are taken into account. We conclude that although a distinct, stable secondary structure is undoubtedly important in most noncoding RNAs, the stability of most noncoding RA secondary structures is not sufficiently different from the predicted stability of a random sequence to be useful as a general genefinding approach.
引用
收藏
页码:583 / 605
页数:23
相关论文
共 51 条
[1]  
[Anonymous], RNA WORLD
[2]   ANTISENSE SNORNAS - A FAMILY OF NUCLEOLAR RNAS WITH LONG COMPLEMENTARITIES TO RIBOSOMAL-RNA [J].
BACHELLERIE, JP ;
MICHOT, B ;
NICOLOSO, M ;
BALAKIN, A ;
NI, JW ;
FOURNIER, MJ .
TRENDS IN BIOCHEMICAL SCIENCES, 1995, 20 (07) :261-264
[3]  
Barrett C, 1997, COMPUT APPL BIOSCI, V13, P191
[4]  
Baserga S., 1993, In The RNA world, P359
[5]  
Bovia F, 1996, J CELL SCI, V109, P2601
[6]   THE PRODUCT OF THE MOUSE XIST GENE IS A 15 KB INACTIVE X-SPECIFIC TRANSCRIPT CONTAINING NO CONSERVED ORF AND LOCATED IN THE NUCLEUS [J].
BROCKDORFF, N ;
ASHWORTH, A ;
KAY, GF ;
MCCABE, VM ;
NORRIS, DP ;
COOPER, PJ ;
SWIFT, S ;
RASTAN, S .
CELL, 1992, 71 (03) :515-526
[7]   THE HUMAN XIST GENE - ANALYSIS OF A 17 KB INACTIVE X-SPECIFIC RNA THAT CONTAINS CONSERVED REPEATS AND IS HIGHLY LOCALIZED WITHIN THE NUCLEUS [J].
BROWN, CJ ;
HENDRICH, BD ;
RUPERT, JL ;
LAFRENIERE, RG ;
XING, Y ;
LAWRENCE, J ;
WILLARD, HF .
CELL, 1992, 71 (03) :527-542
[8]   Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii [J].
Bult, CJ ;
White, O ;
Olsen, GJ ;
Zhou, LX ;
Fleischmann, RD ;
Sutton, GG ;
Blake, JA ;
FitzGerald, LM ;
Clayton, RA ;
Gocayne, JD ;
Kerlavage, AR ;
Dougherty, BA ;
Tomb, JF ;
Adams, MD ;
Reich, CI ;
Overbeek, R ;
Kirkness, EF ;
Weinstock, KG ;
Merrick, JM ;
Glodek, A ;
Scott, JL ;
Geoghagen, NSM ;
Weidman, JF ;
Fuhrmann, JL ;
Nguyen, D ;
Utterback, TR ;
Kelley, JM ;
Peterson, JD ;
Sadow, PW ;
Hanna, MC ;
Cotton, MD ;
Roberts, KM ;
Hurst, MA ;
Kaine, BP ;
Borodovsky, M ;
Klenk, HP ;
Fraser, CM ;
Smith, HO ;
Woese, CR ;
Venter, JC .
SCIENCE, 1996, 273 (5278) :1058-1073
[9]  
CECH TR, 1993, RNA WORLD, P239
[10]  
CHEN JH, 1990, COMPUT APPL BIOSCI, V6, P7