We report an efficient method for detecting functional RNAs. The approach, which combines comparative sequence analysis and structure prediction, has already yielded excellent results for a small number of aligned sequences and is suitable for large-scale genomic screens. It consists of two basic components: (i) a measure for RNA secondary structure conservation based on computing a consensus secondary structure, and (ii) a measure for thermodynamic stability, which, in the spirit of a z score, is normalized with respect to both sequence length and base composition but can be calculated without sampling from shuffled sequences. Functional RNA secondary structures can be identified in multiple sequence alignments with high sensitivity and high specificity. We demonstrate that this approach is not only much more accurate than previous methods but also significantly faster. The method is implemented in the program RNAz, which can be downloaded from www.tbi.univie.ac.at/~wash/RNAz. We screened all alignments of length n ≥ 50 in the Comparative Regulatory Genomics database, which compiles conserved noncoding elements in upstream regions of orthologous genes from human, mouse, rat, Fugu, and zebrafish. We recovered all of the known noncoding RNAs and cis-acting elements with high significance and found compelling evidence for many other conserved RNA secondary structures that, to our knowledge, have not been described so far.
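As a rough illustration of the two components named above, the following Python sketch shows how a z-score-style stability measure and a conservation measure could be computed once the folding energies are available. It is not the RNAz implementation. The function names are hypothetical, the background mean and standard deviation are assumed to come from some precomputed model of length and base composition (the abstract states only that no shuffling is needed), and the ratio form of the conservation measure is an assumption made for illustration.

```python
from statistics import mean


def stability_z_score(mfe, background_mean, background_std):
    """Z-score-style stability: how many standard deviations the observed
    minimum free energy (MFE) lies from the expected background value.

    Assumption: background_mean and background_std are supplied by a
    precomputed model for sequences of the same length and base
    composition, so no shuffled sequences are sampled here.
    """
    if background_std <= 0:
        raise ValueError("background_std must be positive")
    return (mfe - background_mean) / background_std


def conservation_ratio(consensus_energy, single_energies):
    """Hypothetical conservation measure: energy of the consensus structure
    divided by the mean MFE of the individual sequences.

    Values near 1 suggest the consensus fold is about as stable as the
    individual folds; values near 0 suggest no shared structure.
    """
    average = mean(single_energies)
    if average == 0:
        return 0.0  # no sequence folds at all, so nothing can be conserved
    return consensus_energy / average


# Made-up energies in kcal/mol, for illustration only.
z = stability_z_score(mfe=-30.0, background_mean=-20.0, background_std=5.0)
ratio = conservation_ratio(-15.0, [-20.0, -30.0, -10.0])
```

Negative z values indicate a structure more stable than expected by chance. In a screen, the two scores would be combined by a classifier, which is outside the scope of this sketch.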