Consensus folding of aligned sequences as a new measure for the detection of functional RNAs by comparative genomics

被引:104
作者
Washietl, S [1 ]
Hofacker, IL [1 ]
机构
[1] Univ Vienna, Inst Theoret Chem & Mol Strukt Biol, A-1090 Vienna, Austria
基金
奥地利科学基金会;
关键词
conserved secondary structure; consensus structure prediction; non-coding RNAs; comparative genomics; randomizing multiple sequence alignments;
D O I
10.1016/j.jmb.2004.07.018
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Facing the ever-growing list of newly discovered classes of functional RNAs, it can be expected that further types of functional RNAs are still hidden in recently completed genomes. The computational identification of such RNA genes is, therefore, of major importance. While most known functional RNAs have characteristic secondary structures, their free energies are generally not statistically significant enough to distinguish RNA genes from the genomic background. Additional information is required. Considering the wide availability of new genomic data of closely related species, comparative studies seem to be the most promising approach. Here, we show that prediction of consensus structures of aligned sequences can be a significant measure to detect functional RNAs. We report a new method to test multiple sequence alignments for the existence of an unusually structured and conserved fold. We show for alignments of six types of well-known functional RNA that an energy score consisting of free energy and a covariation term significantly improves sensitivity compared to single sequence predictions. We further test our method on a number of non-coding RNAs from Caenorhabditis elegans/Caenorhabditis briggsae and seven Saccharomyces species. Most RNAs can be detected with high significance. We provide a Perl implementation that can be used readily to score single alignments and discuss how the methods described here can be extended to allow for efficient genome-wide screens. (C) 2004 Elsevier Ltd. All rights reserved.
引用
收藏
页码:19 / 30
页数:12
相关论文
共 44 条
[1]  
ALTSCHUL SF, 1985, MOL BIOL EVOL, V2, P526
[2]  
[Anonymous], 1998, SCIENCE, V282, P2012
[3]  
CHEN JH, 1990, COMPUT APPL BIOSCI, V6, P7
[4]   Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis [J].
Cliften, PF ;
Hillier, LW ;
Fulton, L ;
Graves, T ;
Miner, T ;
Gish, WR ;
Waterston, RH ;
Johnston, M .
GENOME RESEARCH, 2001, 11 (07) :1175-1186
[5]   Non-coding RNA genes and the modern RNA world [J].
Eddy, SR .
NATURE REVIEWS GENETICS, 2001, 2 (12) :919-929
[6]   A search for H/ACA snoRNAs in yeast using MFE secondary structure prediction [J].
Edvardsson, S ;
Gardner, PP ;
Poole, AM ;
Hendy, MD ;
Penny, D ;
Moulton, V .
BIOINFORMATICS, 2003, 19 (07) :865-873
[7]   EnteriX 2003: visualization tools for genome alignments of Enterobacteriaceae [J].
Florea, L ;
McClelland, M ;
Riemer, C ;
Schwartz, S ;
Miller, W .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3527-3532
[8]   Rfam: an RNA family database [J].
Griffiths-Jones, S ;
Bateman, A ;
Marshall, M ;
Khanna, A ;
Eddy, SR .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :439-441
[9]   FAST FOLDING AND COMPARISON OF RNA SECONDARY STRUCTURES [J].
HOFACKER, IL ;
FONTANA, W ;
STADLER, PF ;
BONHOEFFER, LS ;
TACKER, M ;
SCHUSTER, P .
MONATSHEFTE FUR CHEMIE, 1994, 125 (02) :167-188
[10]   Prediction of locally stable RNA secondary structures for genome-wide surveys [J].
Hofacker, IL ;
Priwitzer, B ;
Stadler, PF .
BIOINFORMATICS, 2004, 20 (02) :186-190