Recognition of unknown conserved alternatively spliced exons

被引:40
作者
Ohler, U [1 ]
Shomron, N [1 ]
Burge, CB [1 ]
机构
[1] MIT, Dept Biol, Cambridge, MA 02139 USA
关键词
D O I
10.1371/journal.pcbi.0010015
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The split structure of most mammalian protein-coding genes allows for the potential to produce multiple different mRNA and protein isoforms from a single gene locus through the process of alternative splicing (AS). We propose a computational approach called UNCOVER based on a pair hidden Markov model to discover conserved coding exonic sequences subject to AS that have so far gone undetected. Applying UNCOVER to orthologous introns of known human and mouse genes predicts skipped exons or retained introns present in both species, while discriminating them from conserved noncoding sequences. The accuracy of the model is evaluated on a curated set of genes with known conserved AS events. The prediction of skipped exons in the similar to 1% of the human genome represented by the ENCODE regions leads to more than 50 new exon candidates. Five novel predicted AS exons were validated by RT-PCR and sequencing analysis of 15 introns with strong UNCOVER predictions and lacking EST evidence. These results imply that a considerable number of conserved exonic sequences and associated isoforms are still completely missing from the current annotation of known genes. UNCOVER also identifies a small number of candidates for conserved intron retention.
引用
收藏
页码:113 / 122
页数:10
相关论文
共 40 条
[1]   SLAM: Cross-species gene finding and alignment with a generalized pair hidden Markov model [J].
Alexandersson, M ;
Cawley, S ;
Pachter, L .
GENOME RESEARCH, 2003, 13 (03) :496-502
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   The Vertebrate Genome Annotation (Vega) database [J].
Ashurst, JL ;
Chen, CK ;
Gilbert, JGR ;
Jekosch, K ;
Keenan, S ;
Meidl, P ;
Searle, SM ;
Stalker, J ;
Storey, R ;
Trevanion, S ;
Wilming, L ;
Hubbard, T .
NUCLEIC ACIDS RESEARCH, 2005, 33 :D459-D465
[4]   How did alternative splicing evolve? [J].
Ast, G .
NATURE REVIEWS GENETICS, 2004, 5 (10) :773-782
[5]   Ultraconserved elements in the human genome [J].
Bejerano, G ;
Pheasant, M ;
Makunin, I ;
Stephen, S ;
Kent, WJ ;
Mattick, JS ;
Haussler, D .
SCIENCE, 2004, 304 (5675) :1321-1325
[6]   Ensembl 2004 [J].
Birney, E ;
Andrews, D ;
Bevan, P ;
Caccamo, M ;
Cameron, G ;
Chen, Y ;
Clarke, L ;
Coates, G ;
Cox, T ;
Cuff, J ;
Curwen, V ;
Cutts, T ;
Down, T ;
Durbin, R ;
Eyras, E ;
Fernandez-Suarez, XM ;
Gane, P ;
Gibbins, B ;
Gilbert, J ;
Hammond, M ;
Hotz, H ;
Iyer, V ;
Kahari, A ;
Jekosch, K ;
Kasprzyk, A ;
Keefe, D ;
Keenan, S ;
Lehvaslaiho, H ;
McVicker, G ;
Melsopp, C ;
Meidl, P ;
Mongin, E ;
Pettett, R ;
Potter, S ;
Proctor, G ;
Rae, M ;
Searle, S ;
Slater, G ;
Smedley, D ;
Smith, J ;
Spooner, W ;
Stabenau, A ;
Stalker, J ;
Storey, R ;
Ureta-Vidal, A ;
Woodwark, C ;
Clamp, M ;
Hubbard, T .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D468-D470
[7]   Mechanisms of alternative pre-messenger RNA splicing [J].
Black, DL .
ANNUAL REVIEW OF BIOCHEMISTRY, 2003, 72 :291-336
[8]   Alternative splicing and genome complexity [J].
Brett, D ;
Pospisil, H ;
Valcárcel, J ;
Reich, J ;
Bork, P .
NATURE GENETICS, 2002, 30 (01) :29-30
[9]  
Burge CB, 1999, RNA WORLD, P525
[10]   HMM sampling and applications to gene finding and alternative splicing [J].
Cawley, Simon L. ;
Pachter, Lior .
BIOINFORMATICS, 2003, 19 :II36-II41