Recent advances in gene structure prediction

被引:83
作者
Brent, MR
Guigó, R
机构
[1] Washington Univ, Lab Comp Genom, St Louis, MO 63130 USA
[2] Univ Pompeu Fabra, Ctr Regulacio Genom, Inst Municipal Invest Med, Res Grp Biomed Informat, Barcelona, Catalonia, Spain
关键词
D O I
10.1016/j.sbi.2004.05.007
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
De novo gene predictors are programs that predict the exon-intron structures of genes using the sequences of one or more genomes as their only input. In the past two years, dual-genome de novo predictors, which exploit local rates and patterns of mutation inferred from alignments between two genomes, have led to significant improvements in accuracy. Systems that exploit more than two genomes simultaneously have only recently begun to appear and are not yet competitive on practical tasks, but offer the greatest hope for near-term improvements. Dual-genome de novo prediction for compact eukaryotic genomes such as those of Arabidopsis thaliana and Caenorhabditis elegans is already quite accurate. Although mammalian gene prediction lags behind in accuracy, it is yielding ever more useful results. Coupled with significant improvements in pseudogene detection methods, which have eliminated many false positives, we have reached the point where de novo gene predictions are being used as hypotheses to drive experimental annotation via systematic RT-PCR and sequencing.
引用
收藏
页码:264 / 272
页数:9
相关论文
共 65 条
[1]   SLAM: Cross-species gene finding and alignment with a generalized pair hidden Markov model [J].
Alexandersson, M ;
Cawley, S ;
Pachter, L .
GENOME RESEARCH, 2003, 13 (03) :496-502
[2]  
Allen JE, 2004, GENOME RES, V14, P142, DOI 10.1101/gr.1562804
[3]  
[Anonymous], 1997, THESIS STANFORD U
[4]   Modeling splicing sites with pairwise correlations [J].
Arita, M ;
Tsuda, K ;
Asai, K .
BIOINFORMATICS, 2002, 18 :S27-S34
[5]   Dragon Gene Start Finder: An advanced system for finding approximate locations of the start of gene transcriptional units [J].
Bajic, VB ;
Seah, SH .
GENOME RESEARCH, 2003, 13 (08) :1923-1929
[6]   Using GeneWise in the Drosophila annotation experiment [J].
Birney, E ;
Durbin, R .
GENOME RESEARCH, 2000, 10 (04) :547-548
[7]   Phylogenetic shadowing of primate sequences to find functional regions of the human genome [J].
Boffelli, D ;
McAuliffe, J ;
Ovcharenko, D ;
Lewis, KD ;
Ovcharenko, I ;
Pachter, L ;
Rubin, EM .
SCIENCE, 2003, 299 (5611) :1391-1394
[8]  
BRENDEL V, 2004, IN PRESS BIOINFORMAT
[9]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[10]  
BURGE CB, 1999, RNA WORLD, pCH20