Recent advances in gene structure prediction

被引:83
作者
Brent, MR
Guigó, R
机构
[1] Washington Univ, Lab Comp Genom, St Louis, MO 63130 USA
[2] Univ Pompeu Fabra, Ctr Regulacio Genom, Inst Municipal Invest Med, Res Grp Biomed Informat, Barcelona, Catalonia, Spain
关键词
D O I
10.1016/j.sbi.2004.05.007
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
De novo gene predictors are programs that predict the exon-intron structures of genes using the sequences of one or more genomes as their only input. In the past two years, dual-genome de novo predictors, which exploit local rates and patterns of mutation inferred from alignments between two genomes, have led to significant improvements in accuracy. Systems that exploit more than two genomes simultaneously have only recently begun to appear and are not yet competitive on practical tasks, but offer the greatest hope for near-term improvements. Dual-genome de novo prediction for compact eukaryotic genomes such as those of Arabidopsis thaliana and Caenorhabditis elegans is already quite accurate. Although mammalian gene prediction lags behind in accuracy, it is yielding ever more useful results. Coupled with significant improvements in pseudogene detection methods, which have eliminated many false positives, we have reached the point where de novo gene predictions are being used as hypotheses to drive experimental annotation via systematic RT-PCR and sequencing.
引用
收藏
页码:264 / 272
页数:9
相关论文
共 65 条
[31]   Distribution and characterization of regulatory elements in the human genome [J].
Majewski, J ;
Ott, J .
GENOME RESEARCH, 2002, 12 (12) :1827-1836
[32]   Gene structure conservation aids similarity based gene prediction [J].
Meyer, IM ;
Durbin, R .
NUCLEIC ACIDS RESEARCH, 2004, 32 (02) :776-783
[33]  
*MGC PROJ TEAM, 2004, IN PRESS GENOME RES, V14
[34]   Gene structure prediction in syntenic DNA segments [J].
Moore, JE ;
Lake, JA .
NUCLEIC ACIDS RESEARCH, 2003, 31 (24) :7271-7279
[35]   ETOPE: evolutionary test of predicted exons [J].
Nekrutenko, A ;
Chung, WY ;
Li, WH .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3564-3567
[36]   An evolutionary approach reveals a high protein-coding capacity of the human genome [J].
Nekrutenko, A ;
Chung, WY ;
Li, WH .
TRENDS IN GENETICS, 2003, 19 (06) :306-310
[37]  
Noguchi Hideki, 2002, Genome Inform, V13, P183
[38]   GeneID in Drosophila [J].
Parra, G ;
Blanco, E ;
Guigó, R .
GENOME RESEARCH, 2000, 10 (04) :511-515
[39]   Comparative gene prediction in human and mouse [J].
Parra, G ;
Agarwal, P ;
Abril, JF ;
Wiehe, T ;
Fickett, JW ;
Guigó, R .
GENOME RESEARCH, 2003, 13 (01) :108-117
[40]   A Bayesian framework for combining gene predictions [J].
Pavlovic, V ;
Garg, A ;
Kasif, S .
BIOINFORMATICS, 2002, 18 (01) :19-27