Gene finding in novel genomes

被引:2224
作者
Korf, I [1 ]
机构
[1] Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England
关键词
D O I
10.1186/1471-2105-5-59
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Computational gene prediction continues to be an important problem, especially for genomes with little experimental data. Results: I introduce the SNAP gene finder which has been designed to be easily adaptable to a variety of genomes. In novel genomes without an appropriate gene finder, I demonstrate that employing a foreign gene finder can produce highly inaccurate results, and that the most compatible parameters may not come from the nearest phylogenetic neighbor. I find that foreign gene finders are more usefully employed to bootstrap parameter estimation and that the resulting parameters can be highly accurate. Conclusion: Since gene prediction is sensitive to species-specific parameters, every genome needs a dedicated gene finder.
引用
收藏
页数:9
相关论文
共 24 条
  • [11] KROGH A, 1997, ISMB, V5, P179
  • [12] Kulp D, 1996, Proc Int Conf Intell Syst Mol Biol, V4, P134
  • [13] A computational analysis of sequence features involved in recognition of short introns
    Lim, LP
    Burge, CB
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (20) : 11193 - 11198
  • [14] GlimmerM, Exonomy and Unveil:: three ab initio eukaryotic genefinders
    Majoros, WH
    Pertea, M
    Antonescu, C
    Salzberg, SL
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (13) : 3601 - 3604
  • [15] GeneID in Drosophila
    Parra, G
    Blanco, E
    Guigó, R
    [J]. GENOME RESEARCH, 2000, 10 (04) : 511 - 515
  • [16] Genome annotation assessment in Drosophila melanogaster
    Reese, MG
    Hartzell, G
    Harris, NL
    Ohler, U
    Abril, JF
    Lewis, SE
    [J]. GENOME RESEARCH, 2000, 10 (04) : 483 - 501
  • [17] RiceGAAS: an automated annotation system and database for rice genome sequence
    Sakata, K
    Nagamura, Y
    Numa, H
    Antonio, BA
    Nagasaki, H
    Idonuma, A
    Watanabe, W
    Shimizu, Y
    Horiuchi, I
    Matsumoto, T
    Sasaki, T
    Higo, K
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (01) : 98 - 102
  • [18] *SANG I, SRS7 SANG I
  • [19] Smit AFA, RepeatMasker
  • [20] SOLOVYEV V, 1997, ISMB, V5, P294