Computational modeling of gene structure in Arabidopsis thaliana

被引:15
作者
Brendel, V [1 ]
Zhu, W
机构
[1] Iowa State Univ, Dept Zool & Genet, Ames, IA 50010 USA
[2] Iowa State Univ, Dept Stat, Ames, IA 50010 USA
关键词
EST analysis; gene prediction; spliced alignment;
D O I
10.1023/A:1013778321222
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Computational gene identification by sequence inspection remains a challenging problem. For a typical Arabidopsis thaliana gene with five exons, at least one of the exons is expected to have at least one of its borders predicted incorrectly by ab initio gene finding programs. More detailed analysis for individual genomic loci can often resolve the uncertainty on the basis of EST evidence or similarity to potential protein homologues. Such methods are part of the routine annotation process. However, because the EST and protein databases are constantly growing, in many cases original annotation must be re-evaluated, extended, and corrected on the basis of the latest evidence. The Arabidopsis Genome Initiative is undertaking this task on the whole-genome scale via its participating genome centers. The current Arabidopsis genome annotation provides an excellent starting point for assessing the protein repertoire of a flowering plant. More accurate whole-genome annotation will require the combination of high-throughput and individual gene experimental approaches and computational methods. The purpose of this article is to discuss tools available to an individual researcher to evaluate gene structure prediction for a particular locus.
引用
收藏
页码:49 / 58
页数:10
相关论文
共 23 条