Computational modeling of gene structure in Arabidopsis thaliana

被引：15

作者：

Brendel, V ^{[1
]}

Zhu, W

机构：

[1] Iowa State Univ, Dept Zool & Genet, Ames, IA 50010 USA

[2] Iowa State Univ, Dept Stat, Ames, IA 50010 USA

来源：

PLANT MOLECULAR BIOLOGY | 2002年 / 48卷 / 1-2期

关键词：

EST analysis; gene prediction; spliced alignment;

D O I：

10.1023/A:1013778321222

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

Computational gene identification by sequence inspection remains a challenging problem. For a typical Arabidopsis thaliana gene with five exons, at least one of the exons is expected to have at least one of its borders predicted incorrectly by ab initio gene finding programs. More detailed analysis for individual genomic loci can often resolve the uncertainty on the basis of EST evidence or similarity to potential protein homologues. Such methods are part of the routine annotation process. However, because the EST and protein databases are constantly growing, in many cases original annotation must be re-evaluated, extended, and corrected on the basis of the latest evidence. The Arabidopsis Genome Initiative is undertaking this task on the whole-genome scale via its participating genome centers. The current Arabidopsis genome annotation provides an excellent starting point for assessing the protein repertoire of a flowering plant. More accurate whole-genome annotation will require the combination of high-throughput and individual gene experimental approaches and computational methods. The purpose of this article is to discuss tools available to an individual researcher to evaluate gene structure prediction for a particular locus.

引用

页码：49 / 58

页数：10

共 23 条

[1] Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Altschul, SF
Madden, TL
Schaffer, AA
Zhang, JH
Zhang, Z
Miller, W
Lipman, DJ
[J]. NUCLEIC ACIDS RESEARCH, 1997, 25 (17) : 3389 - 3402
[2] [Anonymous], 1998, NATURE, DOI DOI 10.1038/35140
[3] Comparative sequence analysis of plant nuclear genomes: Microcolinearity and its many exceptions
Bennetzen, JL
[J]. PLANT CELL, 2000, 12 (07) : 1021 - 1029
[4] Prediction of locally optimal splice sites in plant pre-mRNA with applications to gene identification in Arabidopsis thaliana genomic DNA
Brendel, V
Kleffe, J
[J]. NUCLEIC ACIDS RESEARCH, 1998, 26 (20) : 4748 - 4757
[5] Prediction of complete gene structures in human genomic DNA
Burge, C
Karlin, S
[J]. JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) : 78 - 94
[6] Computational methods for gene annotation:: the Arabidopsis genome
Cho, YR
Walbot, V
[J]. CURRENT OPINION IN BIOTECHNOLOGY, 2001, 12 (02) : 126 - 130
[7] Computational methods for the identification of genes in vertebrate genomic sequences
Claverie, JM
[J]. HUMAN MOLECULAR GENETICS, 1997, 6 (10) : 1735 - 1744
[8] A computer program for aligning a cDNA sequence with a genomic DNA sequence
Florea, L
Hartzell, G
Zhang, Z
Rubin, GM
Miller, W
[J]. GENOME RESEARCH, 1998, 8 (09) : 967 - 974
[9] Gene recognition via spliced sequence alignment
Gelfand, MS
Mironov, AA
Pevzner, PA
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (17) : 9061 - 9066
[10] Huang XQ, 1996, COMPUT APPL BIOSCI, V12, P497

← 1 2 3 →