Computational gene finding in plants

被引:30
作者
Pertea, M [1 ]
Salzberg, SL [1 ]
机构
[1] Inst Genome Res, Rockville, MD 20850 USA
关键词
computational gene finding; genome sequencing;
D O I
10.1023/A:1013770123580
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Automated methods for identifying protein coding regions in genomic DNA have progressed significantly in recent years, but there is still a strong need for more accurate computational solutions to the gene finding problem. Large-scale genome sequencing projects depend greatly on gene finding to generate accurate and complete gene annotation. Improvements in gene finding software are being driven by the development of better computational algorithms, a better understanding of the cell's mechanisms for transcription and translation, and the enormous increases in genomic sequence data. This paper reviews some of the most widely used algorithms for gene finding in plants, including technical descriptions of how they work and recent measurements of their success on the genomes of Arabidopsis thaliana and rice.
引用
收藏
页码:39 / 48
页数:10
相关论文
共 38 条
[1]   The genome sequence of Drosophila melanogaster [J].
Adams, MD ;
Celniker, SE ;
Holt, RA ;
Evans, CA ;
Gocayne, JD ;
Amanatides, PG ;
Scherer, SE ;
Li, PW ;
Hoskins, RA ;
Galle, RF ;
George, RA ;
Lewis, SE ;
Richards, S ;
Ashburner, M ;
Henderson, SN ;
Sutton, GG ;
Wortman, JR ;
Yandell, MD ;
Zhang, Q ;
Chen, LX ;
Brandon, RC ;
Rogers, YHC ;
Blazej, RG ;
Champe, M ;
Pfeiffer, BD ;
Wan, KH ;
Doyle, C ;
Baxter, EG ;
Helt, G ;
Nelson, CR ;
Miklos, GLG ;
Abril, JF ;
Agbayani, A ;
An, HJ ;
Andrews-Pfannkoch, C ;
Baldwin, D ;
Ballew, RM ;
Basu, A ;
Baxendale, J ;
Bayraktaroglu, L ;
Beasley, EM ;
Beeson, KY ;
Benos, PV ;
Berman, BP ;
Bhandari, D ;
Bolshakov, S ;
Borkova, D ;
Botchan, MR ;
Bouck, J ;
Brokstein, P .
SCIENCE, 2000, 287 (5461) :2185-2195
[2]  
[Anonymous], 1998, COMPUTATIONAL METHOD
[3]   PREDICTION OF HUMAN MESSENGER-RNA DONOR AND ACCEPTOR SITES FROM THE DNA-SEQUENCE [J].
BRUNAK, S ;
ENGELBRECHT, J ;
KNUDSEN, S .
JOURNAL OF MOLECULAR BIOLOGY, 1991, 220 (01) :49-65
[4]   Prediction of complete gene structures in human genomic DNA [J].
Burge, C ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 268 (01) :78-94
[5]   Computational methods for the identification of genes in vertebrate genomic sequences [J].
Claverie, JM .
HUMAN MOLECULAR GENETICS, 1997, 6 (10) :1735-1744
[6]   STATISTICAL-ANALYSIS OF VERTEBRATE SEQUENCES REVEALS THAT LONG GENES ARE SCARCE IN GC-RICH ISOCHORES [J].
DURET, L ;
MOUCHIROUD, D ;
GAUTIER, C .
JOURNAL OF MOLECULAR EVOLUTION, 1995, 40 (03) :308-317
[7]   Prediction of transcription terminators in bacterial genomes [J].
Ermolaeva, MD ;
Khalak, HG ;
White, O ;
Smith, HO ;
Salzberg, SL .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 301 (01) :27-33
[8]   DETERMINATION OF EUKARYOTIC PROTEIN CODING REGIONS USING NEURAL NETWORKS AND INFORMATION-THEORY [J].
FARBER, R ;
LAPEDES, A ;
SIROTKIN, K .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 226 (02) :471-479
[9]   The gene identification problem: An overview for developers [J].
Fickett, JW .
COMPUTERS & CHEMISTRY, 1996, 20 (01) :103-118
[10]   IDENTIFICATION OF NEW SCHISTOSOMA-MANSONI GENES BY THE EST STRATEGY USING A DIRECTIONAL CDNA LIBRARY [J].
FRANCO, GR ;
ADAMS, MD ;
SOARES, MB ;
SIMPSON, AJG ;
VENTER, JC ;
PENA, SDJ .
GENE, 1995, 152 (02) :141-147