Importing statistical measures into Artemis enhances gene identification in the Leishmania genome project -: art. no. 23

被引:16
作者
Aggarwal, G
Worthey, EA
McDonagh, PD
Myler, PJ
机构
[1] Seattle Biomed Res Inst, Seattle, WA 98109 USA
[2] Immunex Res & Dev Corp, Seattle, WA 98101 USA
[3] Univ Washington, Dept Pathobiol, Seattle, WA 98195 USA
[4] Univ Washington, Dept Med Educ, Seattle, WA 98195 USA
[5] Univ Washington, Dept Biomed Informat, Seattle, WA 98195 USA
基金
英国惠康基金;
关键词
D O I
10.1186/1471-2105-4-23
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Seattle Biomedical Research Institute (SBRI) as part of the Leishmania Genome Network (LGN) is sequencing chromosomes of the trypanosomatid protozoan species Leishmania major. At SBRI, chromosomal sequence is annotated using a combination of trained and untrained non-consensus gene-prediction algorithms with ARTEMIS, an annotation platform with rich and user-friendly interfaces. Results: Here we describe a methodology used to import results from three different proteincoding gene-prediction algorithms (GLIMMER, TESTCODE and GENESCAN) into the ARTEMIS sequence viewer and annotation tool. Comparison of these methods, along with the CODONUSAGE algorithm built into ARTEMIS, shows the importance of combining methods to more accurately annotate the L. major genomic sequence. Conclusion: An improvised and powerful tool for gene prediction has been developed by importing data from widely-used algorithms into an existing annotation platform. This approach is especially fruitful in the Leishmania genome project where there is large proportion of novel genes requiring manual annotation.
引用
收藏
页数:5
相关论文
共 19 条
[1]   Ab initio gene identification:: prokaryote genome annotation with GeneScan and GLIMMER [J].
Aggarwal, G ;
Ramaswamy, R .
JOURNAL OF BIOSCIENCES, 2002, 27 (01) :7-14
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]   Finding the genes in genomic DNA [J].
Burge, CB ;
Karlin, S .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1998, 8 (03) :346-354
[4]   Computational methods for the identification of genes in vertebrate genomic sequences [J].
Claverie, JM .
HUMAN MOLECULAR GENETICS, 1997, 6 (10) :1735-1744
[5]   Improved microbial gene identification with GLIMMER [J].
Delcher, AL ;
Harmon, D ;
Kasif, S ;
White, O ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (23) :4636-4641
[6]   The gene identification problem: An overview for developers [J].
Fickett, JW .
COMPUTERS & CHEMISTRY, 1996, 20 (01) :103-118
[7]   RECOGNITION OF PROTEIN CODING REGIONS IN DNA-SEQUENCES [J].
FICKETT, JW .
NUCLEIC ACIDS RESEARCH, 1982, 10 (17) :5303-5318
[8]  
Guigo R, 1999, GENETIC DATABASES, P53
[9]   GAZE: A generic framework for the integration of gene-prediction data by dynamic programming [J].
Howe, KL ;
Chothia, T ;
Durbin, R .
GENOME RESEARCH, 2002, 12 (09) :1418-1427
[10]  
Jones J, 2002, INT J ONCOL, V20, P697