MAGPIE/EGRET annotation of the 2.9-Mb Drosophila melanogaster Adh region

被引:10
作者
Gaasterland, T
Sczyrba, A
Thomas, E
Aytekin-Kurban, G
Gordon, P
Sensen, CW
机构
[1] Rockefeller Univ, Lab Computat Genomics, New York, NY 10021 USA
[2] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[3] Natl Res Council Canada, Atlantic Reg Lab, Inst Marine Biosci, Halifax, NS B3H 3Z1, Canada
关键词
D O I
10.1101/gr.10.4.502
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Our challenge in annotating the 2.91-Mb Adh region of the Drosophila melanogaster genome was to identify genetic and genomic features automatically, completely, and precisely within a 6-week period. To do so, we augmented the MAGPIE microbial genome annotation system to handle eukaryotic genomic sequence data. The new configuration required the integration of eukaryotic gene-finding tools and DNA repeat tools into the automatic data collection module. It also required us to define in MAGPIE new strategies to combine data about eukaryotic exon predictions with functional data to refine the exon predictions. At the heart of the resulting new eukaryotic genome annotation system is a reverse comparison of public protein and complementary DNA sequences against the input genome to identify missing exons and to refine exon boundaries. The software modules that add eukaryotic genome annotation capability to MAGPIE are available as EGRET (Eukaryotic Genome Rapid Evaluation Tool).
引用
收藏
页码:502 / 510
页数:9
相关论文
共 17 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Ashburner M, 1999, GENETICS, V153, P179
[3]   PRINTS prepares for the new millennium [J].
Attwood, TK ;
Flower, DR ;
Lewis, AP ;
Mabey, JE ;
Morgan, SR ;
Scordis, P ;
Selley, JN ;
Wright, W .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :220-225
[4]   Finding the genes in genomic DNA [J].
Burge, CB ;
Karlin, S .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1998, 8 (03) :346-354
[5]   The complete genome of the hyperthermophilic bacterium Aquifex aeolicus [J].
Deckert, G ;
Warren, PV ;
Gaasterland, T ;
Young, WG ;
Lenox, AL ;
Graham, DE ;
Overbeek, R ;
Snead, MA ;
Keller, M ;
Aujay, M ;
Huber, R ;
Feldman, RA ;
Short, JM ;
Olsen, GJ ;
Swanson, RV .
NATURE, 1998, 392 (6674) :353-358
[6]  
FIELDS D, 1999, CALYPSO TANDEM REPEA
[7]  
Gaasterland T., 1997, Fundamenta Informaticae, V32, P121
[8]   Fully automated genome analysis that reflects user needs and preferences. A detailed introduction to the MAGPIE system architecture [J].
Gaasterland, T ;
Sensen, CW .
BIOCHIMIE, 1996, 78 (05) :302-310
[9]  
GAASTERLAND T, 1998, J MICROB COMP GENOMI, V3, P199
[10]  
GAASTERLAND T, 1998, J MICROB COMP GENOMI, V3, P177