GenDB -: an open source genome annotation system for prokaryote genomes

被引:570
作者
Meyer, F [1 ]
Goesmann, A
McHardy, AC
Bartels, D
Bekel, T
Clausen, J
Kalinowski, J
Linke, B
Rupp, O
Giegerich, R
Pühler, A
机构
[1] Univ Bielefeld, Ctr Gene Res, Tech Fak, D-4800 Bielefeld, Germany
[2] Univ Bielefeld, Dept Biol, D-4800 Bielefeld, Germany
关键词
D O I
10.1093/nar/gkg312
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The flood of sequence data resulting from the large number of current genome projects has increased the need for a flexible, open source genome annotation system, which so far has not existed. To account for the individual needs of different projects, such a system should be modular and easily extensible. We present a genome annotation system for prokaryote genomes, which is well tested and readily adaptable to different tasks. The modular system was developed using an object-oriented approach, and it relies on a relational database backend. Using a well defined application programmers interface (API), the system can be linked easily to other systems. GenDB supports manual as well as automatic annotation strategies. The software currently is in use in more than a dozen microbial genome annotation projects. In addition to its use as a production genome annotation system, it can be employed as a flexible framework for the large-scale evaluation of different annotation strategies. The system is open source.
引用
收藏
页码:2187 / 2195
页数:9
相关论文
共 20 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   Automated genome sequence analysis and annotation [J].
Andrade, MA ;
Brown, NP ;
Leroy, C ;
Hoersch, S ;
de Daruvar, A ;
Reich, C ;
Franchini, A ;
Tamames, J ;
Valencia, A ;
Ouzounis, C ;
Sander, C .
BIOINFORMATICS, 1999, 15 (05) :391-412
[3]   The InterPro database, an integrated documentation resource for protein families, domains and functional sites [J].
Apweiler, R ;
Attwood, TK ;
Bairoch, A ;
Bateman, A ;
Birney, E ;
Biswas, M ;
Bucher, P ;
Cerutti, T ;
Corpet, F ;
Croning, MDR ;
Durbin, R ;
Falquet, L ;
Fleischmann, W ;
Gouzy, J ;
Hermjakob, H ;
Hulo, N ;
Jonassen, I ;
Kahn, D ;
Kanapin, A ;
Karavidopoulou, Y ;
Lopez, R ;
Marx, B ;
Mulder, NJ ;
Oinn, TM ;
Pagni, M ;
Servant, F ;
Sigrist, CJA ;
Zdobnov, EM .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :37-40
[4]   CRITICA: Coding region identification tool invoking comparative analysis [J].
Badger, JH ;
Olsen, GJ .
MOLECULAR BIOLOGY AND EVOLUTION, 1999, 16 (04) :512-524
[5]   Improved microbial gene identification with GLIMMER [J].
Delcher, AL ;
Harmon, D ;
Kasif, S ;
White, O ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (23) :4636-4641
[6]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[7]  
ETZOLD T, 1993, COMPUT APPL BIOSCI, V9, P49
[8]   Functional and structural genomics using PEDANT [J].
Frishman, D ;
Albermann, K ;
Hani, J ;
Heumann, K ;
Metanomski, A ;
Zollner, A ;
Mewes, HW .
BIOINFORMATICS, 2001, 17 (01) :44-57
[9]   MAGPIE: Automated genome interpretation [J].
Gaasterland, T ;
Sensen, CW .
TRENDS IN GENETICS, 1996, 12 (02) :76-78
[10]   The composite genome of the legume symbiont Sinorhizobium meliloti [J].
Galibert, F ;
Finan, TM ;
Long, SR ;
Pühler, A ;
Abola, P ;
Ampe, F ;
Barloy-Hubler, F ;
Barnett, MJ ;
Becker, A ;
Boistard, P ;
Bothe, G ;
Boutry, M ;
Bowser, L ;
Buhrmester, J ;
Cadieu, E ;
Capela, D ;
Chain, P ;
Cowie, A ;
Davis, RW ;
Dréano, S ;
Federspiel, NA ;
Fisher, RF ;
Gloux, S ;
Godrie, T ;
Goffeau, A ;
Golding, B ;
Gouzy, J ;
Gurjal, M ;
Hernandez-Lucas, I ;
Hong, A ;
Huizar, L ;
Hyman, RW ;
Jones, T ;
Kahn, D ;
Kahn, ML ;
Kalman, S ;
Keating, DH ;
Kiss, E ;
Komp, C ;
Lalaure, V ;
Masuy, D ;
Palm, C ;
Peck, MC ;
Pohl, TM ;
Portetelle, D ;
Purnelle, B ;
Ramsperger, U ;
Surzycki, R ;
Thébault, P ;
Vandenbol, M .
SCIENCE, 2001, 293 (5530) :668-672