IMG ER: a system for microbial genome annotation expert review and curation

被引:711
作者
Markowitz, Victor M. [1 ]
Mavromatis, Konstantinos [2 ]
Ivanova, Natalia N. [2 ]
Chen, I-Min A. [1 ]
Chu, Ken [1 ]
Kyrpides, Nikos C. [2 ]
机构
[1] Univ Calif Berkeley, Lawrence Berkeley Lab, Biol Data Management & Technol Ctr, Berkeley, CA 94720 USA
[2] DOE Joint Genome Inst, Genome Biol Program, Walnut Creek, CA 94598 USA
关键词
DATABASE; SEQUENCE; ENZYMES;
D O I
10.1093/bioinformatics/btp393
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: A rapidly increasing number of microbial genomes are sequenced by organizations worldwide and are eventually included into various public genome data resources. The quality of the annotations depends largely on the original dataset providers, with erroneous or incomplete annotations often carried over into the public resources and difficult to correct. Results: We have developed an Expert Review (ER) version of the Integrated Microbial Genomes (IMG) system, with the goal of supporting systematic and efficient revision of microbial genome annotations. IMG ER provides tools for the review and curation of annotations of both new and publicly available microbial genomes within IMG's rich integrated genome framework. New genome datasets are included into IMG ER prior to their public release either with their native annotations or with annotations generated by IMG ER's annotation pipeline. IMG ER tools allow addressing annotation problems detected with IMG's comparative analysis tools, such as genes missed by gene prediction pipelines or genes without an associated function. Over the past year, IMG ER was used for improving the annotations of about 150 microbial genomes.
引用
收藏
页码:2271 / 2278
页数:8
相关论文
共 27 条
[21]   NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins [J].
Pruitt, Kim D. ;
Tatusova, Tatiana ;
Maglott, Donna R. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D61-D65
[22]   Artemis: sequence visualization and annotation [J].
Rutherford, K ;
Parkhill, J ;
Crook, J ;
Horsnell, T ;
Rice, P ;
Rajandream, MA ;
Barrell, B .
BIOINFORMATICS, 2000, 16 (10) :944-945
[23]   Genome re-annotation: a wiki solution? [J].
Salzberg, Steven L. .
GENOME BIOLOGY, 2007, 8 (01)
[24]   TIGRFAMs and Genome Properties: tools for the assignment of molecular function and biological process in prokaryotic genomes [J].
Selengut, Jeremy D. ;
Haft, Daniel H. ;
Davidsen, Tanja ;
Ganapathy, Anurhada ;
Gwinn-Giglio, Michelle ;
Nelson, William C. ;
Richter, Alexander R. ;
White, Owen .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D260-D264
[25]   The COG database: an updated version includes eukaryotes [J].
Tatusov, RL ;
Fedorova, ND ;
Jackson, JD ;
Jacobs, AR ;
Kiryutin, B ;
Koonin, EV ;
Krylov, DM ;
Mazumder, R ;
Mekhedov, SL ;
Nikolskaya, AN ;
Rao, BS ;
Smirnov, S ;
Sverdlov, AV ;
Vasudevan, S ;
Wolf, YI ;
Yin, JJ ;
Natale, DA .
BMC BIOINFORMATICS, 2003, 4 (1)
[26]   MaGe:: a microbial genome annotation system supported by synteny results [J].
Vallenet, D ;
Labarre, L ;
Rouy, Z ;
Barbe, V ;
Bocs, S ;
Cruveiller, S ;
Lajus, A ;
Pascal, G ;
Scarpelli, C ;
Médigue, C .
NUCLEIC ACIDS RESEARCH, 2006, 34 (01) :53-65
[27]   Pseudomonas Genome Database: facilitating user-friendly, comprehensive comparisons of microbial genomes [J].
Winsor, Geoffrey L. ;
Van Rossum, Thea ;
Lo, Raymond ;
Khaira, Bhavjinder ;
Whiteside, Matthew D. ;
Hancock, Robert E. W. ;
Brinkman, Fiona S. L. .
NUCLEIC ACIDS RESEARCH, 2009, 37 :D483-D488