Update on RefSeq microbial genomes resources

Cited by: 100
Authors
Tatusova, Tatiana [1]
Ciufo, Stacy [1]
Federhen, Scott [1]
Fedorov, Boris [1]
McVeigh, Richard [1]
O'Neill, Kathleen [1]
Tolstoy, Igor [1]
Zaslavsky, Leonid [1]
Affiliations
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
Funding
National Institutes of Health (USA);
Keywords
DATABASE;
DOI
10.1093/nar/gku1062
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
The NCBI RefSeq genome collection (http://www.ncbi.nlm.nih.gov/genome) represents all three major domains of life: Eukarya, Bacteria and Archaea, as well as Viruses. Prokaryotic genome sequences are the most rapidly growing part of the collection. During 2014, more than 10 000 microbial genome assemblies were publicly released, bringing the total number of prokaryotic genomes close to 30 000. We continue to improve the quality and usability of the microbial genome resources by providing easy access to the data and the results of the pre-computed analysis, and by improving analysis and visualization tools. A number of improvements have been incorporated into the Prokaryotic Genome Annotation Pipeline. Several new features have been added to the RefSeq prokaryotic genomes data processing pipeline, including the calculation of genome groups (clades) and the optimization of protein cluster generation using a pan-genome approach.
Pages: D599-D605
Page count: 7
Related papers
14 items in total
  • [1] BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata
    Barrett, Tanya
    Clark, Karen
    Gevorgyan, Robert
    Gorelenkov, Vyacheslav
    Gribov, Eugene
    Karsch-Mizrachi, Ilene
    Kimelman, Michael
    Pruitt, Kim D.
    Resenchuk, Sergei
    Tatusova, Tatiana
    Yaschenko, Eugene
    Ostell, James
    [J]. NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) : D57 - D63
  • [2] BLAST+: architecture and applications
    Camacho, Christiam
    Coulouris, George
    Avagyan, Vahram
    Ma, Ning
    Papadopoulos, Jason
    Bealer, Kevin
    Madden, Thomas L.
    [J]. BMC BIOINFORMATICS, 2009, 10
  • [3] PhyloSift: phylogenetic analysis of genomes and metagenomes
    Darling, Aaron E.
    Jospin, Guillaume
    Lowe, Eric
    Matsen, Frederick A., IV
    Bik, Holly M.
    Eisen, Jonathan A.
    [J]. PEERJ, 2014, 2
  • [4] Toward richer metadata for microbial sequences: replacing strain-level NCBI taxonomy taxids with BioProject, BioSample and Assembly records
    Federhen, Scott
    Clark, Karen
    Barrett, Tanya
    Parkinson, Helen
    Ostell, James
    Kodama, Yuichi
    Mashima, Jun
    Nakamura, Yasukazu
    Cochrane, Guy
    Karsch-Mizrachi, Ilene
    [J]. STANDARDS IN GENOMIC SCIENCES, 2014, 9 (03) : 1275 - 1277
  • [5] A novel three-unit tRNA splicing endonuclease found in ultrasmall Archaea possesses broad substrate specificity
    Fujishima, Kosuke
    Sugahara, Junichi
    Miller, Christopher S.
    Baker, Brett J.
    Di Giulio, Massimo
    Takesue, Kanako
    Sato, Asako
    Tomita, Masaru
    Banfield, Jillian F.
    Kanai, Akio
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 (22) : 9695 - 9704
  • [6] Extraordinary expansion of a Sorangium cellulosum genome from an alkaline milieu
    Han, Kui
    Li, Zhi-feng
    Peng, Ran
    Zhu, Li-ping
    Zhou, Tao
    Wang, Lu-guang
    Li, Shu-guang
    Zhang, Xiao-bo
    Hu, Wei
    Wu, Zhi-hong
    Qin, Nan
    Li, Yue-zhong
    [J]. SCIENTIFIC REPORTS, 2013, 3
  • [7] Benchmarking of Methods for Genomic Taxonomy
    Larsen, Mette V.
    Cosentino, Salvatore
    Lukjancenko, Oksana
    Saputra, Dhany
    Rasmussen, Simon
    Hasman, Henrik
    Sicheritz-Ponten, Thomas
    Aarestrup, Frank M.
    Ussery, David W.
    Lund, Ole
    [J]. JOURNAL OF CLINICAL MICROBIOLOGY, 2014, 52 (05) : 1529 - 1539
  • [8] High-throughput bacterial genome sequencing: an embarrassment of choice, a world of opportunity
    Loman, Nicholas J.
    Constantinidou, Chrystala
    Chan, Jacqueline Z. M.
    Halachev, Mihail
    Sergeant, Martin
    Penn, Charles W.
    Robinson, Esther R.
    Pallen, Mark J.
    [J]. NATURE REVIEWS MICROBIOLOGY, 2012, 10 (09) : 599 - 606
  • [9] Mende DR, 2013, NAT METHODS, V10, P881, DOI 10.1038/nmeth.2575
  • [10] The International Nucleotide Sequence Database Collaboration
    Nakamura, Yasukazu
    Cochrane, Guy
    Karsch-Mizrachi, Ilene
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D21 - D24