GenBank

Cited by: 309
Authors
Benson, Dennis A. [1 ]
Karsch-Mizrachi, Ilene [1 ]
Lipman, David J. [1 ]
Ostell, James [1 ]
Sayers, Eric W. [1 ]
Affiliations
[1] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
Funding
National Institutes of Health (USA);
Keywords
DATABASE; GENERATION;
DOI
10.1093/nar/gkq1079
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
GenBank® is a comprehensive database that contains publicly available nucleotide sequences for more than 380 000 organisms named at the genus level or lower, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and accession numbers are assigned by GenBank staff upon receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI Homepage: www.ncbi.nlm.nih.gov.
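As an illustration of the Entrez access route mentioned in the abstract, the following is a minimal sketch of how a request for a single GenBank flat-file record might be assembled through NCBI's E-utilities EFetch endpoint. The helper name `genbank_efetch_url` and the example accession `U49845` are illustrative choices and are not taken from the paper. The sketch only builds the URL and performs no network request.

```python
from urllib.parse import urlencode

# Base address of the NCBI E-utilities EFetch service.
EUTILS_EFETCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"


def genbank_efetch_url(accession: str) -> str:
    """Build an EFetch URL for one nucleotide record in GenBank flat-file format.

    Hypothetical helper for illustration only; it does not contact NCBI.
    """
    params = {
        "db": "nuccore",     # Entrez nucleotide database
        "id": accession,     # accession number of the record
        "rettype": "gb",     # GenBank flat-file report
        "retmode": "text",   # plain-text output
    }
    return f"{EUTILS_EFETCH}?{urlencode(params)}"


# Example accession chosen purely for illustration.
url = genbank_efetch_url("U49845")
print(url)
```

Fetching the resulting URL with any HTTP client should return the record as text, subject to NCBI's usage guidelines on request rates.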
Pages: D32-D37
Page count: 6