GenBank

Cited by: 1073
Authors
Clark, Karen [1]
Karsch-Mizrachi, Ilene [1]
Lipman, David J. [1]
Ostell, James [1]
Sayers, Eric W. [1]
Affiliations
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bldg 38A,8600 Rockville Pike, Bethesda, MD 20894 USA
Funding
National Institutes of Health (USA)
Keywords
DATABASE; BLAST
DOI
10.1093/nar/gkv1276
Chinese Library Classification (CLC) codes
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
GenBank® (www.ncbi.nlm.nih.gov/genbank/) is a comprehensive database that contains publicly available nucleotide sequences for over 340 000 formally described species. Recent developments include a new starting page for submitters, a shift toward using accession.version identifiers rather than GI numbers, a wizard for submitting 16S rRNA sequences, and an Identical Protein Report to address growing issues of data redundancy. GenBank organizes the sequence data received from individual laboratories and large-scale sequencing projects into 18 divisions, and GenBank staff assign unique accession.version identifiers upon data receipt. Most submitters use the web-based BankIt or standalone Sequin programs. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the nuccore, nucest, and nucgss databases of the Entrez retrieval system, which integrates these records with a variety of other data including taxonomy nodes, genomes, protein structures, and biomedical journal literature in PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP.
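As an illustration of the accession.version identifiers and the nuccore Entrez database mentioned in the abstract, the following is a minimal sketch, not taken from the article. It splits an identifier into its accession and version parts and builds an NCBI E-utilities `efetch` request URL without sending it. The function names and the regular expression are assumptions made for this example; the pattern is a deliberate simplification and does not cover every accession format GenBank issues.

```python
import re
from urllib.parse import urlencode

# Public NCBI E-utilities efetch endpoint.
EFETCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"

# Simplified pattern (an assumption for this sketch): 1-6 uppercase letters,
# an optional underscore, at least 5 digits, then ".<version>".
_ACC_VER = re.compile(r"^([A-Z]{1,6}_?[0-9]{5,})\.([0-9]+)$")


def split_accession_version(identifier):
    """Return (accession, version) for an identifier such as 'U49845.1'."""
    match = _ACC_VER.match(identifier.strip())
    if match is None:
        raise ValueError(f"not an accession.version identifier: {identifier!r}")
    return match.group(1), int(match.group(2))


def efetch_url(identifier, db="nuccore", rettype="gb"):
    """Build (but do not send) an efetch URL for one accession.version record."""
    identifier = identifier.strip()
    split_accession_version(identifier)  # validate before building the URL
    query = urlencode(
        {"db": db, "id": identifier, "rettype": rettype, "retmode": "text"}
    )
    return f"{EFETCH_URL}?{query}"
```

For example, `efetch_url("U49845.1")` builds a request for the GenBank flat file of that record from nuccore. Passing `db="nucest"` or `db="nucgss"` would target the other two Entrez databases named in the abstract, assuming they are still served by the endpoint.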
Pages: D67-D72
Page count: 6