Assembly: a resource for assembled genomes at NCBI

Cited by: 239
Authors
Kitts, Paul A. [1]
Church, Deanna M. [1, 2]
Thibaud-Nissen, Francoise [1]
Choi, Jinna [1]
Hem, Vichet [1]
Sapojnikov, Victor [1]
Smith, Robert G. [1]
Tatusova, Tatiana [1]
Xiang, Charlie [1]
Zherikov, Andrey [1]
DiCuccio, Michael [1]
Murphy, Terence D. [1]
Pruitt, Kim D. [1]
Kimchi, Avi [1]
Affiliations
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
[2] Personalis Inc, Menlo Pk, CA 94025 USA
Funding
National Institutes of Health (USA)
Keywords
READ ALIGNMENT; DRAFT GENOME; METABOLISM; SEQUENCE; METADATA; DATABASE;
DOI
10.1093/nar/gkv1226
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The NCBI Assembly database (www.ncbi.nlm.nih.gov/assembly/) provides stable accessioning and data tracking for genome assembly data. The model underlying the database can accommodate a range of assembly structures, including sets of unordered contig or scaffold sequences, bacterial genomes consisting of a single complete chromosome, or complex structures such as a human genome with modeled allelic variation. The database provides an assembly accession and version to unambiguously identify the set of sequences that make up a particular version of an assembly, and tracks changes to updated genome assemblies. The Assembly database reports metadata such as assembly names, simple statistical reports of the assembly (number of contigs and scaffolds, contiguity metrics such as contig N50, total sequence length and total gap length) as well as the assembly update history. The Assembly database also tracks the relationship between an assembly submitted to the International Nucleotide Sequence Database Consortium (INSDC) and the assembly represented in the NCBI RefSeq project. Users can find assemblies of interest by querying the Assembly Resource directly or by browsing available assemblies for a particular organism. Links in the Assembly Resource allow users to easily download sequence and annotations for current versions of genome assemblies from the NCBI genomes FTP site.
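The simple statistics named in the abstract (contig and scaffold counts, contig N50, total sequence length and total gap length) can be illustrated with a minimal Python sketch. It is not the Assembly database's own implementation: the function names `n50` and `summarize` are hypothetical, and the sketch assumes that scaffolds are given as plain strings and that gaps are represented as runs of `N`. N50 is taken here as the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the total length.

```python
import re


def n50(lengths):
    """Return the N50 of a non-empty collection of sequence lengths."""
    if not lengths:
        raise ValueError("at least one sequence length is required")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the running sum reaches half is the N50.
        if running >= half_total:
            return length


def summarize(scaffolds):
    """Compute simple assembly statistics from scaffold strings.

    Assumption: any run of 'N' (either case) inside a scaffold is a gap,
    and the stretches between gaps are the contigs.
    """
    contigs = [c for s in scaffolds for c in re.split(r"[Nn]+", s) if c]
    return {
        "scaffold_count": len(scaffolds),
        "contig_count": len(contigs),
        "total_length": sum(len(s) for s in scaffolds),
        "total_gap_length": sum(s.upper().count("N") for s in scaffolds),
        "contig_n50": n50([len(c) for c in contigs]),
        "scaffold_n50": n50([len(s) for s in scaffolds]),
    }


if __name__ == "__main__":
    # Toy input: one scaffold with a 3-base gap, one gapless scaffold.
    print(summarize(["ACGTNNNAC", "GGGGGG"]))
```

For real assemblies the same calculation would run on lengths read from a FASTA or AGP file rather than on in-memory strings; the toy input only makes the definitions concrete.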
Pages: D73-D80
Number of pages: 8