Assembly: a resource for assembled genomes at NCBI

被引:239
作者
Kitts, Paul A. [1 ]
Church, Deanna M. [1 ,2 ]
Thibaud-Nissen, Francoise [1 ]
Choi, Jinna [1 ]
Hem, Vichet [1 ]
Sapojnikov, Victor [1 ]
Smith, Robert G. [1 ]
Tatusova, Tatiana [1 ]
Xiang, Charlie [1 ]
Zherikov, Andrey [1 ]
DiCuccio, Michael [1 ]
Murphy, Terence D. [1 ]
Pruitt, Kim D. [1 ]
Kimchi, Avi [1 ]
机构
[1] NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA
[2] Personalis Inc, Menlo Pk, CA 94025 USA
基金
美国国家卫生研究院;
关键词
READ ALIGNMENT; DRAFT GENOME; METABOLISM; SEQUENCE; METADATA; DATABASE;
D O I
10.1093/nar/gkv1226
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The NCBI Assembly database (www.ncbi.nlm.nih.gov/assembly/) provides stable accessioning and data tracking for genome assembly data. The model underlying the database can accommodate a range of assembly structures, including sets of unordered contig or scaffold sequences, bacterial genomes consisting of a single complete chromosome, or complex structures such as a human genome with modeled allelic variation. The database provides an assembly accession and version to unambiguously identify the set of sequences that make up a particular version of an assembly, and tracks changes to updated genome assemblies. The Assembly database reports metadata such as assembly names, simple statistical reports of the assembly (number of contigs and scaffolds, contiguity metrics such as contig N50, total sequence length and total gap length) as well as the assembly update history. The Assembly database also tracks the relationship between an assembly submitted to the International Nucleotide Sequence Database Consortium (INSDC) and the assembly represented in the NCBI RefSeq project. Users can find assemblies of interest by querying the Assembly Resource directly or by browsing available assemblies for a particular organism. Links in the Assembly Resource allow users to easily download sequence and annotations for current versions of genome assemblies from the NCBI genomes FTP site.
引用
收藏
页码:D73 / D80
页数:8
相关论文
共 23 条
[1]   BioProject and BioSample databases at NCBI: facilitating capture and organization of metadata [J].
Barrett, Tanya ;
Clark, Karen ;
Gevorgyan, Robert ;
Gorelenkov, Vyacheslav ;
Gribov, Eugene ;
Karsch-Mizrachi, Ilene ;
Kimelman, Michael ;
Pruitt, Kim D. ;
Resenchuk, Sergei ;
Tatusova, Tatiana ;
Yaschenko, Eugene ;
Ostell, James .
NUCLEIC ACIDS RESEARCH, 2012, 40 (D1) :D57-D63
[2]   UniProt: a hub for protein information [J].
Bateman, Alex ;
Martin, Maria Jesus ;
O'Donovan, Claire ;
Magrane, Michele ;
Apweiler, Rolf ;
Alpi, Emanuele ;
Antunes, Ricardo ;
Arganiska, Joanna ;
Bely, Benoit ;
Bingley, Mark ;
Bonilla, Carlos ;
Britto, Ramona ;
Bursteinas, Borisas ;
Chavali, Gayatri ;
Cibrian-Uhalte, Elena ;
Da Silva, Alan ;
De Giorgi, Maurizio ;
Dogan, Tunca ;
Fazzini, Francesco ;
Gane, Paul ;
Cas-tro, Leyla Garcia ;
Garmiri, Penelope ;
Hatton-Ellis, Emma ;
Hieta, Reija ;
Huntley, Rachael ;
Legge, Duncan ;
Liu, Wudong ;
Luo, Jie ;
MacDougall, Alistair ;
Mutowo, Prudence ;
Nightin-gale, Andrew ;
Orchard, Sandra ;
Pichler, Klemens ;
Poggioli, Diego ;
Pundir, Sangya ;
Pureza, Luis ;
Qi, Guoying ;
Rosanoff, Steven ;
Saidi, Rabie ;
Sawford, Tony ;
Shypitsyna, Aleksandra ;
Turner, Edward ;
Volynkin, Vladimir ;
Wardell, Tony ;
Watkins, Xavier ;
Zellner, Hermann ;
Cowley, Andrew ;
Figueira, Luis ;
Li, Weizhong ;
McWilliam, Hamish .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D204-D212
[3]   Identification of cucurbitacins and assembly of a draft genome for Aquilaria agallocha [J].
Chen, Chuan-Hung ;
Kuo, Tony Chien-Yen ;
Yang, Meng-Han ;
Chien, Ting-Ying ;
Chu, Mei-Ju ;
Huang, Li-Chun ;
Chen, Chien-Yu ;
Lo, Hsiao-Feng ;
Jeng, Shih-Tong ;
Chen, Long-Fang O. .
BMC GENOMICS, 2014, 15
[4]   The First Myriapod Genome Sequence Reveals Conservative Arthropod Gene Content and Genome Organisation in the Centipede Strigamia maritima [J].
Chipman, Ariel D. ;
Ferrier, David E. K. ;
Brena, Carlo ;
Qu, Jiaxin ;
Hughes, Daniel S. T. ;
Schroeder, Reinhard ;
Torres-Oliva, Montserrat ;
Znassi, Nadia ;
Jiang, Huaiyang ;
Almeida, Francisca C. ;
Alonso, Claudio R. ;
Apostolou, Zivkos ;
Aqrawi, Peshtewani ;
Arthur, Wallace ;
Barna, Jennifer C. J. ;
Blankenburg, Kerstin P. ;
Brites, Daniela ;
Capella-Gutierrez, Salvador ;
Coyle, Marcus ;
Dearden, Peter K. ;
Du Pasquier, Louis ;
Duncan, Elizabeth J. ;
Ebert, Dieter ;
Eibner, Cornelius ;
Erikson, Galina ;
Evans, Peter D. ;
Extavour, Cassandra G. ;
Francisco, Liezl ;
Gabaldon, Toni ;
Gillis, William J. ;
Goodwin-Horn, Elizabeth A. ;
Green, Jack E. ;
Griffiths-Jones, Sam ;
Grimmelikhuijzen, Cornelis J. P. ;
Gubbala, Sai ;
Guigo, Roderic ;
Han, Yi ;
Hauser, Frank ;
Havlak, Paul ;
Hayden, Luke ;
Helbing, Sophie ;
Holder, Michael ;
Hui, Jerome H. L. ;
Hunn, Julia P. ;
Hunnekuhl, Vera S. ;
Jackson, LaRonda ;
Javaid, Mehwish ;
Jhangiani, Shalini N. ;
Jiggins, Francis M. ;
Jones, Tamsin E. .
PLOS BIOLOGY, 2014, 12 (11)
[5]   Modernizing Reference Genome Assemblies [J].
Church, Deanna M. ;
Schneider, Valerie A. ;
Graves, Tina ;
Auger, Katherine ;
Cunningham, Fiona ;
Bouk, Nathan ;
Chen, Hsiu-Chuan ;
Agarwala, Richa ;
McLaren, William M. ;
Ritchie, Graham R. S. ;
Albracht, Derek ;
Kremitzki, Milinn ;
Rock, Susan ;
Kotkiewicz, Holland ;
Kremitzki, Colin ;
Wollam, Aye ;
Trani, Lee ;
Fulton, Lucinda ;
Fulton, Robert ;
Matthews, Lucy ;
Whitehead, Siobhan ;
Chow, Will ;
Torrance, James ;
Dunn, Matthew ;
Harden, Glenn ;
Threadgold, Glen ;
Wood, Jonathan ;
Collins, Joanna ;
Heath, Paul ;
Griffiths, Guy ;
Pelan, Sarah ;
Grafham, Darren ;
Eichler, Evan E. ;
Weinstock, George ;
Mardis, Elaine R. ;
Wilson, Richard K. ;
Howe, Kerstin ;
Flicek, Paul ;
Hubbard, Tim .
PLOS BIOLOGY, 2011, 9 (07)
[6]   Ensembl 2015 [J].
Cunningham, Fiona ;
Amode, M. Ridwan ;
Barrell, Daniel ;
Beal, Kathryn ;
Billis, Konstantinos ;
Brent, Simon ;
Carvalho-Silva, Denise ;
Clapham, Peter ;
Coates, Guy ;
Fitzgerald, Stephen ;
Gil, Laurent ;
Giron, Carlos Garcia ;
Gordon, Leo ;
Hourlier, Thibaut ;
Hunt, Sarah E. ;
Janacek, Sophie H. ;
Johnson, Nathan ;
Juettemann, Thomas ;
Kaehaeri, Andreas K. ;
Keenan, Stephen ;
Martin, Fergal J. ;
Maurel, Thomas ;
McLaren, William ;
Murphy, Daniel N. ;
Nag, Rishi ;
Overduin, Bert ;
Parker, Anne ;
Patricio, Mateus ;
Perry, Emily ;
Pignatelli, Miguel ;
Riat, Harpreet Singh ;
Sheppard, Daniel ;
Taylor, Kieron ;
Thormann, Anja ;
Vullo, Alessandro ;
Wilder, Steven P. ;
Zadissa, Amonida ;
Aken, Bronwen L. ;
Birney, Ewan ;
Harrow, Jennifer ;
Kinsella, Rhoda ;
Muffato, Matthieu ;
Ruffier, Magali ;
Searle, Stephen M. J. ;
Spudich, Giulietta ;
Trevanion, Stephen J. ;
Yates, Andy ;
Zerbino, Daniel R. ;
Flicek, Paul .
NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) :D662-D669
[7]   Toward richer metadata for microbial sequences: replacing strain-level NCBI taxonomy taxids with BioProject, BioSample and Assembly records [J].
Federhen, Scott ;
Clark, Karen ;
Barrett, Tanya ;
Parkinson, Helen ;
Ostell, James ;
Kodama, Yuichi ;
Mashima, Jun ;
Nakamura, Yasukazu ;
Cochrane, Guy ;
Karsch-Mizrachi, Ilene .
STANDARDS IN GENOMIC SCIENCES, 2014, 9 (03) :1275-1277
[8]   Draft genome sequence of the mulberry tree Morus notabilis [J].
He, Ningjia ;
Zhang, Chi ;
Qi, Xiwu ;
Zhao, Shancen ;
Tao, Yong ;
Yang, Guojun ;
Lee, Tae-Ho ;
Wang, Xiyin ;
Cai, Qingle ;
Li, Dong ;
Lu, Mengzhu ;
Liao, Sentai ;
Luo, Guoqing ;
He, Rongjun ;
Tan, Xu ;
Xu, Yunmin ;
Li, Tian ;
Zhao, Aichun ;
Jia, Ling ;
Fu, Qiang ;
Zeng, Qiwei ;
Gao, Chuan ;
Ma, Bi ;
Liang, Jiubo ;
Wang, Xiling ;
Shang, Jingzhe ;
Song, Penghua ;
Wu, Haiyang ;
Fan, Li ;
Wang, Qing ;
Shuai, Qin ;
Zhu, Juanjuan ;
Wei, Congjin ;
Zhu-Salzman, Keyan ;
Jin, Dianchuan ;
Wang, Jinpeng ;
Liu, Tao ;
Yu, Maode ;
Tang, Cuiming ;
Wang, Zhenjiang ;
Dai, Fanwei ;
Chen, Jiafei ;
Liu, Yan ;
Zhao, Shutang ;
Lin, Tianbao ;
Zhang, Shougong ;
Wang, Junyi ;
Wang, Jian ;
Yang, Huanming ;
Yang, Guangwei .
NATURE COMMUNICATIONS, 2013, 4
[9]   The sheep genome illuminates biology of the rumen and lipid metabolism [J].
Jiang, Yu ;
Xie, Min ;
Chen, Wenbin ;
Talbot, Richard ;
Maddox, Jillian F. ;
Faraut, Thomas ;
Wu, Chunhua ;
Muzny, Donna M. ;
Li, Yuxiang ;
Zhang, Wenguang ;
Stanton, Jo-Ann ;
Brauning, Rudiger ;
Barris, Wesley C. ;
Hourlier, Thibaut ;
Aken, Bronwen L. ;
Searle, Stephen M. J. ;
Adelson, David L. ;
Bian, Chao ;
Cam, Graham R. ;
Chen, Yulin ;
Cheng, Shifeng ;
DeSilva, Udaya ;
Dixen, Karen ;
Dong, Yang ;
Fan, Guangyi ;
Franklin, Ian R. ;
Fu, Shaoyin ;
Fuentes-Utrilla, Pablo ;
Guan, Rui ;
Highland, Margaret A. ;
Holder, Michael E. ;
Huang, Guodong ;
Ingham, Aaron B. ;
Jhangiani, Shalini N. ;
Kalra, Divya ;
Kovar, Christie L. ;
Lee, Sandra L. ;
Liu, Weiqing ;
Liu, Xin ;
Lu, Changxin ;
Lv, Tian ;
Mathew, Tittu ;
McWilliam, Sean ;
Menzies, Moira ;
Pan, Shengkai ;
Robelin, David ;
Servin, Bertrand ;
Townley, David ;
Wang, Wenliang ;
Wei, Bin .
SCIENCE, 2014, 344 (6188) :1168-1173
[10]   Genome sequences of wild and domestic bactrian camels [J].
Jirimutu ;
Wang, Zhen ;
Ding, Guohui ;
Chen, Gangliang ;
Sun, Yamin ;
Sun, Zhihong ;
Zhang, Heping ;
Wang, Lei ;
Hasi, Surong ;
Zhang, Yan ;
Li, Jianmei ;
Shi, Yixiang ;
Xu, Ze ;
He, Chuan ;
Yu, Siriguleng ;
Li, Shengdi ;
Zhang, Wenbin ;
Batmunkh, Mijiddorj ;
Ts, Batsukh ;
Narenbatu ;
Unierhu ;
Bat-Ireedui, Shirzana ;
Gao, Hongwei ;
Baysgalan, Banzragch ;
Li, Qing ;
Jia, Zhiling ;
Turigenbayila ;
Subudenggerile ;
Narenmanduhu ;
Wang, Zhaoxia ;
Wang, Juan ;
Pan, Lei ;
Chen, Yongcan ;
Ganerdene, Yaichil ;
Dabxilt ;
Erdemt ;
Altansha ;
Altansukh ;
Liu, Tuya ;
Cao, Minhui ;
Aruuntsever ;
Bayart ;
Hosblig ;
He, Fei ;
Zha-ti, A. ;
Zheng, Guangyong ;
Qiu, Feng ;
Sun, Zikui ;
Zhao, Lele ;
Zhao, Wenjing .
NATURE COMMUNICATIONS, 2012, 3