The Sequence Read Archive

Cited by: 1781
Authors
Leinonen, Rasko [1]
Sugawara, Hideaki [2, 3]
Shumway, Martin [4]
Affiliations
[1] European Bioinformat Inst, Cambridge CB10 1SD, England
[2] Res Org Informat & Syst, Ctr Informat Biol, Mishima, Shizuoka 4118540, Japan
[3] Res Org Informat & Syst, DNA Data Bank Japan, Natl Inst Genet, Mishima, Shizuoka 4118540, Japan
[4] Natl Lib Med, Natl Ctr Biotechnol Informat, NIH, Bethesda, MD 20894 USA
Funding
Wellcome Trust (UK)
Keywords
FORMAT
DOI
10.1093/nar/gkq1019
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The combination of significantly lower cost and increased speed of sequencing has resulted in an explosive growth of data submitted into the primary next-generation sequence data archive, the Sequence Read Archive (SRA). The preservation of experimental data is an important part of the scientific record, and increasing numbers of journals and funding agencies require that next-generation sequence data are deposited into the SRA. The SRA was established as a public repository for the next-generation sequence data and is operated by the International Nucleotide Sequence Database Collaboration (INSDC). INSDC partners include the National Center for Biotechnology Information (NCBI), the European Bioinformatics Institute (EBI) and the DNA Data Bank of Japan (DDBJ). The SRA is accessible at http://www.ncbi.nlm.nih.gov/Traces/sra from NCBI, at http://www.ebi.ac.uk/ena from EBI and at http://trace.ddbj.nig.ac.jp from DDBJ. In this article, we present the content and structure of the SRA, detail our support for sequencing platforms and provide recommended data submission levels and formats. We also briefly outline our response to the challenge of data growth.
Pages: D19-D21
Page count: 3