共 9 条
The future of DNA sequence archiving
被引:21
作者:

Cochrane, Guy
论文数: 0 引用数: 0
h-index: 0
机构:
EMBL Bioinformat Inst, Hinxton CB10 1SD, England EMBL Bioinformat Inst, Hinxton CB10 1SD, England

Cook, Charles E.
论文数: 0 引用数: 0
h-index: 0
机构:
EMBL Bioinformat Inst, Hinxton CB10 1SD, England EMBL Bioinformat Inst, Hinxton CB10 1SD, England

Birney, Ewan
论文数: 0 引用数: 0
h-index: 0
机构:
EMBL Bioinformat Inst, Hinxton CB10 1SD, England EMBL Bioinformat Inst, Hinxton CB10 1SD, England
机构:
[1] EMBL Bioinformat Inst, Hinxton CB10 1SD, England
来源:
GIGASCIENCE
|
2012年
/
1卷
关键词:
DNA;
Sequence;
Archive;
Compression;
Storage;
Image;
D O I:
10.1186/2047-217X-1-2
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Archives operating under the International Nucleotide Sequence Database Collaboration currently preserve all submitted sequences equally, but rapid increases in the rate of global sequence production will soon require differentiated treatment of DNA sequences submitted for archiving. Here, we propose a graded system in which the ease of reproduction of a sequencing-based experiment and the relative availability of a sample for resequencing define the level of lossy compression applied to stored data.
引用
收藏
页数:5
相关论文
共 9 条
[1]
The Western English Channel contains a persistent microbial seed bank
[J].
Caporaso, J. Gregory
;
Paszkiewicz, Konrad
;
Field, Dawn
;
Knight, Rob
;
Gilbert, Jack A.
.
ISME JOURNAL,
2012, 6 (06)
:1089-1093

Caporaso, J. Gregory
论文数: 0 引用数: 0
h-index: 0
机构:
No Arizona Univ, Dept Comp Sci, Flagstaff, AZ 86011 USA Univ Chicago, Argonne Natl Lab, Dept Ecol & Evolut, Argonne, IL 60439 USA

Paszkiewicz, Konrad
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Exeter, Dept Biosci, Exeter, Devon, England Univ Chicago, Argonne Natl Lab, Dept Ecol & Evolut, Argonne, IL 60439 USA

Field, Dawn
论文数: 0 引用数: 0
h-index: 0
机构: Univ Chicago, Argonne Natl Lab, Dept Ecol & Evolut, Argonne, IL 60439 USA

Knight, Rob
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Colorado, Dept Chem & Biochem, Boulder, CO 80309 USA
Univ Colorado, Howard Hughes Med Inst, Boulder, CO 80309 USA Univ Chicago, Argonne Natl Lab, Dept Ecol & Evolut, Argonne, IL 60439 USA

Gilbert, Jack A.
论文数: 0 引用数: 0
h-index: 0
机构:
Univ Chicago, Argonne Natl Lab, Dept Ecol & Evolut, Argonne, IL 60439 USA Univ Chicago, Argonne Natl Lab, Dept Ecol & Evolut, Argonne, IL 60439 USA
[2]
TranscriptSNPView: a genome-wide catalog of mouse coding variation
[J].
Cunningham, Fiona
;
Rios, Daniel
;
Griffiths, Mark
;
Smith, James
;
Ning, Zemin
;
Cox, Tony
;
Flicek, Paul
;
Marin-Garcin, Pablo
;
Herrero, Javier
;
Rogers, Jane
;
Van der Weyden, Louise
;
Bradley, Allan
;
Birney, Ewan
;
Adams, David J.
.
NATURE GENETICS,
2006, 38 (08)
:853-853

Cunningham, Fiona
论文数: 0 引用数: 0
h-index: 0
机构:
Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Rios, Daniel
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Griffiths, Mark
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Smith, James
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Ning, Zemin
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Cox, Tony
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

论文数: 引用数:
h-index:
机构:

Marin-Garcin, Pablo
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

论文数: 引用数:
h-index:
机构:

Rogers, Jane
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Van der Weyden, Louise
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Bradley, Allan
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Birney, Ewan
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England

Adams, David J.
论文数: 0 引用数: 0
h-index: 0
机构: Wellcome Trust Sanger Inst, Hinxton CB10 1SA, Cambs, England
[3]
WHOLE-GENOME RANDOM SEQUENCING AND ASSEMBLY OF HAEMOPHILUS-INFLUENZAE RD
[J].
FLEISCHMANN, RD
;
ADAMS, MD
;
WHITE, O
;
CLAYTON, RA
;
KIRKNESS, EF
;
KERLAVAGE, AR
;
BULT, CJ
;
TOMB, JF
;
DOUGHERTY, BA
;
MERRICK, JM
;
MCKENNEY, K
;
SUTTON, G
;
FITZHUGH, W
;
FIELDS, C
;
GOCAYNE, JD
;
SCOTT, J
;
SHIRLEY, R
;
LIU, LI
;
GLODEK, A
;
KELLEY, JM
;
WEIDMAN, JF
;
PHILLIPS, CA
;
SPRIGGS, T
;
HEDBLOM, E
;
COTTON, MD
;
UTTERBACK, TR
;
HANNA, MC
;
NGUYEN, DT
;
SAUDEK, DM
;
BRANDON, RC
;
FINE, LD
;
FRITCHMAN, JL
;
FUHRMANN, JL
;
GEOGHAGEN, NSM
;
GNEHM, CL
;
MCDONALD, LA
;
SMALL, KV
;
FRASER, CM
;
SMITH, HO
;
VENTER, JC
.
SCIENCE,
1995, 269 (5223)
:496-512

FLEISCHMANN, RD
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

ADAMS, MD
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

WHITE, O
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

CLAYTON, RA
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

KIRKNESS, EF
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

KERLAVAGE, AR
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

BULT, CJ
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

TOMB, JF
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

DOUGHERTY, BA
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

MERRICK, JM
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

MCKENNEY, K
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

SUTTON, G
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

FITZHUGH, W
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

FIELDS, C
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

GOCAYNE, JD
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

SCOTT, J
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

SHIRLEY, R
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

LIU, LI
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

GLODEK, A
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

KELLEY, JM
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

WEIDMAN, JF
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

PHILLIPS, CA
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

SPRIGGS, T
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

HEDBLOM, E
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

COTTON, MD
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

UTTERBACK, TR
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

HANNA, MC
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

NGUYEN, DT
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

SAUDEK, DM
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

BRANDON, RC
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

FINE, LD
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

FRITCHMAN, JL
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

FUHRMANN, JL
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

GEOGHAGEN, NSM
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

GNEHM, CL
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

MCDONALD, LA
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

SMALL, KV
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

FRASER, CM
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

SMITH, HO
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA

VENTER, JC
论文数: 0 引用数: 0
h-index: 0
机构: INST GENOM RES, GAITHERSBURG, MD 20878 USA
[4]
Efficient storage of high throughput DNA sequencing data using reference-based compression
[J].
Fritz, Markus Hsi-Yang
;
Leinonen, Rasko
;
Cochrane, Guy
;
Birney, Ewan
.
GENOME RESEARCH,
2011, 21 (05)
:734-740

Fritz, Markus Hsi-Yang
论文数: 0 引用数: 0
h-index: 0
机构:
EMBL EBI, Hinxton CB10 1SD, Cambs, England EMBL EBI, Hinxton CB10 1SD, Cambs, England

Leinonen, Rasko
论文数: 0 引用数: 0
h-index: 0
机构:
EMBL EBI, Hinxton CB10 1SD, Cambs, England EMBL EBI, Hinxton CB10 1SD, Cambs, England

Cochrane, Guy
论文数: 0 引用数: 0
h-index: 0
机构:
EMBL EBI, Hinxton CB10 1SD, Cambs, England EMBL EBI, Hinxton CB10 1SD, Cambs, England

Birney, Ewan
论文数: 0 引用数: 0
h-index: 0
机构:
EMBL EBI, Hinxton CB10 1SD, Cambs, England EMBL EBI, Hinxton CB10 1SD, Cambs, England
[5]
The International Nucleotide Sequence Database Collaboration
[J].
Karsch-Mizrachi, Ilene
;
Nakamura, Yasukazu
;
Cochrane, Guy
.
NUCLEIC ACIDS RESEARCH,
2012, 40 (D1)
:D33-D37

Karsch-Mizrachi, Ilene
论文数: 0 引用数: 0
h-index: 0
机构:
NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20892 USA NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20892 USA

Nakamura, Yasukazu
论文数: 0 引用数: 0
h-index: 0
机构:
Res Org Informat & Syst, Natl Inst Genet, Ctr Informat Biol, Mishima, Shizuoka 4118510, Japan
Res Org Informat & Syst, Natl Inst Genet, DNA Data Bank Japan, Mishima, Shizuoka 4118510, Japan NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20892 USA

论文数: 引用数:
h-index:
机构:
[6]
The sequence read archive: explosive growth of sequencing data
[J].
Kodama, Yuichi
;
Shumway, Martin
;
Leinonen, Rasko
.
NUCLEIC ACIDS RESEARCH,
2012, 40 (D1)
:D54-D56

Kodama, Yuichi
论文数: 0 引用数: 0
h-index: 0
机构:
Res Org Informat & Syst, Ctr Informat Biol, Mishima, Shizuoka 4118540, Japan
Res Org Informat & Syst, DNA Data Bank Japan, Natl Inst Genet, Mishima, Shizuoka 4118540, Japan Res Org Informat & Syst, Ctr Informat Biol, Mishima, Shizuoka 4118540, Japan

Shumway, Martin
论文数: 0 引用数: 0
h-index: 0
机构:
NIH, Natl Ctr Biotechnol Informat, Natl Lib Med, Bethesda, MD 20894 USA Res Org Informat & Syst, Ctr Informat Biol, Mishima, Shizuoka 4118540, Japan

论文数: 引用数:
h-index:
机构:
[7]
Genomic information infrastructure after the deluge
[J].
Parkhill, Julian
;
Birney, Ewan
;
Kersey, Paul
.
GENOME BIOLOGY,
2010, 11 (07)

Parkhill, Julian
论文数: 0 引用数: 0
h-index: 0
机构:
Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England

Birney, Ewan
论文数: 0 引用数: 0
h-index: 0
机构:
European Bioinformat Inst, Cambridge CB10 1SD, England Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England

Kersey, Paul
论文数: 0 引用数: 0
h-index: 0
机构:
European Bioinformat Inst, Cambridge CB10 1SD, England Wellcome Trust Sanger Inst, Cambridge CB10 1SA, England
[8]
Serendipitous discovery of Wolbachia genomes in multiple Drosophila species -: art. no. R23
[J].
Salzberg, SL
;
Hotopp, JCD
;
Delcher, AL
;
Pop, M
;
Smith, DR
;
Eisen, MB
;
Nelson, WC
.
GENOME BIOLOGY,
2005, 6 (03)

Salzberg, SL
论文数: 0 引用数: 0
h-index: 0
机构: Inst Genome Res, Rockville, MD 20850 USA

Hotopp, JCD
论文数: 0 引用数: 0
h-index: 0
机构: Inst Genome Res, Rockville, MD 20850 USA

Delcher, AL
论文数: 0 引用数: 0
h-index: 0
机构: Inst Genome Res, Rockville, MD 20850 USA

Pop, M
论文数: 0 引用数: 0
h-index: 0
机构: Inst Genome Res, Rockville, MD 20850 USA

Smith, DR
论文数: 0 引用数: 0
h-index: 0
机构: Inst Genome Res, Rockville, MD 20850 USA

Eisen, MB
论文数: 0 引用数: 0
h-index: 0
机构: Inst Genome Res, Rockville, MD 20850 USA

Nelson, WC
论文数: 0 引用数: 0
h-index: 0
机构: Inst Genome Res, Rockville, MD 20850 USA
[9]
NUCLEOTIDE-SEQUENCE OF BACTERIOPHAGE PHICHI174 DNA
[J].
SANGER, F
;
AIR, GM
;
BARRELL, BG
;
BROWN, NL
;
COULSON, AR
;
FIDDES, JC
;
HUTCHISON, CA
;
SLOCOMBE, PM
;
SMITH, M
.
NATURE,
1977, 265 (5596)
:687-695

SANGER, F
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND

AIR, GM
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND

BARRELL, BG
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND

BROWN, NL
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND

COULSON, AR
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND

FIDDES, JC
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND

HUTCHISON, CA
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND

SLOCOMBE, PM
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND

SMITH, M
论文数: 0 引用数: 0
h-index: 0
机构:
MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND MRC,MOLEC BIOL LAB,CAMBRIDGE CB2 2QH,ENGLAND