Genome sequence data: management, storage, and visualization

被引:38
作者
Batley, Jacqueline [1 ,2 ]
Edwards, David [1 ]
机构
[1] Univ Queensland, Sch Land Crop & Food Sci, Australian Ctr Plant Funct Genom, Brisbane, Qld 4072, Australia
[2] Univ Queensland, ARC Ctr Excellence Integrat Legume Res, Brisbane, Qld 4072, Australia
基金
澳大利亚研究理事会;
关键词
second generation sequencing; genome sequencing databases; genome visualization; TECHNOLOGIES; BROWSER;
D O I
10.2144/000113134
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Over the last few years there has been a revolution in DNA sequencing technology that has brought down the cost of DNA sequencing and made the sequencing of an increasing number of genomes both feasible and cost effective. There has also been a dramatic shift in the type of sequence data being generated, with vast numbers of short reads or pairs of short reads replacing the traditional relatively long reads produced by Sanger sequencing. These changes in data quantity and format have led to a rethinking of sequence data management, storage, and Visualization, and provide a challenge for bioinformatics. The vast amount of sequence data that will be generated over the next few years will require a change in what data arc stored and how users query the information.
引用
收藏
页码:333 / +
页数:3
相关论文
共 14 条
  • [1] Toward the $1000 human genome
    Bennett, ST
    Barnes, C
    Cox, A
    Davies, L
    Brown, C
    [J]. PHARMACOGENOMICS, 2005, 6 (04) : 373 - 382
  • [2] GenBank
    Benson, Dennis A.
    Karsch-Mizrachi, Ilene
    Lipman, David J.
    Ostell, James
    Wheeler, David L.
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D16 - D20
  • [3] An overview of ensembl
    Birney, E
    Andrews, TD
    Bevan, P
    Caccamo, M
    Chen, Y
    Clarke, L
    Coates, G
    Cuff, J
    Curwen, V
    Cutts, T
    Down, T
    Eyras, E
    Fernandez-Suarez, XM
    Gane, P
    Gibbins, B
    Gilbert, J
    Hammond, M
    Hotz, HR
    Iyer, V
    Jekosch, K
    Kahari, A
    Kasprzyk, A
    Keefe, D
    Keenan, S
    Lehvaslaiho, H
    McVicker, G
    Melsopp, C
    Meidl, P
    Mongin, E
    Pettett, R
    Potter, S
    Proctor, G
    Rae, M
    Searle, S
    Slater, G
    Smedley, D
    Smith, J
    Spooner, W
    Stabenau, A
    Stalker, J
    Storey, R
    Ureta-Vidal, A
    Woodwark, KC
    Cameron, G
    Durbin, R
    Cox, A
    Hubbard, T
    Clamp, M
    [J]. GENOME RESEARCH, 2004, 14 (05) : 925 - 928
  • [4] EMBL Nucleotide Sequence Database: developments in 2005
    Cochrane, Guy
    Aldebert, Philippe
    Althorpe, Nicola
    Andersson, Mikael
    Baker, Wendy
    Baldwin, Alastair
    Bates, Kirsty
    Bhattacharyya, Sumit
    Browne, Paul
    van den Broek, Alexandra
    Castro, Matias
    Duggan, Karyn
    Eberhardt, Ruth
    Faruque, Nadeem
    Gamble, John
    Kanz, Carola
    Kulikova, Tamara
    Lee, Charles
    Leinonen, Rasko
    Lin, Quan
    Lombard, Vincent
    Lopez, Rodrigo
    McHale, Michelle
    McWilliam, Hamish
    Mukherjee, Gaurab
    Nardone, Francesco
    Pastor, Maria Pilar Garcia
    Sobhany, Siamak
    Stoehr, Peter
    Tzouvara, Katerina
    Vaughan, Robert
    Wu, Dan
    Zhu, Weimin
    Apweiler, Rolf
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D10 - D15
  • [5] Single-molecule DNA sequencing technologies for future genomics research
    Gupta, Pushpendra K.
    [J]. TRENDS IN BIOTECHNOLOGY, 2008, 26 (11) : 602 - 611
  • [6] Single-molecule DNA sequencing of a viral genome
    Harris, Timothy D.
    Buzby, Phillip R.
    Babcock, Hazen
    Beer, Eric
    Bowers, Jayson
    Braslavsky, Ido
    Causey, Marie
    Colonell, Jennifer
    DiMeo, James
    Efcavitch, J. William
    Giladi, Eldar
    Gill, Jaime
    Healy, John
    Jarosz, Mirna
    Lapen, Dan
    Moulton, Keith
    Quake, Stephen R.
    Steinmann, Kathleen
    Thayer, Edward
    Tyurina, Anastasia
    Ward, Rebecca
    Weiss, Howard
    Xie, Zheng
    [J]. SCIENCE, 2008, 320 (5872) : 106 - 109
  • [7] The UCSC Genome Browser Database
    Karolchik, D
    Baertsch, R
    Diekhans, M
    Furey, TS
    Hinrichs, A
    Lu, YT
    Roskin, KM
    Schwartz, M
    Sugnet, CW
    Thomas, DJ
    Weber, RJ
    Haussler, D
    Kent, WJ
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 51 - 54
  • [8] ZOOM! Zillions of oligos mapped
    Lin, Hao
    Zhang, Zefeng
    Zhang, Michael Q.
    Ma, Bin
    Li, Ming
    [J]. BIOINFORMATICS, 2008, 24 (21) : 2431 - 2437
  • [9] Applications of next-generation sequencing technologies in functional genomics
    Morozova, Olena
    Marra, Marco A.
    [J]. GENOMICS, 2008, 92 (05) : 255 - 264
  • [10] The Rice Annotation Project Database (RAP-DB):: hub for Oryza sativa ssp japonica genome information
    Ohyanagi, Hajime
    Tanaka, Tsuyoshi
    Sakai, Hiroaki
    Shigemoto, Yasumasa
    Yamaguchi, Kaori
    Habara, Takuya
    Fujii, Yasuyuki
    Antonio, Baltazar A.
    Nagamura, Yoshiaki
    Imanishi, Tadashi
    Ikeo, Kazuho
    Itoh, Takeshi
    Gojobori, Takashi
    Sasaki, Takuji
    [J]. NUCLEIC ACIDS RESEARCH, 2006, 34 : D741 - D744