Big Data: Astronomical or Genomical?

Cited by: 797
Authors
Stephens, Zachary D. [1, 2]
Lee, Skylar Y. [1, 2]
Faghri, Faraz [3]
Campbell, Roy H. [3]
Zhai, Chengxiang [3, 4]
Efron, Miles J. [5]
Iyer, Ravishankar [1, 2]
Schatz, Michael C. [6]
Sinha, Saurabh [3, 4]
Robinson, Gene E. [7, 8]
Affiliations
[1] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
[2] Univ Illinois, Dept Elect & Comp Engn, Urbana, IL USA
[3] Univ Illinois, Dept Comp Sci, Urbana, IL USA
[4] Univ Illinois, Carl R Woese Inst Genom Biol, Urbana, IL USA
[5] Univ Illinois, Sch Lib & Informat Sci, Urbana, IL USA
[6] Cold Spring Harbor Lab, Simons Ctr Quantitat Biol, Cold Spring Harbor, NY 11724 USA
[7] Univ Illinois, Dept Entomol, Carl R Woese Inst Genom Biol, Urbana, IL USA
[8] Univ Illinois, Neurosci Program, Urbana, IL USA
Funding
U.S. National Science Foundation; U.S. National Institutes of Health
Keywords
GENOMES; CHALLENGES;
DOI
10.1371/journal.pbio.1002195
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Genomics is a Big Data science and is going to get much bigger, very soon, but it is not known whether the needs of genomics will exceed other Big Data domains. Projecting to the year 2025, we compared genomics with three other major generators of Big Data: astronomy, YouTube, and Twitter. Our estimates show that genomics is a "four-headed beast": it is either on par with or the most demanding of the domains analyzed here in terms of data acquisition, storage, distribution, and analysis. We discuss aspects of new technologies that will need to be developed to rise up and meet the computational challenges that genomics poses for the near future. Now is the time for concerted, community-wide planning for the "genomical" challenges of the next decade.
Pages: 11