Networking Materials Data: Accelerating Discovery at Experimental Facilities

被引:6
作者
Foster, Ian [1 ,2 ,3 ,4 ]
Ananthakrishnan, Rachana [1 ,2 ]
Blaiszik, Ben [1 ,2 ]
Chard, Kyle [1 ,2 ]
Osborn, Ray [5 ]
Tuecke, Steven [1 ,2 ,3 ]
Wilde, Michael [1 ,2 ,3 ]
Wozniak, Justin [1 ,2 ,3 ]
机构
[1] Argonne Natl Lab, Computat Inst, Chicago, IL 60637 USA
[2] Univ Chicago, Chicago, IL 60637 USA
[3] Argonne Natl Lab, Math & Comp Sci Div, Argonne, IL 60439 USA
[4] Univ Chicago, Dept Comp Sci, Chicago, IL 60637 USA
[5] Argonne Natl Lab, Div Mat Sci, Argonne, IL 60439 USA
来源
BIG DATA AND HIGH PERFORMANCE COMPUTING | 2015年 / 26卷
关键词
Materials science; parallel computing; X-ray source; Swift; Globus; X-RAY-SCATTERING; SERVICE;
D O I
10.3233/978-1-61499-583-8-117
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Advances in both sensor and computing technologies promise new approaches to discovery in materials science and engineering. For example, it appears possible to integrate theoretical modeling and experiment in new ways, test existing models with unprecedented rigor, and infer entirely new models from first principles. But, before these new approaches can become useful in practice, practitioners must be able to work with petabytes and petaflops as intuitively and interactively as they do with gigabytes and gigaflops today. The Discovery Engines for Big Data project at Argonne National Laboratory is tackling key bottlenecks along the end-to-end discovery path, focusing in particular on opportunities at Argonne's Advanced Photon Source. Here, we describe results relating to data acquisition, management, and analysis. For acquisition, we describe automated pipelines based on Globus services that link instruments, computations, and people for rapid and reliable data exchange. For management, we describe digital asset management solutions that enable the capture, management, sharing, publication, and discovery of large quantities of complex and diverse data, along with associated metadata and programs. For analysis, we describe the use of 100K+ supercomputer cores to enable new research modalities based on near-real-time processing and feedback, and the use of Swift parallel scripting to facilitate authoring, understanding, and reuse of data generation, transformation, and analysis software.
引用
收藏
页码:117 / 132
页数:16
相关论文
共 41 条
[1]   Software as a Service for Data Scientists [J].
Allen, Bryce ;
Bresnahan, John ;
Childers, Lisa ;
Foster, Ian ;
Kandaswamy, Gopi ;
Kettimuthu, Raj ;
Kordas, Jack ;
Link, Mike ;
Martin, Stuart ;
Pickett, Karl ;
Tuecke, Steven .
COMMUNICATIONS OF THE ACM, 2012, 55 (02) :81-88
[2]   Globus platform-as-a-service for collaborative science applications [J].
Ananthakrishnan, Rachana ;
Chard, Kyle ;
Foster, Ian ;
Tuecke, Steven .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (02) :290-305
[3]  
[Anonymous], 2003, P SIGCHI C HUMAN FAC, DOI DOI 10.1145/642611.642681
[4]  
Armstrong T. G., 2014, P SC
[5]  
Austerberry D., 2012, Digital asset management
[6]  
Bicer T., 2015, 21 INT EUR C PAR DIS
[7]  
Chard K., 2015, 11 IEEE INT C ESCIEN
[8]   Globus Nexus: Research Identity, Profile, and Group Management as a Service [J].
Chard, Kyle ;
Lidman, Mattias ;
Bryan, Josh ;
Howe, Tom ;
McCollam, Brendan ;
Ananthakrishnan, Rachana ;
Tuecke, Steven ;
Foster, Ian .
2014 IEEE 10TH INTERNATIONAL CONFERENCE ON E-SCIENCE (E-SCIENCE), VOL 1, 2014, :31-38
[9]   HipGISAXS: a high-performance computing code for simulating grazing-incidence X-ray scattering data [J].
Chourou, Slim T. ;
Sarje, Abhinav ;
Li, Xiaoye S. ;
Chan, Elaine R. ;
Hexemer, Alexander .
JOURNAL OF APPLIED CRYSTALLOGRAPHY, 2013, 46 :1781-1795
[10]   AFLOWLIB.ORG: A distributed materials properties repository from high-throughput ab initio calculations [J].
Curtarolo, Stefano ;
Setyawan, Wahyu ;
Wang, Shidong ;
Xue, Junkai ;
Yang, Kesong ;
Taylor, Richard H. ;
Nelson, Lance J. ;
Hart, Gus L. W. ;
Sanvito, Stefano ;
Buongiorno-Nardelli, Marco ;
Mingo, Natalio ;
Levy, Ohad .
COMPUTATIONAL MATERIALS SCIENCE, 2012, 58 :227-235