caGrid 1.0: An enterprise Grid infrastructure for biomedical research

被引:66
作者
Oster, Scott [1 ]
Langella, Stephen [1 ]
Hastings, Shannon [1 ]
Ervin, David [1 ]
Madduri, Ravi [2 ]
Phillips, Joshua [4 ]
Kurc, Tahsin [1 ]
Siebenlist, Frank [2 ]
Covitz, Peter [3 ]
Shanbhag, Krishnakant [3 ]
Foster, Ian [2 ]
Saltz, Joel [1 ]
机构
[1] Ohio State Univ, Dept Biomed Informat, Columbus, OH 43210 USA
[2] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
[3] NCI, Ctr Bioinformat, Rockville, MD USA
[4] SemanticsBits, Reston, VA USA
关键词
D O I
10.1197/jamia.M2522
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: To develop software infrastructure that will provide support for discovery, characterization, integrated access, and management of diverse and disparate collections of information sources, analysis methods, and applications in biomedical research. Design: An enterprise Grid software infrastructure, called caGrid version 1.0 (caGrid 1.0), has been developed as the core Grid architecture of the NCI-sponsored cancer Biomedical Informatics Grid (caBIG (TM)) program. It is designed to support a wide range of use cases in basic, translational, and clinical research, including 1) discovery, 2) integrated and large-scale data analysis, and 3) coordinated study. Measurements: The caGrid is built as a Grid software infrastructure and leverages Grid computing technologies and the Web Services Resource Framework standards. It provides a set of core services, toolkits for the development and deployment of new community provided services, and application programming interfaces for building client applications. Results: The caGrid 1.0 was released to the caBIG community in December 2006. It is built on open source components and caGrid source code is publicly and freely available under a liberal open source license. The core software, associated tools, and documentation can be downloaded from the following URL: https://cabig.nci.nih. gov/workspaces/Architecture/caGrid. Conclusions: While caGrid 1.0 is designed to address use cases in cancer research, the requirements associated with discovery, analysis and integration of large scale data, and coordinated studies are common in other biomedical fields. In this respect, caGrid 1.0 is the realization of a framework that can benefit the entire biomedical community.
引用
收藏
页码:138 / 149
页数:12
相关论文
共 37 条
[1]  
ALEXANDER J, WEB SERVICES ENUMERA
[2]  
ALEXANDER J, WEB SERVICES TRANSFE
[3]  
ALLCOCK B, 2005, SUPERCOMPUTING
[4]  
AMENDOLIA SR, 2003, P 18 MED INF EUR C M, P194
[5]   The MyProxy online credential repository [J].
Basney, J ;
Humphrey, M ;
Welch, V .
SOFTWARE-PRACTICE & EXPERIENCE, 2005, 35 (09) :801-816
[6]  
BHATIA K, 2005, 1 INT C ESCIENCE GRI
[7]  
Booth D., WEB SERVICES ARCHITE
[8]   caCORE: A common infrastructure for cancer informatics [J].
Covitz, PA ;
Hartel, F ;
Schaefer, C ;
De Coronado, S ;
Fragoso, G ;
Sahni, H ;
Gustafson, S ;
Buetow, KH .
BIOINFORMATICS, 2003, 19 (18) :2404-2412
[9]   DM2 :: A distributed medical data manager for grids [J].
Duque, H ;
Montagnat, J ;
Pierson, JM ;
Brunie, L ;
Magnin, LE .
CCGRID 2003: 3RD IEEE/ACM INTERNATIONAL SYMPOSIUM ON CLUSTER COMPUTING AND THE GRID, PROCEEDINGS, 2003, :606-611
[10]  
Erberich SG, 2006, INT J COMPUT ASS RAD, V1, P100