Distributed processing of very large datasets with DataCutter

被引:97
作者
Beynon, MD
Kurc, T
Catalyurek, U
Chang, CL
Sussman, A
Saltz, J [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
[2] Johns Hopkins Med Inst, Dept Pathol, Baltimore, MD 21287 USA
基金
美国国家科学基金会;
关键词
multi-dimensional datasets; data analysis; distributed computing; runtime systems; component architectures;
D O I
10.1016/S0167-8191(01)00099-0
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We describe a framework, called DataCutter, that is designed to provide support for subsetting and processing of datasets in a distributed and heterogeneous environment. We illustrate the use of DataCutter with several data-intensive applications from diverse fields, and present experimental results. (C) 2001 Published by Elsevier Science B.V.
引用
收藏
页码:1457 / 1478
页数:22
相关论文
共 26 条
[21]  
Schroeder W., 2010, VISUALIZATION TOOLKI, V4th ed.
[22]  
SMITH PH, 1998, 1998 DVC WORKSHOP SE
[23]  
TELLER M, 1998, 6 NASA GODD SPAC FLG
[24]  
*US GEOL SURV, LAND SAT LANDSAT THE
[25]  
HPSS HIGH PERFORMANC
[26]  
GLOB GRID FOR