Distributed data mining on grids: Services, tools, and applications

被引:68
作者
Cannataro, M [1 ]
Congiusta, A
Pugliese, A
Talia, D
Trunfio, P
机构
[1] Univ Catanzaro, I-88100 Catanzaro, Italy
[2] Univ Calabria, DEIS, I-87036 Arcavacata Di Rende, CS, Italy
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2004年 / 34卷 / 06期
关键词
grid computing; grid programming; grid scheduling; knowledge grid; data mining;
D O I
10.1109/TSMCB.2004.836890
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data mining algorithms are widely used today for the analysis of large corporate and scientific datasets stored in databases and data archives. Industry, science, and commerce fields often need to analyze very large datasets maintained over geographically distributed sites by using the computational power of distributed and parallel systems. The grid can play a significant role in providing an effective computational support for distributed knowledge discovery applications. For the development of data mining applications on grids we designed a system called KNOWLEDGE GRID. This paper describes the KNOWLEDGE GRID framework and presents the toolset provided by the KNOWLEDGE GRID for implementing distributed knowledge discovery. The paper discusses how to design and implement data mining applications by using the KNOWLEDGE GRID tools starting from searching grid resources, composing software and data components, and executing the resulting data mining process on a grid. Some performance results are also discussed.
引用
收藏
页码:2451 / 2465
页数:15
相关论文
共 40 条
[1]  
ABRAHAM A, 2000, P IEEE 8 INT C ADV C
[2]  
[Anonymous], ADV KNOWLEDGE DISCOV
[3]  
ARNOLD D, 2002, CONCUR COMPUT
[4]  
AVERY P, 2001, GRIPHYN PROJECT DESC
[5]   From TeraGrid to knowledge grid [J].
Berman, F .
COMMUNICATIONS OF THE ACM, 2001, 44 (11) :27-28
[6]  
BERMAN F, 2001, COMMUNICATION NOV
[7]  
CANNATARO A, 2002, P C DAT MIN BOL IT
[8]  
Cannataro M, 2003, LECT NOTES COMPUT SC, V2840, P619
[9]   The knowledge grid [J].
Cannataro, M ;
Talia, D .
COMMUNICATIONS OF THE ACM, 2003, 46 (01) :89-93
[10]  
Cannataro M., 2003, Proceedings of (SemPGrid2003)), P113