Distributed data mining on Agent Grid: Issues, platform and development toolkit

被引:16
作者
Luo, Jiewen [1 ]
Wang, Maoguang
Hu, Jun
Shi, Zhongzhi
机构
[1] Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China
[2] Chinese Acad Sci, Grad Univ, Beijing 100049, Peoples R China
[3] China Univ Min & Technol, Sch Comp Sci, Xuzhou 221008, Peoples R China
[4] NanChang Univ, Informat Engn Sch, Nanchang 330047, Peoples R China
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF GRID COMPUTING THEORY METHODS AND APPLICATIONS | 2007年 / 23卷 / 01期
基金
中国国家自然科学基金;
关键词
D O I
10.1016/j.future.2006.04.015
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Centralized data mining techniques are widely used today for the analysis of large corporate and scientific data stored in databases. However, industry, science, and commerce fields often need to analyze very large datasets maintained over geographically distributed sites by using the computational power of distributed systems. The Grid can play a significant role in providing an effective computational infrastructure support for this kind of data mining. Similarly, the advent of multi-agent systems has brought us a new paradigm for the development of complex distributed applications. During the past decades, there have been several models and systems proposed to apply agent technology building distributed data mining (DDM). Through a combination of these two techniques, we investigated the critical issues to build DDM on Grid infrastructure and design an Agent Grid Intelligent Platform as a testbed. We also implement an integrated toolkit VAStudio for quickly developing agent-based DDM applications and compare its function with other systems. (C) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:61 / 68
页数:8
相关论文
共 13 条
[1]  
[Anonymous], 2004, The Grid: Blueprint for a New Computing Infrastructure. Vol
[2]   Distributed data mining on the grid [J].
Cannataro, M ;
Talia, D ;
Trunfio, P .
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2002, 18 (08) :1101-1112
[3]  
Chattratichat J, 1999, LECT NOTES COMPUT SC, V1593, P573
[4]   Grid-enabled data warehousing for PF molecular engineering [J].
Dubitzky, W ;
McCourt, D ;
Galushka, M ;
Romberg, M ;
Schuller, B .
PARALLEL COMPUTING, 2004, 30 (9-10) :1019-1035
[5]  
FOSTER I, AAMAS2004
[6]  
FOSTER I, 2001, INT J SPERCOMPUT APP, V15
[7]  
GROSSMAN R, 1999, P SUP
[8]  
Kargupta H., 1997, Proceedings of the Third International Conference on Knowledge Discovery and Data Mining, P211
[9]  
KARGUPTA H, 1999, ADV DISTRIBUTED DATA, P407
[10]  
Kargupta H., 1998, WORKSH DISTR DAT MIN