Data mining and KDD: Promise and challenges

Cited by: 107
Authors
Fayyad, U [1]
Stolorz, P [1]
Affiliations
[1] Caltech, Jet Propulsion Laboratory, Pasadena, CA 91109
Keywords
data mining; data analysis; science data analysis; overview article; knowledge discovery in databases; databases; parallel data mining;
DOI
10.1016/S0167-739X(97)00015-0
Chinese Library Classification number
TP301 [Theory, methods];
Subject classification code
081202 ;
Abstract
Databases are growing in size to a stage where traditional techniques for analysis and visualization of the data are breaking down. Data mining and knowledge discovery in databases (KDD) are concerned with extracting models and patterns of interest from large databases. Data mining techniques have their origins in methods from statistics, pattern recognition, databases, artificial intelligence, high performance and parallel computing, and visualization. In this article, we provide an overview of this growing multi-disciplinary research area, outline the basic techniques, and provide brief coverage of how they are used in some applications. We discuss the role of high performance and parallel computing in data mining problems, and we provide a brief overview of a few applications in science data analysis. We conclude by listing challenges and opportunities for future research.
Pages: 99-115
Page count: 17