Data mining: Statistics and more?

被引:175
作者
Hand, DJ [1 ]
机构
[1] Open Univ, Dept Stat, Milton Keynes MK7 6AA, Bucks, England
关键词
databases; exploratory data analysis; knowledge discovery;
D O I
10.2307/2685468
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Data mining is a new discipline lying at the interface of statistics, database technology, pattern recognition, machine learning, and other areas. It is concerned with the secondary analysis of large databases in order to find previously unsuspected relationships which are of interest or value to the database owners. New problems arise, partly as a consequence of the sheer size of the data sets involved, and partly because of issues of pattern matching. However, since statistics provides the intellectual glue underlying the effort, it is important for statisticians to become involved. There are very real opportunities for statisticians to make significant contributions.
引用
收藏
页码:112 / 118
页数:7
相关论文
共 23 条
[1]  
[Anonymous], 1990, Statistical Science, DOI [10.1214/ss/1177012165, DOI 10.1214/SS/1177012165)]
[2]  
BABCOCK C, 1994, COMPUTER WORLD, P6
[3]   Advanced scout: Data mining and knowledge discovery in NBA data [J].
Bhandari, I ;
Colet, E ;
Parker, J ;
Pines, Z ;
Pratap, R ;
Ramanujam, K .
DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1 (01) :121-125
[4]  
BOX G, 1965, TECHNOMETRICS, V7, P57
[5]   Inference for non-random samples [J].
Chesher, A .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1997, 59 (01) :77-95
[6]  
CORTES C, 1997, UNPUB MEGA MONITORIN
[7]  
Fayyad U. M., 1996, ADV KNOWLEDGE DISCOV, P1, DOI DOI 10.1609/AIMAG.V17I3.1230
[8]  
Fayyad U.M., 1996, Advances in Knowledge Discovery and Data Mining, P471
[9]  
Fayyad UM, 1997, DATA MIN KNOWL DISC, V1, P5, DOI 10.1023/A:1009715820935
[10]  
GALE WA, 1993, HDB STAT, V9, P535