Principles of data mining

被引:262
作者
Hand, David J. [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Math, London SW7 2AZ, England
关键词
Data Mining; Adverse Drug Reaction; Pattern Discovery; Selectivity Bias; Poor Quality Data;
D O I
10.2165/00002018-200730070-00010
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 [公共卫生与预防医学]; 120402 [社会医学与卫生事业管理];
摘要
Data mining is the discovery of interesting, unexpected or valuable structures in large datasets. As such, it has two rather different aspects. One of these concerns large-scale, 'global' structures, and the aim is to model the shapes, or features of the shapes, of distributions. The other concerns small-scale, 'local' structures, and the aim is to detect these anomalies and decide if they are real or chance occurrences. In the context of signal detection in the pharmaceutical sector, most interest lies in the second of the above two aspects; however, signal detection occurs relative to an assumed background model, therefore, some discussion of the first aspect is also necessary. This paper gives a lightning overview of data mining and its relation to statistics, with particular emphasis on tools for the detection of adverse drug reactions.
引用
收藏
页码:621 / 622
页数:2
相关论文
共 2 条
[1]
Hand DJ, 2000, STAT SCI, V15, P111
[2]
HAND DJ, 2001, PRINICIPLES DAT MINI