Looking for natural patterns in data - Part 1. Density-based approach

Cited by: 202
Authors
Daszykowski, M [1]
Walczak, B [1]
Massart, DL [1]
Affiliations
[1] FABI VUB, ChemoAC, B-1090 Brussels, Belgium
Keywords
pattern recognition; density-based clustering; outliers and inliers identification
DOI
10.1016/S0169-7439(01)00111-3
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
A density-based unsupervised clustering approach for detecting natural patterns in data (hereafter denoted NP) is presented, and its performance is illustrated for data sets with different types of clusters. NP works for arbitrary clusters, is a single-scan technique, requires no presumptions regarding data distribution, and requires only one input parameter: the minimal number of objects considered to form a cluster. Moreover, a comparison of NP with partitioning approaches is presented. NP can be applied not only for data clustering, but also for the identification of outliers. © 2001 Elsevier Science B.V. All rights reserved.
Pages: 83-92
Number of pages: 10