A novel approach to noise clustering for outlier detection

被引:48
作者
Rehm, Frank [1 ]
Klawonn, Frank
Kruse, Rudolf
机构
[1] German Aerosp Ctr, Braunschweig, Germany
[2] Univ Appl Sci Braunschweig Wolfenbuettel, Braunschweig, Germany
[3] Univ Magdeburg, D-39106 Magdeburg, Germany
关键词
noise clustering; outlier detection; fuzzy clustering;
D O I
10.1007/s00500-006-0112-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Noise clustering, as a robust clustering method, performs partitioning of data sets reducing errors caused by outliers. Noise clustering defines outliers in terms of a certain distance, which is called noise distance. The probability or membership degree of data points belonging to the noise cluster increases with their distance to regular clusters. The main purpose of noise clustering is to reduce the influence of outliers on the regular clusters. The emphasis is not put on exactly identifying outliers. However, in many applications outliers contain important information and their correct identification is crucial. In this paper we present a method to estimate the noise distance in noise clustering based on the preservation of the hypervolume of the feature space. Our examples will demonstrate the efficiency of this approach.
引用
收藏
页码:489 / 494
页数:6
相关论文
共 15 条
[2]  
Dave R. N., 1997, P 7 INT FUZZ SYST AS, V3, P205
[3]   CHARACTERIZATION AND DETECTION OF NOISE IN CLUSTERING [J].
DAVE, RN .
PATTERN RECOGNITION LETTERS, 1991, 12 (11) :657-664
[4]   Generalized noise clustering as a robust fuzzy c-M-estimators model [J].
Davé, RN ;
Sen, S .
1998 CONFERENCE OF THE NORTH AMERICAN FUZZY INFORMATION PROCESSING SOCIETY - NAFIPS, 1998, :256-260
[5]   Robust clustering methods: A unified view [J].
Dave, RN ;
Krishnapuram, R .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1997, 5 (02) :270-293
[6]   Fast and robust general purpose clustering algorithms [J].
Estivill-Castro, V ;
Yang, J .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (02) :127-150
[7]   UNSUPERVISED OPTIMAL FUZZY CLUSTERING [J].
GATH, I ;
GEVA, AB .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (07) :773-781
[8]  
Gustafson D. E., 1979, Proceedings of the 1978 IEEE Conference on Decision and Control Including the 17th Symposium on Adaptive Processes, P761
[9]  
Hawkins D.M, 1980, IDENTIFICATION OUTLI, V11, DOI [10.1007/978-94-015-3994-4, DOI 10.1007/978-94-015-3994-4]
[10]  
Klawonn F, 2004, ADV SOFT COMP, P133