Minkowski metric, feature weighting and anomalous cluster initializing in K-Means clustering

被引:204
作者
de Amorim, Renato Cordeiro [1 ]
Mirkin, Boris [1 ,2 ,3 ]
机构
[1] Birkbeck Univ London, Dept Comp Sci & Informat Syst, London WC1E 7XH, England
[2] Birkbeck Univ London, Dept Comp Sci, London WC1E 7XH, England
[3] Natl Res Univ, Dept Data Anal & Machine Intelligence, Higher Sch Econ, Moscow, Russia
关键词
K-means; Minkowski metric; Feature weights; Noise features; Anomalous cluster; FEATURE-SELECTION; ALGORITHM; KERNEL;
D O I
10.1016/j.patcog.2011.08.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper represents another step in overcoming a drawback of K-Means, its lack of defense against noisy features, using feature weights in the criterion. The Weighted K-Means method by Huang et al. (2008, 2004, 2005) [5-7] is extended to the corresponding Minkowski metric for measuring distances. Under Minkowski metric the feature weights become intuitively appealing feature rescaling factors in a conventional K-Means criterion. To see how this can be used in addressing another issue of K-Means, the initial setting, a method to initialize K-Means with anomalous clusters is adapted. The Minkowski metric based method is experimentally validated on datasets from the UCI Machine Learning Repository and generated sets of Gaussian clusters, both as they are and with additional uniform random noise features, and appears to be competitive in comparison with other K-Means based feature weighting algorithms. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1061 / 1075
页数:15
相关论文
共 49 条
[1]  
Aggarwal CC, 2001, LECT NOTES COMPUT SC, V1973, P420
[2]  
Amorim R. C., 2010, BBKS1001 BIRKB U LON
[3]  
[Anonymous], 2004, ICML
[4]  
[Anonymous], P C ADV NEUR INF PRO
[5]  
[Anonymous], 2010, NETLAB NEURAL NETWOR
[6]  
[Anonymous], 2000, ICML
[7]  
[Anonymous], 2010, WORLD ACAD SCI ENG T
[8]  
[Anonymous], 2007, Uci machine learning repository
[9]  
[Anonymous], Pattern Recognition with Fuzzy Objective Function Algorithms
[10]  
[Anonymous], P 12 EUR S ART NEUR