A unifying criterion for unsupervised clustering and feature selection

被引:58
作者
Breaban, Mihaela [1 ]
Luchian, Henri [1 ]
机构
[1] Alexandru Ioan Cuza Univ, Fac Comp Sci, Iasi 700483, Romania
关键词
Unsupervised feature selection; Unsupervised clustering; Global optimization; INDEXES;
D O I
10.1016/j.patcog.2010.10.006
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Exploratory data analysis methods are essential for getting insight into data. Identifying the most important variables and detecting quasi-homogenous groups of data are problems of interest in this context. Solving such problems is a difficult task, mainly due to the unsupervised nature of the underlying learning process. Unsupervised feature selection and unsupervised clustering can be successfully approached as optimization problems by means of global optimization heuristics if an appropriate objective function is considered. This paper introduces an objective function capable of efficiently guiding the search for significant features and simultaneously for the respective optimal partitions. Experiments conducted on complex synthetic data suggest that the function we propose is unbiased with respect to both the number of clusters and the number of features. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:854 / 865
页数:12
相关论文
共 29 条
[1]  
Aggarwal CC, 2001, LECT NOTES COMPUT SC, V1973, P420
[2]  
[Anonymous], P WORKSH PROB GRAPH
[3]  
[Anonymous], 2000, Pattern Classification
[4]  
[Anonymous], 2003, Genetic programming IV: routine human-competitive machine intelligence
[5]  
[Anonymous], 1998, Feature Extraction, Construction and Selection: A Data Mining Perspective
[6]  
Bautu E, 2005, SEVENTH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, PROCEEDINGS, P325
[7]   Some new indexes of cluster validity [J].
Bezdek, JC ;
Pal, NR .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (03) :301-315
[8]   Fuzzy Subspace Clustering [J].
Borgelt, Christian .
ADVANCES IN DATA ANALYSIS, DATA HANDLING AND BUSINESS INTELLIGENCE, 2010, :93-103
[9]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[10]  
Diday E., 1982, ELEMENTS ANAL DONNEE