Clustering feature vectors with mixed numerical and categorical attributes

被引:6
作者
Brouwer, Roelof K.
机构
[1] Department of Mechanical and Mechatronics Engineering, Stellenbosch University
基金
加拿大自然科学与工程研究理事会;
关键词
Fuzzy clustering; gradient descent; categorical; nominal clustering; fuzzy c-means;
D O I
10.1080/18756891.2008.9727625
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes a method for finding a fuzzy membership matrix in case of numerical and categorical features. The set of feature vectors with mixed features is mapped to a set of feature vectors with only real valued components with the condition that the new set of vectors has the same proximity matrix as the original feature vectors. This new set of vectors is then clustered using fuzzy c-means. Simulations show the method to be very effective in comparison with other methods.
引用
收藏
页码:285 / 298
页数:14
相关论文
共 61 条
[1]   A method to compute distance between two categorical values of same attribute in unsupervised learning for categorical data set [J].
Ahmad, Amir ;
Dey, Lipika .
PATTERN RECOGNITION LETTERS, 2007, 28 (01) :110-118
[2]  
Anderberg M.R., 1973, Probability and Mathematical Statistics
[3]   Bi-level clustering of mixed categorical and numerical biomedical data [J].
Andreopoulos, Bill ;
An, Aijun ;
Wang, Xiaogang .
INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2006, 1 (01) :19-56
[4]  
[Anonymous], 1999, Fuzzy Cluster Analysis
[5]  
[Anonymous], 1996, P PAC RIM KNOWL ACQS
[6]  
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[7]  
Asuncion A., 2007, MACHINE LEARNING REP
[8]  
Barbara D., 2002, Proceedings of the Eleventh International Conference on Information and Knowledge Management. CIKM 2002, P582, DOI 10.1145/584792.584888
[9]  
BEZDEK JC, 1991, PATTERN RECOGN, V24, P783
[10]  
BROUWER RK, 2006, TRUCSCIG200601 TR