Clustering techniques

被引:37
作者
Michaud, P
机构
关键词
unsupervised learning; partitioning criteria; neural network; new Condorcet criterion;
D O I
10.1016/S0167-739X(97)00017-4
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Given a population of individuals described by a set of attribute variables, clustering them into ''similar'' groups has many applications. The clustering problem, also known as unsupervised learning, is the problem of partitioning a population into clusters (or classes). The population is a set of n elements that can be clients, products, shops, agencies, etc., described by m attributes. These attributes can be quantitative (salary), categorical (type of profession) or binary (owner of a credit card). The goal is to construct a partition in which elements of a cluster are ''similar'' and elements of different clusters are ''dissimilar'' in terms of the m attributes. Here we define the clustering problem and discuss the ideas behind some of the major approaches, including a relatively new method, called RDA/AREVOMS, that is based on the theory of voting.
引用
收藏
页码:135 / 147
页数:13
相关论文
共 25 条
[1]  
ANDERBERG M, 1973, CLUSTER ANAL APPLICA
[2]  
[Anonymous], 1996, ADV KNOWLEDGE DISCOV
[3]  
ARROW KJ, 1974, SOCIAL CHOICE INDIVI
[4]   MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING [J].
BANFIELD, JD ;
RAFTERY, AE .
BIOMETRICS, 1993, 49 (03) :803-821
[5]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[6]  
FORGY EW, 1965, BIOMETRICS, V21, P768
[7]  
Kohonen T., 1995, SELF ORG MAPS
[8]  
MacQueen J, 1965, P 5 BERK S MATH STAT, V7, P281
[9]  
Marcotorchino J. F., 1981, METHODS OPERATIONS R, V43, P395
[10]  
McLachlan G.J., 1988, Mixture models: inference and applications to clustering, V38