AN ORDER THEORETIC FRAMEWORK FOR OVERLAPPING CLUSTERING

被引:21
作者
BANDELT, HJ
DRESS, AWM
机构
[1] UNIV BIELEFELD,FAK MATH,D-33501 BIELEFELD,GERMANY
[2] UNIV HAMBURG,MATH SEMINAR,D-20146 HAMBURG,GERMANY
关键词
D O I
10.1016/0012-365X(94)00105-R
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Cluster analysis deals with procedures which - given a finite collection X of objects together with some kind of local dissimilarity information - identify those subcollections C of objects from X, called clusters, which exhibit a comparatively low degree of internal dissimilarity. In this note we study arbitrary mappings phi which assign to each subcollection A subset of or equal to X of objects its internal degree of dissimilarity phi(A), subject only to the natural condition that A subset of or equal to B subset of or equal to X implies phi(A)less than or equal to phi(B), and we analyse on a rather abstract, purely order theoretic level how assumptions concerning the way such a mapping cp might be constructed from local data (that is, data involving only a few objects at a time) influence the degree of overlapping observed within the resulting family of clusters, - and vice versa. Hence, unlike previous order theoretic approaches to cluster analysis, we do not restrict our attention to nonoverlapping, hierarchical clustering. Instead, we regard a dissimilarity function phi as an arbitrary isotone mapping from a finite partially ordered set Q - e.g. the set P(X) of all subsets A of a finite set X - into a (partially) ordered set R - e.g. the nonnegative real numbers - and we study the correspondence between the two subsets C(phi) and D(phi) of Q, formed by the elements whose images are inaccessible from above and from below, respectively. While D(phi) constitutes the local data structure from which phi can be built up, C(phi) embodies the family of clusters associated with phi. Our results imply that in case Q:=P(X) and R:=R(greater than or equal to 0) one has #D less than or equal to n for all D is an element of D(phi) and some fixed n is an element of N if and only if [GRAPHICS] for all C-0,..., C-n is an element of C(phi) if and only if this holds for all subsets C-0,..., C-n subset of or equal to X, generalizing a well-known criterion for n-conformity of hypergraphs as well as corresponding results due to Batbedat, dealing with the case n=2.
引用
收藏
页码:21 / 37
页数:17
相关论文
共 21 条
[1]   THE TENSOR PRODUCT OF CONTINUOUS LATTICES [J].
BANDELT, HJ .
MATHEMATISCHE ZEITSCHRIFT, 1980, 172 (01) :89-96
[2]  
BANDELT HJ, 1989, B MATH BIOL, V51, P133, DOI 10.1016/S0092-8240(89)80053-9
[3]  
BANDELT HJ, IN PRESS J CLASSIFIC
[4]  
BATBEDAT A, 1993, CR ACAD SCI I-MATH, V316, P865
[5]  
BATBEDAT A, 1990, APPROACHES PYRAMIDAL
[6]  
Batbedat A., 1989, METRON, V47, P35
[7]  
Batbedat A., 1988, METRON, V46, P47
[8]  
Berge C., 1989, HYPERGRAPHS, V45
[9]  
Edelman Paul H., 1985, GEOMETRIAE DEDICATA, V19, P247, DOI DOI 10.1007/BF00149365
[10]  
GOLUMBIC MC, 1985, J COMB THEORY B, V28, P8