Pairwise data clustering by deterministic annealing

Cited by: 242
Authors
Hofmann, T
Buhmann, JM
Affiliation
[1] Rheinische Friedrich-Wilhelms-Universität, Institut für Informatik III, D-53117 Bonn
DOI
10.1109/34.566806
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Partitioning a data set and extracting hidden structure from the data are tasks that arise in different application areas of pattern recognition and of speech and image processing. Pairwise data clustering is a combinatorial optimization method for data grouping which extracts hidden structure from proximity data. We describe a deterministic annealing approach to pairwise clustering which shares the robustness properties of maximum entropy inference. The resulting Gibbs probability distributions are estimated by mean-field approximation. A new structure-preserving algorithm is discussed which clusters dissimilarity data and simultaneously embeds these data in a Euclidean vector space; it can be used for dimensionality reduction and data visualization. The suggested embedding algorithm, which outperforms conventional approaches, has been implemented to analyze dissimilarity data from protein analysis and from linguistics. The algorithm for pairwise data clustering is used to segment textured images.
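The annealing idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name pairwise_da_clustering, its parameters, the noise level, and the synchronous update order are all assumptions made for illustration, and the energy used is a simplified mean-field form (the average dissimilarity of an object to a cluster, minus half the average dissimilarity within that cluster). Soft assignments are a Gibbs (softmax) distribution over these energies, and the inverse temperature beta is raised step by step.

```python
import numpy as np


def pairwise_da_clustering(D, n_clusters, betas, n_inner=100, tol=1e-6, seed=0):
    """Hypothetical sketch: soft-cluster objects from a dissimilarity matrix D.

    D is an (n, n) symmetric dissimilarity matrix; betas is an increasing
    sequence of inverse temperatures. Returns (n, n_clusters) soft assignments.
    """
    rng = np.random.default_rng(seed)
    n = D.shape[0]
    M = np.full((n, n_clusters), 1.0 / n_clusters)  # uniform soft assignments
    for beta in betas:
        # Assumption: a small perturbation at each temperature breaks the
        # symmetry between clusters so they can split once beta is large enough.
        M = M + 1e-3 * rng.random((n, n_clusters))
        M /= M.sum(axis=1, keepdims=True)
        for _ in range(n_inner):
            # Column-normalised weights: W[k, v] is the share of cluster v held by object k.
            W = M / np.maximum(M.sum(axis=0, keepdims=True), 1e-12)
            avg = D @ W                               # avg[i, v]: mean dissimilarity of i to cluster v
            within = np.einsum("kv,kv->v", W, avg)    # mean dissimilarity inside cluster v
            E = avg - 0.5 * within                    # simplified mean-field energies
            logits = -beta * E
            logits -= logits.max(axis=1, keepdims=True)  # numerical stability
            M_new = np.exp(logits)
            M_new /= M_new.sum(axis=1, keepdims=True)    # Gibbs distribution per object
            converged = np.abs(M_new - M).max() < tol
            M = M_new
            if converged:
                break
    return M
```

A call such as pairwise_da_clustering(D, 2, np.geomspace(1e-3, 1.0, 15)) returns one row of cluster probabilities per object; taking the argmax of each row gives a hard partition. The schedule bounds here are arbitrary and would need to be scaled to the magnitude of the dissimilarities.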
Pages: 1-14
Number of pages: 14