A Novel Approach to the Problem of Non-uniqueness of the Solution in Hierarchical Clustering

被引:31
作者
Cattinelli, Isabella [1 ,2 ]
Valentini, Giorgio [1 ]
Paulesu, Eraldo [2 ,3 ]
Borghese, Nunzio Alberto [1 ]
机构
[1] Univ Milan, Dept Comp Sci, I-20135 Milan, Italy
[2] Univ Milano Bicocca, Dept Psychol, I-20126 Milan, Italy
[3] IRCSS Galeazzi, I-20126 Milan, Italy
关键词
Bioinformatics; dendrogram equivalence relation; hierarchical clustering (HC); neuroimaging; ORDER;
D O I
10.1109/TNNLS.2013.2247058
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
The existence of multiple solutions in clustering, and in hierarchical clustering in particular, is often ignored in practical applications. However, this is a non-trivial problem, as different data orderings can result in different cluster sets that, in turns, may lead to different interpretations of the same data. The method presented here offers a solution to this issue. It is based on the definition of an equivalence relation over dendrograms that allows developing all and only the significantly different dendrograms for the same dataset, thus reducing the computational complexity to polynomial from the exponential obtained when all possible dendrograms are considered. Experimental results in the neuroimaging and bioinformatics domains show the effectiveness of the proposed method.
引用
收藏
页码:1166 / 1173
页数:9
相关论文
共 23 条
[1]
Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[2]
Reading the reading brain: A new meta-analysis of functional imaging data on reading [J].
Cattinelli, Isabella ;
Borghese, N. Alberto ;
Gallucci, Marcello ;
Paulesu, Eraldo .
JOURNAL OF NEUROLINGUISTICS, 2013, 26 (01) :214-238
[3]
REVIEW OF CLASSIFICATION [J].
CORMACK, RM .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-GENERAL, 1971, 134 :321-+
[4]
How does gene expression clustering work? [J].
D'haeseleer, P .
NATURE BIOTECHNOLOGY, 2005, 23 (12) :1499-1501
[5]
Solving non-uniqueness in agglomerative hierarchical clustering using multidendrograms [J].
Fernandez, Alberto ;
Gomez, Sergio .
JOURNAL OF CLASSIFICATION, 2008, 25 (01) :43-65
[6]
Reducing and filtering point clouds with enhanced vector quantization [J].
Ferrari, Stefano ;
Ferrigno, Giancarlo ;
Piuri, Vincenzo ;
Borghese, N. Alberto .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2007, 18 (01) :161-177
[7]
THE ROLE OF SIMILARITY IN CATEGORIZATION - PROVIDING A GROUNDWORK [J].
GOLDSTONE, RL .
COGNITION, 1994, 52 (02) :125-157
[8]
Guo Y., 2008, Advances in Neural Information Processing Systems, V20, P601
[9]
Data clustering: 50 years beyond K-means [J].
Jain, Anil K. .
PATTERN RECOGNITION LETTERS, 2010, 31 (08) :651-666
[10]
Evaluation of the dual route theory of reading: a metanalysis of 35 neuroimaging studies [J].
Jobard, G ;
Crivello, F ;
Tzourio-Mazoyer, N .
NEUROIMAGE, 2003, 20 (02) :693-712