Hybrid hierarchical clustering with applications to microarray data

被引:95
作者
Chipman, H [1 ]
Tibshirani, R
机构
[1] Acadia Univ, Dept Math & Stat, Wolfville, NS B4P 2R6, Canada
[2] Stanford Univ, Dept Hlth Res & Policy, Stanford, CA 94305 USA
[3] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
关键词
bottom-up clustering; mutual cluster; top-down clustering;
D O I
10.1093/biostatistics/kxj007
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In this paper, we propose a hybrid clustering method that combines the strengths of bottom-up hierarchical clustering with that of top-down clustering. The first method is good at identifying small clusters but not large ones; the strengths are reversed for the second method. The hybrid method is built on the new idea of a mutual cluster: a group of points closer to each other than to any other points. Theoretical connections between mutual clusters and bottom-up clustering methods are established, aiding in their interpretation and providing an algorithm for identification of mutual clusters. We illustrate the technique on simulated and real microarray datasets.
引用
收藏
页码:286 / 301
页数:16
相关论文
共 12 条
[11]   DISSIMILARITY ANALYSIS - NEW TECHNIQUE OF HIERARCHICAL SUB-DIVISION [J].
MACNAUGHTONSMIT.P ;
MOCKETT, LG ;
DALE, MB ;
WILLIAMS, WT .
NATURE, 1964, 202 (493) :1034-&
[12]   Gene expression patterns of breast carcinomas distinguish tumor subclasses with clinical implications [J].
Sorlie, T ;
Perou, CM ;
Tibshirani, R ;
Aas, T ;
Geisler, S ;
Johnsen, H ;
Hastie, T ;
Eisen, MB ;
van de Rijn, M ;
Jeffrey, SS ;
Thorsen, T ;
Quist, H ;
Matese, JC ;
Brown, PO ;
Botstein, D ;
Lonning, PE ;
Borresen-Dale, AL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (19) :10869-10874