Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R

被引:1341
作者
Langfelder, Peter [1 ]
Zhang, Bin [2 ]
Horvath, Steve [1 ]
机构
[1] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[2] Rosetta Inpharmat Merck Res Labs, Seattle, WA USA
关键词
D O I
10.1093/bioinformatics/btm563
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Hierarchical clustering is a widely used method for detecting clusters in genomic data. Clusters are defined by cutting branches off the dendrogram. A common but inflexible method uses a constant height cutoff value; this method exhibits suboptimal performance on complicated dendrograms. We present the Dynamic Tree Cut R package that implements novel dynamic branch cutting methods for detecting clusters in a dendrogram depending on their shape. Compared to the constant height cutoff method, our techniques offer the following advantages: (1) they are capable of identifying nested clusters; (2) they are flexiblecluster shape parameters can be tuned to suit the application at hand; (3) they are suitable for automation; and (4) they can optionally combine the advantages of hierarchical clustering and partitioning around medoids, giving better detection of outliers. We illustrate the use of these methods by applying them to proteinprotein interaction network data and to a simulated gene expression data set.
引用
收藏
页码:719 / 720
页数:2
相关论文
共 8 条
  • [1] Fuzzy C-means method for clustering microarray data
    Dembélé, D
    Kastner, P
    [J]. BIOINFORMATICS, 2003, 19 (08) : 973 - 980
  • [2] Understanding network concepts in modules
    Dong, Jun
    Horvath, Steve
    [J]. BMC SYSTEMS BIOLOGY, 2007, 1
  • [3] Dudoit S, 2002, GENOME BIOL, V3
  • [4] Integrating genetic and network analysis to characterize genes related to mouse weight
    Ghazalpour, Anatole
    Doss, Sudheer
    Zhang, Bin
    Wang, Susanna
    Plaisier, Christopher
    Castellanos, Ruth
    Brozell, Alec
    Schadt, Eric E.
    Drake, Thomas A.
    Lusis, Aldons J.
    Horvath, Steve
    [J]. PLOS GENETICS, 2006, 2 (08): : 1182 - 1192
  • [5] Mixture modelling of gene expression data from microarray experiments
    Ghosh, D
    Chinnaiyan, AM
    [J]. BIOINFORMATICS, 2002, 18 (02) : 275 - 286
  • [6] Evaluation and comparison of gene clustering methods in microarray analysis
    Thalamuthu, Anbupalam
    Mukhopadhyay, Indranil
    Zheng, Xiaojing
    Tseng, George C.
    [J]. BIOINFORMATICS, 2006, 22 (19) : 2405 - 2412
  • [7] van der Laan MJ, 2003, J STAT PLAN INFER, V117, P275, DOI 10.1016/S0378-3758(02)00388-9
  • [8] [No title captured]