Robust cluster analysis of microarray gene expression data with the number of clusters determined biologically

Cited by: 32
Authors
Bickel, DR [1]
Affiliations
[1] Med Coll Georgia, Off Biostat & Bioinformat, Augusta, GA 30912 USA
DOI
10.1093/bioinformatics/btg092
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification codes
071010; 081704
Abstract
Motivation: The success of each method of cluster analysis depends on how well its underlying model describes the patterns of expression. Outlier-resistant and distribution-insensitive clustering of genes is robust against violations of model assumptions.
Results: A measure of dissimilarity that combines advantages of the Euclidean distance and the correlation coefficient is introduced. The measure can be made robust using a rank order correlation coefficient. A robust graphical method of summarizing the results of cluster analysis and a biological method of determining the number of clusters are also presented. These methods are applied to a public data set, showing that rank-based methods perform better than log-based methods.
Pages: 818-824
Page count: 7
Related papers
24 in total
  • [1] Tissue classification with gene expression profiles
    Ben-Dor, A
    Bruhn, L
    Friedman, N
    Nachman, I
    Schummer, M
    Yakhini, Z
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2000, 7 (3-4) : 559 - 583
  • [2] BICKEL DR, 2002, IN PRESS COMPUTING S
  • [3] Gene expression data analysis
    Brazma, A
    Vilo, J
    [J]. FEBS LETTERS, 2000, 480 (01) : 17 - 24
  • [4] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [5] D'haeseleer P, 1998, INFORMATION PROCESSING IN CELLS AND TISSUES, P203
  • [6] Daskin M. S., 1995, NETWORK DISCRETE LOC
  • [7] Exploring the metabolic and genetic control of gene expression on a genomic scale
    DeRisi, JL
    Iyer, VR
    Brown, PO
    [J]. SCIENCE, 1997, 278 (5338) : 680 - 686
  • [8] Donoho D. L., 1983, FESTSCHRIFT EL LEHMA
  • [9] Dudoit S, 2002, GENOME BIOL, V3
  • [10] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868