A Poisson-based adaptive affinity propagation clustering for SAGE data

被引:16
作者
Tang, DongMing [1 ]
Zhu, QingXin [1 ]
Yang, Fan [1 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610054, Peoples R China
基金
中国国家自然科学基金;
关键词
Poisson distribution; Serial analysis of gene expression; Affinity Propagation; Clustering; GENE-EXPRESSION DATA; SERIAL ANALYSIS; MODEL; LIBRARIES;
D O I
10.1016/j.compbiolchem.2009.11.001
中图分类号
Q [生物科学];
学科分类号
090105 [作物生产系统与生态工程];
摘要
Serial analysis of gene expression (SAGE) is a powerful tool to obtain gene expression profiles. Clustering analysis is a valuable technique for analyzing SAGE data. In this paper, we propose an adaptive clustering method for SAGE data analysis, namely, PoissonAPS. The method incorporates a novel clustering algorithm, Affinity Propagation (AP). While AP algorithm has demonstrated good performance on many different data sets, it also faces several limitations. PoissonAPS overcomes the limitations of AP using the clustering validation measure as a cost function of merging and splitting, and as a result, it can automatically cluster SAGE data without user-specified parameters. We evaluated PoissonAPS and compared its performance with other methods on several real life SAGE datasets. The experimental results show that PoissonAPS can produce meaningful and interpretable clusters for SAGE data. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:63 / 70
页数:8
相关论文
共 21 条
[1]
Genomic analysis of mouse retinal development [J].
Blackshaw, S ;
Harpavat, S ;
Trimarchi, J ;
Cai, L ;
Huang, HY ;
Kuo, WP ;
Weber, G ;
Lee, K ;
Fraioli, RE ;
Cho, SH ;
Yung, R ;
Asch, E ;
Ohno-Machado, L ;
Wong, WH ;
Cepko, CL .
PLOS BIOLOGY, 2004, 2 (09) :1411-1431
[2]
Comment on "clustering by passing messages between data points" [J].
Brusco, Michael J. ;
Koehn, Hans-Friedrich .
SCIENCE, 2008, 319 (5864)
[3]
Buckhaults P, 2003, CANCER RES, V63, P4144
[4]
Clustering analysis of SAGE data using a Poisson approach [J].
Cai, L ;
Huang, HY ;
Blackshaw, S ;
Liu, JS ;
Cepko, C ;
Wong, WH .
GENOME BIOLOGY, 2004, 5 (07)
[5]
Clustering by passing messages between data points [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2007, 315 (5814) :972-976
[6]
Response to comment on "clustering by passing messages between data points" [J].
Frey, Brendan J. ;
Dueck, Delbert .
SCIENCE, 2008, 319 (5864)
[7]
Huang Haiyan, 2008, V387, P185
[8]
Larsen B., 1999, Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
[9]
Clustering by soft-constraint affinity propagation: applications to gene-expression data [J].
Leone, Michele ;
Sumedha ;
Weigt, Martin .
BIOINFORMATICS, 2007, 23 (20) :2708-2715
[10]
Identifying differential expression in multiple SAGE libraries: an overdispersed log-linear model approach [J].
Lu, J ;
Tomfohr, JK ;
Kepler, TB .
BMC BIOINFORMATICS, 2005, 6 (1)