Simulated annealing based automatic fuzzy clustering combined with ANN classification for analyzing microarray data

被引:28
作者
Maulik, Ujjwal [2 ]
Mukhopadhyay, Anirban [1 ]
机构
[1] Univ Kalyani, Dept Comp Sci & Engn, Kalyani 741235, W Bengal, India
[2] Jadavpur Univ, Dept Comp Sci & Engn, Kolkata 700032, India
关键词
Microarray gene expression data; Fuzzy clustering; Cluster validity indices; Variable configuration length simulated annealing; Artificial neural network; Gene ontology; GENE-EXPRESSION DATA; TRANSCRIPTIONAL PROGRAM; C-MEANS; ALGORITHM; PATTERNS; SYSTEM; YEAST;
D O I
10.1016/j.cor.2009.02.025
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Microarray technology has made it possible to monitor the expression levels of many genes simultaneously across a number of experimental conditions. Fuzzy clustering is an important tool for analyzing microarray gene expression data. In this article, a real-coded Simulated Annealing (VSA) based fuzzy clustering method with variable length configuration is developed and combined with popular Artificial Neural Network (ANN) based classifier. The idea is to refine the clustering produced by VSA using ANN classifier to obtain improved clustering performance. The proposed technique is used to cluster three publicly available real life microarray data sets. The superior performance of the proposed technique has been demonstrated by comparing with some widely used existing clustering algorithms. Also statistical significance test has been conducted to establish the statistical significance of the superior performance of the proposed clustering algorithm. Finally biological relevance of the clustering solutions are established. (C) 2009 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1369 / 1380
页数:12
相关论文
共 41 条
[1]   Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J].
Alizadeh, AA ;
Eisen, MB ;
Davis, RE ;
Ma, C ;
Lossos, IS ;
Rosenwald, A ;
Boldrick, JG ;
Sabet, H ;
Tran, T ;
Yu, X ;
Powell, JI ;
Yang, LM ;
Marti, GE ;
Moore, T ;
Hudson, J ;
Lu, LS ;
Lewis, DB ;
Tibshirani, R ;
Sherlock, G ;
Chan, WC ;
Greiner, TC ;
Weisenburger, DD ;
Armitage, JO ;
Warnke, R ;
Levy, R ;
Wilson, W ;
Grever, MR ;
Byrd, JC ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
NATURE, 2000, 403 (6769) :503-511
[2]   Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays [J].
Alon, U ;
Barkai, N ;
Notterman, DA ;
Gish, K ;
Ybarra, S ;
Mack, D ;
Levine, AJ .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1999, 96 (12) :6745-6750
[3]  
ANDERSEN LN, 1997, P IEEE WORKSH NEUR N, P24
[4]  
[Anonymous], 2005, NEURAL NETWORKS PATT
[5]  
[Anonymous], 2007, Analysis of Biological Data: A Soft Computing Approach
[6]  
[Anonymous], Pattern Recognition with Fuzzy Objective Function Algorithms
[7]   Clustering using simulated annealing with probabilistic redistribution [J].
Bandyopadhyay, S ;
Maulik, U ;
Pakhira, MK .
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2001, 15 (02) :269-285
[8]   An improved algorithm for clustering gene expression data [J].
Bandyopadhyay, Sanghamitra ;
Mukhopadhyay, Anirban ;
Maulik, Ujjwal .
BIOINFORMATICS, 2007, 23 (21) :2859-2865
[9]   Multiobjective genetic clustering for pixel classification in remote sensing imagery [J].
Bandyopadhyay, Sanghamitra ;
Maulik, Ujjwal ;
Mukhopadhyay, Anirban .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2007, 45 (05) :1506-1511
[10]   Quantitative comparison of the performance of SAR segmentation algorithms [J].
Caves, R ;
Quegan, S ;
White, R .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1998, 7 (11) :1534-1546