A memetic-aided approach to hierarchical clustering from distance matrices: application to gene expression clustering and phylogeny

被引:12
作者
Cotta, C
Moscato, P
机构
[1] Univ Malaga, Dept Lenguajes & Ciencias Computac, ETSI Informat, E-29071 Malaga, Spain
[2] Univ Newcastle, Sch Elect Engn & Comp Sci, Fac Engn & Built Environm, Callaghan, NSW 2308, Australia
关键词
hierarchical clustering; memetic algorithms; phylogenetic inference; gene expression; data mining;
D O I
10.1016/S0303-2647(03)00136-9
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We propose a heuristic approach to hierarchical clustering from distance matrices based on the use of memetic algorithms (MAs). By using MAs to solve some variants of the Minimum Weight Hamiltonian Path Problem on the input matrix, a sequence of the individual elements to be clustered (referred to as patterns) is first obtained. While this problem is also NP-hard, a probably optimal sequence is easy to find with the current advances for this problem and helps to prune the space of possible solutions and/or to guide the search performed by an actual clustering algorithm. This technique has been successfully applied to both a Branch-and-Bound algorithm, and to evolutionary algorithms and MAs. Experimental results are given in the context of phylogenetic inference and in the hierarchical clustering of gene expression data. (C) 2003 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:75 / 97
页数:23
相关论文
共 60 条
[1]  
Abu-Mostafa Y. S., 1990, Journal of Complexity, V6, P192, DOI 10.1016/0885-064X(90)90006-Y
[2]   Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling [J].
Alizadeh, AA ;
Eisen, MB ;
Davis, RE ;
Ma, C ;
Lossos, IS ;
Rosenwald, A ;
Boldrick, JG ;
Sabet, H ;
Tran, T ;
Yu, X ;
Powell, JI ;
Yang, LM ;
Marti, GE ;
Moore, T ;
Hudson, J ;
Lu, LS ;
Lewis, DB ;
Tibshirani, R ;
Sherlock, G ;
Chan, WC ;
Greiner, TC ;
Weisenburger, DD ;
Armitage, JO ;
Warnke, R ;
Levy, R ;
Wilson, W ;
Grever, MR ;
Byrd, JC ;
Botstein, D ;
Brown, PO ;
Staudt, LM .
NATURE, 2000, 403 (6769) :503-511
[3]  
[Anonymous], 2000, 4 ANN INT C COMP MOL
[4]  
Back T., 1995, Proceedings of the 6th International Conference on Genetic Algorithms, P2
[5]   RECOGNITION OF TREE METRICS [J].
BANDELT, HJ .
SIAM JOURNAL ON DISCRETE MATHEMATICS, 1990, 3 (01) :1-6
[6]  
BENNETT CH, 2002, SCI AM MAR, P32
[7]  
Berry V, 2000, PROCEEDINGS OF THE ELEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P287
[8]  
BICKLE T, 1995, P 6 INT C GEN ALG, P9
[9]   Big trees from little genomes: mitochondrial gene order as a phylogenetic tool [J].
Boore, JL ;
Brown, WM .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 1998, 8 (06) :668-674
[10]   Ancient horizontal gene transfer [J].
Brown, JR .
NATURE REVIEWS GENETICS, 2003, 4 (02) :121-132