Fuzzy clustering as a means of selecting representative conformers and molecular alignments

被引:39
作者
Feher, M [1 ]
Schmidt, JM [1 ]
机构
[1] SignalGene Inc, Guelph, ON N1G 4P7, Canada
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 2003年 / 43卷 / 03期
关键词
D O I
10.1021/ci0200671
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper describes the first application of fuzzy c-means clustering for the selection of representatives from assemblies of conformations or alignments. In case of alignments, their quality is taken into account using a weighted c-means scheme, developed in this work. The performance of fuzzy cluster validity measures, such as compactness, partition function, and entropy, are studied on several examples, but the visual 3D representation of data points is shown to be most beneficial in determining the optimum number of clusters. Fuzzy clustering is expected to perform better than crisp clustering methods in cases where there are a significant number of "outliers", such as in molecular dynamics simulations and molecular alignments.
引用
收藏
页码:810 / 818
页数:9
相关论文
共 18 条
[1]  
[Anonymous], PATTERN RECOGNITION
[2]  
Bezdek JC, 1974, J CYBERNETICS, V3, P58, DOI [10.1080/01969727308546047, DOI 10.1080/01969727308546047]
[3]  
*CHEM COMP GROUP I, 2002, MOL OP ENV VERS 2000
[4]   A fuzzy hybrid hierarchical clustering method with a new criterion able to find the optimal partition [J].
Devillez, A ;
Billaudel, P ;
Lecolier, GV .
FUZZY SETS AND SYSTEMS, 2002, 128 (03) :323-338
[5]  
Dunn J. C., 1973, Journal of Cybernetics, V3, P32, DOI 10.1080/01969727308546046
[6]  
Dunn JC., 1977, Fuzzy Automata and Decision Processes
[7]   Identifying potential binding modes and explaining partitioning behavior using flexible alignments and multidimensional scaling [J].
Feher, M ;
Schmidt, JM .
JOURNAL OF COMPUTER-AIDED MOLECULAR DESIGN, 2001, 15 (12) :1065-1083
[8]   Metric and multidimensional scaling: Efficient tools for clustering molecular conformations [J].
Feher, M ;
Schmidt, JM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2001, 41 (02) :346-353
[9]   An automated approach for clustering an ensemble of NMR-derived protein structures into conformationally related subfamilies [J].
Kelley, LA ;
Gardner, SP ;
Sutcliffe, MJ .
PROTEIN ENGINEERING, 1996, 9 (11) :1063-1065
[10]  
Krzanowski W.J., 2000, PRINCIPLES MULTIVARI