On the use of information theory for assessing molecular diversity

被引:39
作者
Agrafiotis, DK
机构
[1] 3-Dimensional Pharmaceuticals, Inc., Exton, PA 19341
来源
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES | 1997年 / 37卷 / 03期
关键词
D O I
10.1021/ci960156b
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In a recent article published in Molecules, Lin presented a novel approach for assessing molecular diversity based on Shannon's information theory. In this method, a set of compounds is viewed as a static collection of microstates which can register information about their environment at some-predetermined capacity. Diversity is directly related to the information conveyed by the population, as quantified by Shannon's classical entropy equation. Despite its intellectual appeal, this method is characterized by a strong tendency to oversample remote areas of the feature space and produce unbalanced designs. This paper demonstrates this limitation with some simple examples and provides a rationale for the failure of the method to produce results that are consistent with other traditional methodologies.
引用
收藏
页码:576 / 580
页数:5
相关论文
共 13 条
[1]  
AGRAFIOTIS DK, 1997, IN PRESS THEOCHEM
[2]  
AGRAFIOTIS DK, UNPUB NEW METRICS AL
[3]   A possibilistic approach to clustering - Comments [J].
Barni, M ;
Cappellini, V ;
Mecocci, A .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1996, 4 (03) :393-396
[4]  
Bezdek J.C., 2013, Pattern Recognition With Fuzzy Objective Function Algorithms
[5]  
GATLIN LL, 1972, INFORMATION THEORY L
[6]  
Johnson M., 1990, CONCEPTS APPL MOL SI
[7]   The possibilistic C-means algorithm: Insights and recommendations [J].
Krishnapuram, R ;
Keller, JM .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 1996, 4 (03) :385-393
[8]  
Krishnapuram R., 1993, IEEE Transactions on Fuzzy Systems, V1, P98, DOI 10.1109/91.227387
[9]  
KUBINYI H, 1993, QSAR HANSCH ANAL REL, P172
[10]  
LIN SK, 1996, MOLECULES, V1, P57