Nearest-neighbor nonparametric method for estimating the configurational entropy of complex molecules

被引:88
作者
Hnizdo, Vladimir [1 ]
Darian, Eva
Fedorowicz, Adam
Demchuk, Eugene
Li, Shengqiao
Singh, Harshinder
机构
[1] NIOSH, Morgantown, WV 26505 USA
[2] Agcy Tox Subst & Dis Registry, Atlanta, GA 30333 USA
[3] W Virginia Univ, Sch Pharm, Morgantown, WV 26506 USA
[4] W Virginia Univ, Dept Stat, Morgantown, WV 26506 USA
关键词
configurational entropy; internal rotation; nearest neighbor; mutual information; computer simulations; STATISTICAL THERMODYNAMICS; COMPUTER-SIMULATIONS; INTERNAL-ROTATION; FORCE-FIELD; BETA-SHEET; MACROMOLECULES; CONFORMATION; DYNAMICS; ABSOLUTE; <LEU5>ENKEPHALIN;
D O I
10.1002/jcc.20589
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
A method for estimating the configurational (i.e., non-kinetic) part of the entropy of internal motion in complex molecules is introduced that does not assume any particular parametric form for the underlying probability density function. It is based on the nearest-neighbor (NN) distances of the points of a sample of internal molecular coordinates obtained by a computer simulation of a given molecule. As the method does not make any assumptions about the underlying potential energy function, it accounts fully for any anharmonicity of internal molecular motion. It provides an asymptotically unbiased and consistent estimate of the configurational part of the entropy of the internal degrees of freedom of the molecule. The NN method is illustrated by estimating the configurational entropy of internal rotation of capsaicin and two stereoisomers of tartaric acid, and by providing a much closer upper bound on the configurational entropy of internal rotation of a pentapeptide molecule than that obtained by the standard quasi-harmonic method. As a measure of dependence between any two internal molecular coordinates, a general coefficient of association based on the information-theoretic quantity of mutual information is proposed. Using NN estimates of this measure, statistical clustering procedures can be employed to group the coordinates into clusters of manageable dimensions and characterized by minimal dependence between coordinates belonging to different clusters. (C) 2006 Wiley Periodicals, Inc.*
引用
收藏
页码:655 / 668
页数:14
相关论文
共 37 条