A strategy for analysis of (molecular) equilibrium simulations: Configuration space density estimation, clustering, and visualization

被引:24
作者
Hamprecht, FA [1 ]
Peter, C
Daura, X
Thiel, W
van Gunsteren, WF
机构
[1] ETH Zentrum, Swiss Fed Inst Technol, Chem Phys Lab, CH-8092 Zurich, Switzerland
[2] Max Planck Inst Kohlenforsch, D-45470 Mulheim, Germany
关键词
D O I
10.1063/1.1330216
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
We propose an approach for summarizing the output of long simulations of complex systems, affording a rapid overview and interpretation. First, multidimensional scaling techniques are used in conjunction with dimension reduction methods to obtain a low-dimensional representation of the configuration space explored by the system. A nonparametric estimate of the density of states in this subspace is then obtained using kernel methods. The free energy surface is calculated from that density, and the configurations produced in the simulation are then clustered according to the topography of that surface, such that all configurations belonging to one local free energy minimum form one class. This topographical cluster analysis is performed using basin spanning trees which we introduce as subgraphs of Delaunay triangulations. Free energy surfaces obtained in dimensions lower than four can be visualized directly using iso-contours and -surfaces. Basin spanning trees also afford a glimpse of higher-dimensional topographies. The procedure is illustrated using molecular dynamics simulations on the reversible folding of peptide analoga. Finally, we emphasize the intimate relation of density estimation techniques to modern enhanced sampling algorithms. (C) 2001 American Institute of Physics.
引用
收藏
页码:2079 / 2089
页数:11
相关论文
共 62 条
[1]   OPTIMAL PROTOCOL AND TRAJECTORY VISUALIZATION FOR CONFORMATIONAL SEARCHES OF PEPTIDES AND PROTEINS [J].
ABAGYAN, R ;
ARGOS, P .
JOURNAL OF MOLECULAR BIOLOGY, 1992, 225 (02) :519-532
[2]   ESSENTIAL DYNAMICS OF PROTEINS [J].
AMADEI, A ;
LINSSEN, ABM ;
BERENDSEN, HJC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1993, 17 (04) :412-425
[3]  
[Anonymous], MATCH COMMUNICATIONS
[4]   The Quickhull algorithm for convex hulls [J].
Barber, CB ;
Dobkin, DP ;
Huhdanpaa, H .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1996, 22 (04) :469-483
[5]   CLUSTERING OF CHEMICAL STRUCTURES ON THE BASIS OF 2-DIMENSIONAL SIMILARITY MEASURES [J].
BARNARD, JM ;
DOWNS, GM .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1992, 32 (06) :644-649
[6]   Determination of equilibrium properties of biomolecular systems using multidimensional adaptive umbrella sampling [J].
Bartels, C ;
Schaefer, M ;
Karplus, M .
JOURNAL OF CHEMICAL PHYSICS, 1999, 111 (17) :8048-8067
[7]   A Sequential Niche Technique for Multimodal Function Optimization [J].
Beasley, David ;
Bull, David R. ;
Martin, Ralph R. .
EVOLUTIONARY COMPUTATION, 1993, 1 (02) :101-125
[8]  
Becker OM, 1998, J COMPUT CHEM, V19, P1255, DOI 10.1002/(SICI)1096-987X(199808)19:11<1255::AID-JCC5>3.0.CO
[9]  
2-J
[10]   Quantitative visualization of a macromolecular potential energy ''funnel'' [J].
Becker, OM .
JOURNAL OF MOLECULAR STRUCTURE-THEOCHEM, 1997, 398 :507-516