Essential dynamics:: A tool for efficient trajectory compression and management

被引:96
作者
Meyer, T
Ferrer-Costa, C
Pérez, A
Rueda, M
Bidon-Chanal, A
Luque, FJ
Laughton, CA
Orozco, M
机构
[1] Inst Rec Biomed Barcelona, Barcelona 08028, Spain
[2] Univ Barcelona, Fac Farm, Dept Fisicoquim, E-08028 Barcelona, Spain
[3] Univ Nottingham, Ctr Biomol Sci, Nottingham NG7 2RD, England
[4] Dept Bioquim & Biol Mol, Barcelona 08028, Spain
[5] Barcelona Supercomp Ctr, Computat Biol Program, Barcelona 08028, Spain
[6] Inst Nacl Bioinformat, Bioinformat Struct Node, Barcelona 08028, Spain
关键词
D O I
10.1021/ct050285b
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
We present a simple method for compression and management of very large molecular dynamics trajectories. The approach is based on the projection of the Cartesian snapshots collected along the trajectory into an orthogonal space defined by the eigenvectors obtained by diagonalization of the covariance matrix. The transformation is mathematically exact when the number of eigenvectors equals 3N-6 (N being the number of atoms), and in practice very accurate even when the number of eigenvectors is much smaller, permitting a dramatic reduction in the size of trajectory files. In addition, we have examined the ability of the method, when combined with interpolation, to recover dense samplings (snapshots collected at a high frequency) from more sparse (lower frequency) data as a method for further data compression. Finally, we have investigated the possibility of using the approach when extrapolating the behavior of the system to times longer than the original simulation period. Overall our results suggest that the method is an attractive alternative to current approaches for including dynamic information in static structure files such as those deposited in the Protein Data Bank.
引用
收藏
页码:251 / 258
页数:8
相关论文
共 19 条
[1]   ESSENTIAL DYNAMICS OF PROTEINS [J].
AMADEI, A ;
LINSSEN, ABM ;
BERENDSEN, HJC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1993, 17 (04) :412-425
[2]  
Anderson E., 1995, LAPACK USERS GUIDE
[3]   Molecular dynamics simulations of a nucleosome and free DNA [J].
Bishop, TC .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 2005, 22 (06) :673-685
[4]  
CASE DA, 1999, AMBER8 0 COMPUTER PR
[5]   A 2ND GENERATION FORCE-FIELD FOR THE SIMULATION OF PROTEINS, NUCLEIC-ACIDS, AND ORGANIC-MOLECULES [J].
CORNELL, WD ;
CIEPLAK, P ;
BAYLY, CI ;
GOULD, IR ;
MERZ, KM ;
FERGUSON, DM ;
SPELLMEYER, DC ;
FOX, T ;
CALDWELL, JW ;
KOLLMAN, PA .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1995, 117 (19) :5179-5197
[6]   PARTICLE MESH EWALD - AN N.LOG(N) METHOD FOR EWALD SUMS IN LARGE SYSTEMS [J].
DARDEN, T ;
YORK, D ;
PEDERSEN, L .
JOURNAL OF CHEMICAL PHYSICS, 1993, 98 (12) :10089-10092
[7]  
de Groot BL, 1998, PROTEINS, V31, P116, DOI 10.1002/(SICI)1097-0134(19980501)31:2<116::AID-PROT2>3.0.CO
[8]  
2-K
[9]   COMPARISON OF SIMPLE POTENTIAL FUNCTIONS FOR SIMULATING LIQUID WATER [J].
JORGENSEN, WL ;
CHANDRASEKHAR, J ;
MADURA, JD ;
IMPEY, RW ;
KLEIN, ML .
JOURNAL OF CHEMICAL PHYSICS, 1983, 79 (02) :926-935
[10]  
Lehoucq R. B., 1997, ARPACK USERS GUIDE S