Algorithmic dimensionality reduction for molecular structure analysis

被引:60
作者
Brown, W. Michael [1 ]
Martin, Shawn [1 ]
Pollock, Sara N. [2 ,3 ,4 ]
Coutsias, Evangelos A. [4 ]
Watson, Jean-Paul [1 ]
机构
[1] Sandia Natl Labs, Albuquerque, NM 87185 USA
[2] Univ New Mexico, Dept Biochem, Albuquerque, NM 87131 USA
[3] Univ New Mexico, Mol Biol Div Biocomp, Albuquerque, NM 87131 USA
[4] Univ New Mexico, Dept Math, Albuquerque, NM 87131 USA
关键词
D O I
10.1063/1.2968610
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Dimensionality reduction approaches have been used to exploit the redundancy in a Cartesian coordinate representation of molecular motion by producing low-dimensional representations of molecular motion. This has been used to help visualize complex energy landscapes, to extend the time scales of simulation, and to improve the efficiency of optimization. Until recently, linear approaches for dimensionality reduction have been employed. Here, we investigate the efficacy of several automated algorithms for nonlinear dimensionality reduction for representation of trans, trans-1,2,4-trifluorocyclo-octane conformation-a molecule whose structure can be described on a 2-manifold in a Cartesian coordinate phase space. We describe an efficient approach for a deterministic enumeration of ring conformations. We demonstrate a drastic improvement in dimensionality reduction with the use of nonlinear methods. We discuss the use of dimensionality reduction algorithms for estimating intrinsic dimensionality and the relationship to the Whitney embedding theorem. Additionally, we investigate the influence of the choice of high-dimensional encoding on the reduction. We show for the case studied that, in terms of reconstruction error root mean square deviation, Cartesian coordinate representations and encodings based on interatom distances provide better performance than encodings based on a dihedral angle representation. (C) 2008 American Institute of Physics.
引用
收藏
页数:13
相关论文
共 81 条
[1]  
Abseher R, 2000, PROTEINS, V39, P82, DOI 10.1002/(SICI)1097-0134(20000401)39:1<82::AID-PROT9>3.0.CO
[2]  
2-S
[3]   A self-organizing principle for learning nonlinear manifolds [J].
Agrafiotis, DK ;
Xu, HF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (25) :15869-15872
[4]   MOLECULAR MECHANICS - THE MM3 FORCE-FIELD FOR HYDROCARBONS .1. [J].
ALLINGER, NL ;
YUH, YH ;
LII, JH .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1989, 111 (23) :8551-8566
[5]   Dihedral angle principal component analysis of molecular dynamics simulations [J].
Altis, Alexandros ;
Nguyen, Phuong H. ;
Hegger, Rainer ;
Stock, Gerhard .
JOURNAL OF CHEMICAL PHYSICS, 2007, 126 (24)
[6]   ESSENTIAL DYNAMICS OF PROTEINS [J].
AMADEI, A ;
LINSSEN, ABM ;
BERENDSEN, HJC .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1993, 17 (04) :412-425
[7]  
Amadei A, 1999, PROTEINS, V35, P283, DOI 10.1002/(SICI)1097-0134(19990515)35:3<283::AID-PROT2>3.3.CO
[8]  
2-I
[9]   An efficient method for sampling the essential subspace of proteins [J].
Amadei, A ;
Linssen, ABM ;
deGroot, BL ;
vanAalten, DMF ;
Berendsen, HJC .
JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS, 1996, 13 (04) :615-625
[10]  
ANDERSON E, 2003, LAPACK USERS GUIDE