POEM: Parameter optimization using ensemble methods: Application to target specific scoring functions

被引:20
作者
Antes, I [1 ]
Merkwirth, C [1 ]
Lengauer, T [1 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
D O I
10.1021/ci050036g
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
In computational biology processes such as docking, binding, and folding are often described by simplified, empirical models. These models are fitted to physical properties of the process by adjustable parameters. An appropriate choice of these parameters is crucial for the quality of the models. Locating the best choices for the parameters is often is a difficult task, depending on the complexity of the model. We describe a new method and program, POEM (Parameter Optimization using Ensemble Methods), for this task. In POEM we combine the DOE (Design Of Experiment) procedure with ensembles of different regression methods. We apply the method to the optimization of target specific scoring functions in molecular docking. The method consists of an iterative procedure that uses alternate evaluation and prediction steps. During each cycle of optimization we fit an approximate function to a defined loss function landscape and improve the quality of this fit from cycle to cycle by constantly augmenting our data set. As test applications we fitted the FlexX and Screenscore scoring functions to the kinase and ATPase protein classes. The results are promising: Starting from random parameters we are able to locate parameter sets which show superior performance compared to the original values. The POEM approach converges quickly and the approximated loss function landscapes are smooth, thus making the approach a suitable method for optimizations on rugged landscapes.
引用
收藏
页码:1291 / 1302
页数:12
相关论文
共 28 条
[11]  
Igel C., 2000, P 2 INT ICSC S NEUR, V2000, P115
[12]   Target-biased scoring approaches and expert systems in structure-based virtual screening [J].
Jansen, JM ;
Martin, EJ .
CURRENT OPINION IN CHEMICAL BIOLOGY, 2004, 8 (04) :359-364
[13]   A methodology for fitting and validating metamodels in simulation [J].
Kleijnen, JPC ;
Sargent, RG .
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2000, 120 (01) :14-29
[14]  
Krogh A., 1995, Advances in Neural Information Processing Systems 7, P231
[15]   Optimum design for artificial neural networks: an example in a bicycle derailleur system [J].
Lin, TY ;
Tseng, CH .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2000, 13 (01) :3-14
[16]   All-atom empirical potential for molecular modeling and dynamics studies of proteins [J].
MacKerell, AD ;
Bashford, D ;
Bellott, M ;
Dunbrack, RL ;
Evanseck, JD ;
Field, MJ ;
Fischer, S ;
Gao, J ;
Guo, H ;
Ha, S ;
Joseph-McCarthy, D ;
Kuchnir, L ;
Kuczera, K ;
Lau, FTK ;
Mattos, C ;
Michnick, S ;
Ngo, T ;
Nguyen, DT ;
Prodhom, B ;
Reiher, WE ;
Roux, B ;
Schlenkrich, M ;
Smith, JC ;
Stote, R ;
Straub, J ;
Watanabe, M ;
Wiórkiewicz-Kuczera, J ;
Yin, D ;
Karplus, M .
JOURNAL OF PHYSICAL CHEMISTRY B, 1998, 102 (18) :3586-3616
[17]   Fast nearest-neighbor searching for nonlinear signal processing [J].
Merkwirth, C ;
Parlitz, U ;
Lauterborn, W .
PHYSICAL REVIEW E, 2000, 62 (02) :2089-2097
[18]  
Montgomery D. C., 1991, DESIGN ANAL EXPT
[19]   Virtual screening for kinase targets [J].
Muegge, I ;
Enyedy, IJ .
CURRENT MEDICINAL CHEMISTRY, 2004, 11 (06) :693-707
[20]   A general and fast scoring function for protein-ligand interactions: A simplified potential approach [J].
Muegge, I ;
Martin, YC .
JOURNAL OF MEDICINAL CHEMISTRY, 1999, 42 (05) :791-804