POEM: Parameter optimization using ensemble methods: Application to target specific scoring functions

被引:20
作者
Antes, I [1 ]
Merkwirth, C [1 ]
Lengauer, T [1 ]
机构
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
关键词
D O I
10.1021/ci050036g
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
In computational biology processes such as docking, binding, and folding are often described by simplified, empirical models. These models are fitted to physical properties of the process by adjustable parameters. An appropriate choice of these parameters is crucial for the quality of the models. Locating the best choices for the parameters is often is a difficult task, depending on the complexity of the model. We describe a new method and program, POEM (Parameter Optimization using Ensemble Methods), for this task. In POEM we combine the DOE (Design Of Experiment) procedure with ensembles of different regression methods. We apply the method to the optimization of target specific scoring functions in molecular docking. The method consists of an iterative procedure that uses alternate evaluation and prediction steps. During each cycle of optimization we fit an approximate function to a defined loss function landscape and improve the quality of this fit from cycle to cycle by constantly augmenting our data set. As test applications we fitted the FlexX and Screenscore scoring functions to the kinase and ATPase protein classes. The results are promising: Starting from random parameters we are able to locate parameter sets which show superior performance compared to the original values. The POEM approach converges quickly and the approximated loss function landscapes are smooth, thus making the approach a suitable method for optimizations on rugged landscapes.
引用
收藏
页码:1291 / 1302
页数:12
相关论文
共 28 条
[1]  
[Anonymous], 1989, GENETIC ALGORITHM SE
[2]  
[Anonymous], NONCONVEX OPTIMIZATI
[3]  
[Anonymous], 1998, NEURAL NETWORKS TRIC
[5]   Bagging predictors [J].
Breiman, L .
MACHINE LEARNING, 1996, 24 (02) :123-140
[6]   Protein flexibility in ligand docking and virtual screening to protein kinases [J].
Cavasotto, CN ;
Abagyan, RA .
JOURNAL OF MOLECULAR BIOLOGY, 2004, 337 (01) :209-225
[7]   ATPases as drug targets:: Learning from their structure [J].
Chène, P .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (09) :665-673
[8]  
Friedman J., 2001, The elements of statistical learning, V1, DOI DOI 10.1007/978-0-387-21606-5
[9]   Knowledge-based scoring function to predict protein-ligand interactions [J].
Gohlke, H ;
Hendlich, M ;
Klebe, G .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 295 (02) :337-356
[10]   NEURAL NETWORK ENSEMBLES [J].
HANSEN, LK ;
SALAMON, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (10) :993-1001