Sequential Sampling for Noisy Optimisation with CMA-ES

被引:2
作者
Groves, Matthew [1 ]
Branke, Juergen [2 ]
机构
[1] Univ Warwick, Math Inst, Warwick, England
[2] Univ Warwick, Warwick Business Sch, Warwick, England
来源
GECCO'18: PROCEEDINGS OF THE 2018 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE | 2018年
关键词
Bayesian Sampling; Knowledge Gradient; CMA-ES; Noisy Optimisation; EVOLUTIONARY OPTIMIZATION; ALGORITHMS; ENVIRONMENTS;
D O I
10.1145/3205455.3205559
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a novel sequential sampling scheme to allocate samples to individuals in order to maximally inform the selection step in Covariance Matrix Adaptation Evolution Strategies (CMA-ES) for noisy function optimisation. More specifically, we adopt the well-known Knowledge Gradient (KG) method to minimise the Kullback-Leibler divergence (relative entropy) between the distribution used for generating the next offspring population based on the mu selected individuals, and the distribution based on the true mu best individuals that would have been chosen in the absence of noise. Empirical tests demonstrate the benefit of integrating sequential sampling into CMA-ES, and that the proposed KG technique specifically adapted to the needs of CMA-ES indeed outperforms a more straightforward application of KG.
引用
收藏
页码:1023 / 1030
页数:8
相关论文
共 23 条
[1]  
Ackley D., 1987, A Connectionist Machine for Genetic Hillclimbing
[2]  
[Anonymous], 2016, Derivations for linear algebra and optimization
[3]  
Bartz-Beielstein T., 2007, METAHEURISTICS
[4]   Evolutionary algorithms in noisy environments: theoretical issues and guidelines for practice [J].
Beyer, HG .
COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2000, 186 (2-4) :239-267
[5]  
Bishop Christopher M, 2016, Pattern recognition and machine learning
[6]  
Branke J, 2003, LECT NOTES COMPUT SC, V2723, P766
[7]   Efficient Simulation Budget Allocation for Selecting an Optimal Subset [J].
Chen, Chun-Hung ;
He, Donghai ;
Fu, Michael ;
Lee, Loo Hay .
INFORMS JOURNAL ON COMPUTING, 2008, 20 (04) :579-595
[8]   Sequential Sampling to Myopically Maximize the Expected Value of Information [J].
Chick, Stephen E. ;
Branke, Juergen ;
Schmidt, Christian .
INFORMS JOURNAL ON COMPUTING, 2010, 22 (01) :71-80
[9]  
Finck S., 2010, 200921 RES CTR PPE
[10]   A KNOWLEDGE-GRADIENT POLICY FOR SEQUENTIAL INFORMATION COLLECTION [J].
Frazier, Peter I. ;
Powell, Warren B. ;
Dayanik, Savas .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2008, 47 (05) :2410-2439