基于强化学习的模型选择和超参数优化

被引：12

作者：

吴佳

陈森朋

陈修云

周瑞

机构：

[1] 电子科技大学信息与软件工程学院

来源：

电子科技大学学报 | 2020年 / 02期

关键词：

深度强化学习; 超参数优化; LSTM网络; 机器学习; 模型选择;

D O I：

暂无

中图分类号：

TP181 [自动推理、机器学习];

学科分类号：

摘要：

随着机器学习技术的不断发展,机器学习算法种类的增多以及模型复杂度提高,造成了实践应用中的两大难题:算法模型选择及模型超参数优化。为了实现模型选择和超参数优化的自动处理,该文提出了一种基于深度强化学习的优化方法。利用长短期记忆(LSTM)网络构建一个智能体(Agent),自动选择机器学习算法模型及对应的超参数组合。该智能体以最大化机器学习模型在验证数据集上的准确率为目标,利用所选择的模型在验证数据集上的准确率作为奖赏值(reward),通过强化学习算法不断学习直到找到最优的模型以及超参数组合。为了验证该方法的可行性及性能,在UCI标准数据集上将其与传统优化方法中基于树状结构Parzen的估计方法和随机搜索方法进行比较。多次实验结果证明该优化方法在稳定性、时间效率、准确度方面均具有优势。

引用

页码：255 / 261

页数：7

共 6 条

[1] Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition
Sainath, Tara N.
Weiss, Ron J.
Wilson, Kevin W.
Li, Bo
Narayanan, Arun
Variani, Ehsan
Bacchiani, Michiel
Shafran, Izhak
Senior, Andrew
Chin, Kean
Misra, Ananya
Kim, Chanwoo
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (05) : 965 - 979
[2] The WEKA data mining software[J] . Mark Hall,Eibe Frank,Geoffrey Holmes,Bernhard Pfahringer,Peter Reutemann,Ian H. Witten.ACM SIGKDD Explorations Newsletter . 2009 (1)
[3] Long short-term memory
Hochreiter, S
Schmidhuber, J
[J]. NEURAL COMPUTATION, 1997, 9 (08) : 1735 - 1780
[4] Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning[J] . Ronald J. Williams.Machine Learning . 1992 (3)
[5] Inverse Compositional Spatial Transformer Networks .2 Lin C H,Lucey S. 2017 IEEE Conference on Computer Vision and Pattern Recognition(CVPR 2017) . 2017
[6] Google''s Neural Machine Translation System:Bridging the Gap between Human and Machine Translation .2 Wu Y,Schuster M,Chen Z,et al. . 2016

← 1 →