强化学习中的迁移:方法和进展

被引：26

作者：

王皓

高阳

陈兴国

机构：

[1] 南京大学软件新技术国家重点实验室

来源：

电子学报 | 2008年 / 36(S1)卷 / S1期

关键词：

迁移学习; 强化学习; 知识; 行为; 认知心理学; 抽象; 泛化;

D O I：

暂无

中图分类号：

TP181 [自动推理、机器学习];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

传统机器学习方法认为不同的学习任务彼此无关,但事实上不同的学习任务常常相互关联.迁移学习试图利用任务之间的联系,利用过去的学习经验加速对于新任务的学习.机器学习各分支都已展开了对迁移学习的研究.本文综述了强化学习的迁移技术,依据认知心理学的理论将现有技术分为行为迁移和知识迁移两大类,并介绍、分析了各自的特点,并提出了一些开放性的问题.

引用

页码：39 / 43

页数：5

共 4 条

[1]

Transfer of Experience Between Reinforcement Learning Environments with Progressive Difficulty[J] . Michael G. Madden,Tom Howley.Artificial Intelligence Review . 2004 (3)

[2]

Proto-value functions:developmental reinforce-ment learning .2 Mahadevan S. Proceedings of the Twenty-second Interna-tional Conference on Machine Learning . 2005

[3]

Improving actionselectionin MDP’svia knowledge transfer .2 Sherstove A A,Stone P. Proceedings of the Twentieth Na-tional Conference on Artificial Intelligence . 2005

[4]

Autonomous transfer forreinforcement learning .2 Taylor M E,Kuhlmann G,Stone P. Proceedings of the SeventeenthInternational Conference on Autonomous Agents and Multi-A-gent Systems . 2008

← 1 →