图书章节

Imitation Learning 收藏

模仿学习

原文求助发布源

作者

Zihan Ding.1

作者单位

1.Imperial College LondonLondonUK

关键词

Imitation learning ;Apprenticeship learning ;Demonstration ;Reinforcement learning ;Behavioral cloning ;Inverse reinforcement learning ;Generative adversarial networks ;Sample efficiency

关键词译文

模仿学习;学徒学习;示范;强化学习;行为克隆;逆向强化学习;生成对抗网络;样本效率

发布日期

30 June 2020

页码

273-306

DOI

10.1007/978-981-15-4095-0_8

来源信息

Deep Reinforcement Learning ISBN：9789811540943, 2020年, 273-306页

摘要

To alleviate the low sample efficiency problem in deep reinforcement learning, imitation learning, or called apprenticeship learning, is one of the potential approaches, which leverages the expert demonstrations in sequential decision-making process. In order to provide the readers a comprehensive understanding about how to effectively extract information from the demonstration data, we introduce the most important categories in imitation learning, including behavioral cloning, inverse reinforcement learning, imitation learning from observations, probabilistic methods, and other methods. Imitation learning can either be regarded as an initialization or a guidance for training the agent in the scope of reinforcement learning. Combination of imitation learning and reinforcement learning is a promising direction for efficient learning and faster policy optimization in practice.

摘要译文

缓解深度强化学习，模仿学习或学徒学习中的样本效率低问题是一种潜在的方法，该方法在顺序决策过程中利用了专家论证。为了向读者提供有关如何从演示数据中有效提取信息的全面理解，我们介绍了模仿学习中最重要的类别，包括行为克隆，逆强化学习，基于观察的模仿学习，概率方法和其他方法。模仿学习可以被视为在强化学习范围内培训代理的初始化或指导。模仿学习与强化学习相结合是有效学习和更快地在实践中优化政策的有希望的方向。

Zihan Ding.1. Imitation Learning. Deep Reinforcement Learning[M].DE: Springer, 2020: 273-306