Domain Adaptation for Imitation Learning Using Generative Adversarial Network-Reference-Cited by-同舟云学术

Domain Adaptation for Imitation Learning Using Generative Adversarial Network

Published:2021-07-09 Issue:14 Volume:21 Page:4718
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Nguyen Duc Tho,Tran Chanh Minh^ORCID,Tan Phan Xuan^ORCID,Kamioka Eiji^ORCID

Abstract

Imitation learning is an effective approach for an autonomous agent to learn control policies when an explicit reward function is unavailable, using demonstrations provided from an expert. However, standard imitation learning methods assume that the agents and the demonstrations provided by the expert are in the same domain configuration. Such an assumption has made the learned policies difficult to apply in another distinct domain. The problem is formalized as domain adaptive imitation learning, which is the process of learning how to perform a task optimally in a learner domain, given demonstrations of the task in a distinct expert domain. We address the problem by proposing a model based on Generative Adversarial Network. The model aims to learn both domain-shared and domain-specific features and utilizes it to find an optimal policy across domains. The experimental results show the effectiveness of our model in a number of tasks ranging from low to complex high-dimensional.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/21/14/4718/pdf

Reference43 articles.

1. A survey of robot learning from demonstration

2. One-shot imitation learning;Duan;arXiv,2017

3. Time-Contrastive Networks: Self-Supervised Learning from Video

4. Imitation from Observation: Learning to Imitate Behaviors from Raw Video via Context Translation

5. Is imitation learning the route to humanoid robots?

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dynamic Motion Planning Model for Multirobot Using Graph Neural Network and Historical Information;Advanced Intelligent Systems;2023-04-18

2. Research on PV mode diffusion considering the game among enterprises in the complex network context;2023-04-13