Offline Imitation Learning with Model-based Reverse Augmentation

Published: 2024-08-24 Volume: 8 Page: 2608-2617
Container-title: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

Author:

Shao Jie-Jing¹, Shi Hao-Sen¹, Guo Lan-Zhe², Li Yu-Feng³

Affiliation:

1. National Key Laboratory for Novel Software Technology, Nanjing University, Nanjing, China

2. National Key Laboratory for Novel Software Technology, School of Intelligence Science and Technology, Nanjing University, Nanjing, China

3. National Key Laboratory for Novel Software Technology, School of Artificial Intelligence, Nanjing University, Nanjing, China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3637528.3672059

References: 54 articles.

1. Gaon An, Seungyong Moon, Jang-Hyun Kim, and Hyun Oh Song. 2021. Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble. In Advances in Neural Information Processing Systems 34. Virtual Event, 7436--7447.

2. A survey of inverse reinforcement learning: Challenges, methods and progress

3. Modeling Human Driving Behavior Through Generative Adversarial Imitation Learning

4. Inducing structure in reward learning by learning features

5. Tom B. Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, and Dario Amodei. 2020. Language Models are Few-Shot Learners. In Advances in Neural Information Processing Systems 33. Virtual Event, 1877--1901.