1. Proximal policy optimization algorithms;Schulman,2017
2. Playing Atari with deep reinforcement learning;Mnih,2013
3. A survey of meta-reinforcement learning;Beck,2023
4. Model-agnostic meta-learning for fast adaptation of deep networks;Finn,2017
5. A survey on offline reinforcement learning: Taxonomy, review, and open problems