1. Remember and forget for experience replay;novati;Proc Int Conf Mach Learn,2019
2. A deeper look at experience replay;zhang;arXiv 1712 01275,2017
3. Deep Deterministic Policy Gradient With Compatible Critic Network
4. Diffusion policies as an expressive policy class for offline reinforcement learning;wang;arXiv 2208 06193,2022
5. Continual reinforcement learning with multi-timescale replay;kaplanis;arXiv 2004 07530,2020