1. Ddpg-based decision-making strategy of adaptive cruising for heavy vehicles considering stability;Sun;IEEE Access,2020
2. Despot: online pomdp planning with regularization;Ye;J Artif Intell Res,2017
3. S. Carr, N. Jansen, S. Junges, U. Topcu, Safe reinforcement learning via shielding for pomdps, ArXiv abs/2204.00755. doi:0.48550/arXiv.2204.00755.
4. Recurrent model-free rl can be a strong baseline for many pomdps;Ni,2021
5. Combining planning and deep reinforcement learning in tactical decision making for autonomous driving;Hoel;IEEE Transactions on Intelligent Vehicles,2020