1. Energy optimization of wind turbines via a neural control policy based on reinforcement learning Markov chain Monte Carlo algorithm;Aghaei;Applied Energy,2023
2. Solving transition independent decentralized Markov decision processes;Becker;Journal of Artificial Intelligence Research,2004
3. A Markovian decision process;Bellman;Indiana University Mathematics Journal,1957
4. Learning interaction-aware guidance for trajectory optimization in dense traffic scenarios;Brito;IEEE Transactions on Intelligent Transportation Systems,2022
5. Budgeted reinforcement learning in continuous state space;Carrara,2019