1. Reinforcement learning;Dayan;Stevens’ Handbook Exp Psychol,2002
2. Reinforcement Learning with TensorFlow;Dutta,2018
3. Fu Q, Song A. Adaptive modulation for underwater acoustic communications based on reinforcement learning. OCEANS 2018 MTS/IEEE Charleston; 2018, 1–8.
4. Reinforcement learning-based resource allocation for streaming in a multi-modal deep space network;Ha,2021
5. Categorical reparametrization with gumble-softmax;Jang;International Conference on Learning Representations,2017