1. Bai, C. (2019). Active sampling for deep Q-Learning based on TD-error adaptive correction. Journal of Computer Research and Development.
2. Cao, X., Wan, H., Lin, Y., & Han, S. (2019). High-value prioritized experience replay for off-policy reinforcement learning. 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI), 1510-1514.
3. Chen, G. (2016). Research on anti-follower jamming performance of variable rate frequency hopping communications. Fire Control & Command Control.
4. Dou, Z. (2021). A power allocation algorithm based on cooperative Q-learning for multi-agent D2D communication networks. Physical Communication.
5. Frikha, M. S. (2021). Reinforcement and deep reinforcement learning for wireless Internet of Things: A survey. Computer Communications.