1. Continuous control with deep reinforcement learning;Lillicrap,2016
2. V. Mnih, A.P. Badia, M. Mirza, A. Graves, T. Lillicrap, T. Harley, D. Silver, K. Kavukcuoglu, Asynchronous methods for deep reinforcement learning, in: International Conference on Machine Learning, 2016, pp. 1928–1937.
3. J. Schulman, S. Levine, P. Abbeel, M. Jordan, P. Moritz, Trust region policy optimization, in: International Conference on Machine Learning, 2015, pp. 1889–1897.
4. Deep reinforcement learning for pedestrian collision avoidance and human-machine cooperative driving;Li;Inf. Sci.,2020
5. Negation scope detection for sentiment analysis: A reinforcement learning framework for replicating human interpretations;Pröllochs;Inf. Sci.,2020