Funder
National Natural Science Foundation of China
National Key Research and Development Program of China
Fundamental Research Funds for the Central Universities
Subject
Artificial Intelligence,Information Systems and Management,Management Information Systems,Software
Reference71 articles.
1. Human-level control through deep reinforcement learning;Mnih;Nature,2015
2. Trust region policy optimization;Schulman,2015
3. Mastering the game of go with deep neural networks and tree search;Silver;Nature,2016
4. Continuous control with deep reinforcement learning;Lillicrap,2015
5. High-dimensional continuous control using generalized advantage estimation;Schulman,2015
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Trajectory-Oriented Policy Optimization with Sparse Rewards;2024 2nd International Conference on Intelligent Perception and Computer Vision (CIPCV);2024-05-17