Tpn: Triple Network Algorithm for Deep Reinforcement Learning
Authors:
Han Chen, Wang Xuanyin
References (26 articles):
1. Learning to predict by the methods of temporal differences; R. S. Sutton; Machine Learning, 1988
2. Q-learning; C. J. C. H. Watkins; Machine Learning, 1992
3. Application of reinforcement learning to the game of Othello; N. J. Van Eck; Computers & Operations Research, 2008