Author:
Wang Xianjia,yang zhipeng,Chen Guici,Liu Yanli
Reference54 articles.
1. Human-level control through deep reinforcement learning;V Mnih;Nature,2015
2. Recurrent prediction model for partially observable mdps;S Xie;Inf. Sci,2023
3. Infinite horizon markov decision processes with unknown or variable discount factors;D White;Eur. J. Oper. Res,1987
4. A machine learning-enabled partially observable markov decision process framework for early sepsis prediction;Z Liu;INFORMS J. Comput,2022
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献