Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence,Computer Science Applications
Reference32 articles.
1. Achiam, J.: Spinning up in deep reinforcement learning (2018)
2. Chan, T.F., Golub, G.H., LeVeque, R.J.: Updating formulae and a pairwise algorithm for computing sample variances. In: Caussinus, H., Ettinger, P., Tomassone, R. (eds.) COMPSTAT 1982 5th Symposium Held at Toulouse 1982, vol. 1, pp. 30–41. Physica-Verlag HD, Heidelberg (1982)
3. Cheng, R., Orosz, G., Murray, R. M., Burdick, J. W.: End-to-end safe reinforcement learning through barrier functions for safety-critical continuous control tasks (2019). arXiv:1903.08792 [cs, stat]
4. Dong, X., Yu, B., Shi, Z., Zhong, Y.: Time-varying formation control for unmanned aerial vehicles: theories and applications. IEEE Trans. Contr. Syst. Technol. 23(1), 340–348 (2015). https://doi.org/10.1109/TCST.2014.2314460
5. Hausknecht, M., Stone, P.: Deep recurrent Q-learning for partially observable MDP (2017). arXiv:1507.06527 [cs]
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献