Reinforcement learning
Author:
Ahmadi Mohammadali
Reference131 articles.
1. Abbeel, P., Ng, A.Y. (2004). Apprenticeship learning via inverse reinforcement learning. Proceedings of the Twenty-First International Conference on Machine Learning, 1–8, Association for Computing Machinery (ACM), United States. 2. Al-Alwani, M.A., Dunn-Norman, S., Britt, L.K., Alkinani, H.H., Al-Hameedi, A.T.T., Al-Attar, A.M., …Al-Bazzaz, W.H. (2019). Production performance evaluation from stimulation and completion parameters in the Permian Basin: Data mining approach. SPE/AAPG/SEG Asia Pacific Unconventional Resources Technology Conference (URTEC), United States. https://www.onepetro.org/conferences/URTEC/19APUR. 3. Alshiekh, M., Bloem, R., Ehlers, R., Könighofer, B., Niekum, S., Topcu, U. (2018). Safe reinforcement learning via shielding. 32nd AAAI Conference on Artificial Intelligence, AAAI Press, United States. https://aaai.org/Library/AAAI/aaai18contents.php. 4. Utilization of artificial neural networks and the TD-learning method for constructing intelligent decision support systems;Baba;European Journal of Operational Research,2000 5. Dynamic programming;Bellman,1957
|
|