1. Agarwal, A., Kakade, S. M., Lee, J., & Mahajan, G. (2020). Optimality and approximation with policy gradient methods in markov decision processes. Conference on learning theory, Graz, Austria.
2. Almahamid, F., & Grolinger, K. (2021, 9). Reinforcement learning algorithms: An overview and classification. Canadian Conference on Electrical and Computer Engineering , 2021- September, Toronto, Canada.
3. Axtell, R. L. (2014). An agent-based model of the housing market bubble in metropolitan. SSRN.
4. Bae, J. W., Paik, E., Dongoh, K., Jung, J., & Lee, C. H. (2019, 1). Simulation framework for self-evolving agent-based models: A case study of housing market model. Proceedings - Winter Simulation Conference, Gothenburg, Sweden, 2018–December, (pp. 1120–1131).