1. Specious reward: A behavioral theory of impulsiveness and impulse control.
2. Ainslie, George. 1992. “Picoeconomics: The Strategic Interaction of Successive Motivational States within the Person.” Cambridge University Press.
3. Hyperbolically Discounted Temporal Difference Learning
4. Amrouni, Selim, Aymeric Moulin, Jared Vann, Svitlana Vyetrenko, Tucker Balch, and Manuela Veloso. 2021. “ABIDES-Gym: Gym Environments for Multi-Agent Discrete Event Simulation and Application to Financial Markets.” In Proceedings of the Second ACM International Conference on AI in Finance, 1–9.
5. Asadi, Kavosh, and Michael L. Littman. 2017. “An Alternative Softmax Operator for Reinforcement Learning.” In International Conference on Machine Learning, 243–252. PMLR. http://proceedings.mlr.press/v70/asadi17a/asadi17a.pdf.