1. Achbany, Youssef, et al. 2005. Managing the exploration/exploitation trade-off in reinforcement learning. Technical Paper, Information Systems Research Unit (ISYS), IAG, Université Catholique de Louvain.
2. Barbier, Thibault, et al. 2018. Product-Closing Approximation for Nonparametric Choice Network Revenue Management. arXiv:1805.10537.
3. Belobaba, P.P., and C. Hopperstad. 1999. Boeing/MIT simulation study: PODS results update. In 1999 AGIFORS Reservations and Yield Management Study Group Symposium, April.
4. Bondoux, Nicolas, et al. 2020. Reinforcement learning applied to airline revenue management. Journal of Revenue and Pricing Management: 1–17.
5. Borooah, Vani Kant. 2002. Logit and probit: Ordered and multinomial models, Vol. 138. New York: Sage.