1. Handbook of Markov Decision Processes
2. Nicolas Gast, Bruno Gaujal, and Kimang Khun. Computing Whittle (and Gittins) index in subcubic time. arXiv preprint arXiv:2203.05207, 2022.
3. Martin L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc., USA, 2nd edition, 2005.
4. The Functional Equations of Undiscounted Markov Renewal Programming
5. Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press, 2018.