1. Apprenticeship learning via inverse reinforcement learning
2. Jacob Abernethy, Peter L Bartlett, and Elad Hazan. 2011. Blackwell approachability and no-regret learning are equivalent. In Proceedings of the 24th Annual Conference on Learning Theory. 27--46.
3. M Mehdi Afsar, Trafford Crump, and Behrouz Far. 2021. Reinforcement learning based recommender systems: A survey. ACM Computing Surveys (CSUR) (2021).
4. Budget pacing for targeted online advertisements at LinkedIn