1. Reinforcement Learning based Recommender Systems: A Survey
2. XGBoost
3. Ching-An Cheng, Xinyan Yan, and Byron Boots. 2020. Trajectory-wise control variates for variance reduction in policy gradient methods. In Conference on Robot Learning. PMLR, 1379–1394.
4. Zihan Ding, Pablo Hernandez-Leal, Gavin Weiguang Ding, Changjian Li, and Ruitong Huang. 2020. Cdt: Cascading decision trees for explainable reinforcement learning. arXiv preprint arXiv:2011.07553 (2020).
5. Jianqing Fan Zhaoran Wang Yuchen Xie and Zhuoran Yang. 2020. A theoretical analysis of deep Q-learning. In Learning for Dynamics and Control. PMLR 486–489.