1. Non-stationary Markov decision processes, a worst-case approach using model-based reinforcement learning, extended version;lecarpentier;arXiv:1904.10090,2020
2. Bayesian online changepoint detection;adams;arXiv:0710.3742,2007
3. Analyzing Alibaba cloud’s preemptible instance pricing;davidow,2021
4. BCORLE(λ): An offline reinforcement learning and evaluation framework for coupons allocation in e-commerce market;zhang;Proc Adv Neural Inf Process Syst,2021
5. Real-time dynamic pricing in a non-stationary environment using model-free reinforcement learning