1. Adith Swaminathan and Thorsten Joachims Counterfactual Risk Minimization. ICWWW 2015. 10.1145/2740908.2742564 Adith Swaminathan and Thorsten Joachims Counterfactual Risk Minimization . ICWWW 2015. 10.1145/2740908.2742564
2. Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms
3. Miroslav Dudik John Langford Lihong Li Doubly Robust Policy Evaluation and Learning. ICML 2011. Miroslav Dudik John Langford Lihong Li Doubly Robust Policy Evaluation and Learning . ICML 2011.