1. El Asri L, Khouzaimi H, Laroche R, Pietquin O (2014) Ordinal regression for interaction quality prediction. In: Proceedings of ICASSP. IEEE, pp 3245–3249
2. El Asri L, Laroche R, Pietquin O (2012) Reward function learning for dialogue management. In: Proceedings of the 6th STAIRS. IOS Press, pp 95–106
3. El Asri L, Laroche R, Pietquin O (2013) Reward shaping for statistical optimisation of dialogue management. In: Statistical language and speech processing. Springer, pp 93–101
4. Gašić M, Breslin C, Henderson M, Kim D, Szummer M, Thomson B, Tsiakoulis P, Young SJ (2013) On-line policy optimisation of Bayesian spoken dialogue systems via human interaction. In: Proceedings of ICASSP. IEEE, pp 8367–8371
5. Lee S, Eskenazi M (2012) An unsupervised approach to user simulation: toward self-improving dialog systems. In: Proceedings of 13th SIGDial. ACL, pp 50–59