1. MultiWOZ–A large-scale multi-domain Wizard-of-Oz dataset for task-oriented dialogue modelling;Budzianowski,2018
2. Deep reinforcement learning in a handful of trials using probabilistic dynamics models;Chua;Adv. Neural Inf. Process. Syst.,2018
3. Den Hengst, F., Hoogendoorn, M., Van Harmelen, F., Bosman, J., 2019. Reinforcement learning for personalized dialogue management. In: IEEE/WIC/ACM International Conference on Web Intelligence. pp. 59–67.
4. Domain-adversarial training of neural networks;Ganin;J. Mach. Learn. Res.,2016
5. Reinforcement learning with deep energy-based policies;Haarnoja,2017