Soloist: BuildingTask Bots at Scale with Transfer Learning and Machine Teaching-Reference-Cited by-同舟云学术

Soloist: BuildingTask Bots at Scale with Transfer Learning and Machine Teaching

Published:2021 Issue: Volume:9 Page:807-824
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
language:en
Short-container-title:

Author:

Peng Baolin¹,Li Chunyuan²,Li Jinchao³,Shayandeh Shahin⁴,Liden Lars⁵,Gao Jianfeng⁶

Affiliation:

1. Microsoft Research, Redmond, United States. bapeng@microsoft.com

2. Microsoft Research, Redmond, United States. chunyl@microsoft.com

3. Microsoft Research, Redmond, United States. jincli@microsoft.com

4. Microsoft Research, Redmond, United States. shahins@microsoft.com

5. Microsoft Research, Redmond, United States. lars.liden@microsoft.com

6. Microsoft Research, Redmond, United States. jfgao@microsoft.com

Abstract

Abstract We present a new method, Soloist,1 that uses transfer learning and machine teaching to build task bots at scale. We parameterize classical modular task-oriented dialog systems using a Transformer-based auto-regressive language model, which subsumes different dialog modules into a single neural model. We pre-train, on heterogeneous dialog corpora, a task-grounded response generation model, which can generate dialog responses grounded in user goals and real-world knowledge for task completion. The pre-trained model can be efficiently adapted to accomplish new tasks with a handful of task-specific dialogs via machine teaching, where training samples are generated by human teachers interacting with the system. Experiments show that (i)Soloist creates new state-of-the-art on well-studied task-oriented dialog benchmarks, including CamRest676 and MultiWOZ; (ii) in the few-shot fine-tuning settings, Soloist significantly outperforms existing methods; and (iii) the use of machine teaching substantially reduces the labeling cost of fine-tuning. The pre-trained models and codes are available at https://aka.ms/soloist.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Link

http://direct.mit.edu/tacl/article-pdf/doi/10.1162/tacl_a_00399/1955175/tacl_a_00399.pdf

Reference81 articles.

1. Towards a human-like open-domain chatbot;Adiwardana;arXiv preprint arXiv:2001.09977,2020

2. Plato: Pre-trained dialogue generation model with discrete latent variable;Bao,2020

3. Rasa: Open source language understanding and dialogue management;Bocklisch;CoRR,2017

4. Hello, it’s GPT-2-How can I help you? Towards the use of pretrained language models for task-oriented dialogue systems;Budzianowski,2019

5. Multiwoz-a large-scale multi-domain wizard-of-oz dataset for task-oriented dialogue modelling;Budzianowski,2018

Cited by 34 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dialogue agents 101: a beginner’s guide to critical ingredients for designing effective conversational systems;Natural Language Processing;2024-09-09

2. STOD: Towards Scalable Task-Oriented Dialogue System on MultiWOZ-API;Applied Sciences;2024-06-19

3. KMc-ToD: Structure knowledge enhanced multi-copy network for task-oriented dialogue system;Knowledge-Based Systems;2024-06

4. OSTOD: One-Step Task-Oriented Dialogue with activated state and retelling response;Knowledge-Based Systems;2024-06

5. Dialogue summarization enhanced response generation for multi-domain task-oriented dialogue systems;Information Processing & Management;2024-05