Soloist: BuildingTask Bots at Scale with Transfer Learning and Machine Teaching

Author:

Peng Baolin1,Li Chunyuan2,Li Jinchao3,Shayandeh Shahin4,Liden Lars5,Gao Jianfeng6

Affiliation:

1. Microsoft Research, Redmond, United States. bapeng@microsoft.com

2. Microsoft Research, Redmond, United States. chunyl@microsoft.com

3. Microsoft Research, Redmond, United States. jincli@microsoft.com

4. Microsoft Research, Redmond, United States. shahins@microsoft.com

5. Microsoft Research, Redmond, United States. lars.liden@microsoft.com

6. Microsoft Research, Redmond, United States. jfgao@microsoft.com

Abstract

Abstract We present a new method, Soloist,1 that uses transfer learning and machine teaching to build task bots at scale. We parameterize classical modular task-oriented dialog systems using a Transformer-based auto-regressive language model, which subsumes different dialog modules into a single neural model. We pre-train, on heterogeneous dialog corpora, a task-grounded response generation model, which can generate dialog responses grounded in user goals and real-world knowledge for task completion. The pre-trained model can be efficiently adapted to accomplish new tasks with a handful of task-specific dialogs via machine teaching, where training samples are generated by human teachers interacting with the system. Experiments show that (i)Soloist creates new state-of-the-art on well-studied task-oriented dialog benchmarks, including CamRest676 and MultiWOZ; (ii) in the few-shot fine-tuning settings, Soloist significantly outperforms existing methods; and (iii) the use of machine teaching substantially reduces the labeling cost of fine-tuning. The pre-trained models and codes are available at https://aka.ms/soloist.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication

Reference81 articles.

1. Towards a human-like open-domain chatbot;Adiwardana;arXiv preprint arXiv:2001.09977,2020

2. Plato: Pre-trained dialogue generation model with discrete latent variable;Bao,2020

3. Rasa: Open source language understanding and dialogue management;Bocklisch;CoRR,2017

4. Hello, it’s GPT-2-How can I help you? Towards the use of pretrained language models for task-oriented dialogue systems;Budzianowski,2019

5. Multiwoz-a large-scale multi-domain wizard-of-oz dataset for task-oriented dialogue modelling;Budzianowski,2018

Cited by 34 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3