Authors
Zhong Ming, Liu Yang, Xu Yichong, Zhu Chenguang, Zeng Michael
Abstract
Dialogue is an essential part of human communication and cooperation. Existing research mainly focuses on short dialogue scenarios in a one-on-one fashion. However, multi-person interactions in the real world, such as meetings or interviews, are frequently over a few thousand words. There is still a lack of corresponding research and powerful tools to understand and process such long dialogues. Therefore, in this work, we present a pre-training framework for long dialogue understanding and summarization. Considering the nature of long conversations, we propose a window-based denoising approach for generative pre-training. For a dialogue, it corrupts a window of text with dialogue-inspired noise, and guides the model to reconstruct this window based on the content of the remaining conversation. Furthermore, to process longer input, we augment the model with sparse attention which is combined with conventional attention in a hybrid manner. We conduct extensive experiments on five datasets of long dialogues, covering tasks of dialogue summarization, abstractive question answering and topic segmentation. Experimentally, we show that our pre-trained model DialogLM significantly surpasses the state-of-the-art models across datasets and tasks. Source code and all the pre-trained models are available on our GitHub repository (https://github.com/microsoft/DialogLM).
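The window-based denoising idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not enumerate the noise types, so the two used here (masking speaker names and shuffling turn order inside the window) are assumptions chosen for illustration, and the function name `corrupt_window`, the `[MASK]` token, and the toy dialogue are all hypothetical. The sketch only shows the data side: one window of turns is corrupted, the rest of the conversation is left intact as context, and the original window becomes the reconstruction target.

```python
import random


def corrupt_window(turns, start, size, rng):
    """Corrupt one window of a dialogue and return (noisy_dialogue, target).

    turns: list of (speaker, utterance) tuples.
    Only turns[start:start + size] is corrupted; everything else is kept
    unchanged so a model could use it as context for reconstruction.
    """
    window = turns[start:start + size]

    # Illustrative noise 1 (assumption): hide speaker identities in the window.
    noisy_window = [("[MASK]", utterance) for _, utterance in window]

    # Illustrative noise 2 (assumption): shuffle turn order inside the window.
    rng.shuffle(noisy_window)

    noisy_dialogue = turns[:start] + noisy_window + turns[start + size:]
    return noisy_dialogue, window


# Toy dialogue (hypothetical data).
dialogue = [
    ("A", "Shall we start the meeting?"),
    ("B", "Yes, the agenda is the budget."),
    ("C", "I have the numbers ready."),
    ("A", "Great, go ahead."),
    ("C", "We are ten percent over."),
]

rng = random.Random(0)
noisy, target = corrupt_window(dialogue, start=1, size=3, rng=rng)

for speaker, utterance in noisy:
    print(f"{speaker}: {utterance}")
```

In a pre-training setup along these lines, `noisy` would be serialized as the encoder input and `target` as the decoder output, so the model learns to restore the window from the surrounding conversation.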
Publisher
Association for the Advancement of Artificial Intelligence (AAAI)
Cited by
22 articles.
1. Baichuan2-Sum: Instruction Finetune Baichuan2-7B Model for Dialogue Summarization;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30
2. Dynamic Multi-Scale Context Aggregation for Conversational Aspect-Based Sentiment Quadruple Analysis;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14
3. A Multi-party Conversational Social Robot Using LLMs;Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction;2024-03-11
4. Adapter-Based Selective Knowledge Distillation for Federated Multi-Domain Meeting Summarization;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2024
5. T4S: Two-Stage Screenplay Synopsis Summary Generation with Turning Points;Lecture Notes in Computer Science;2024