Chain-of-event prompting for multi-document summarization by large language models

Published:2024-02-15 Issue:3 Volume:20 Page:229-247
ISSN:1744-0084
Container-title:International Journal of Web Information Systems
language:en
Short-container-title:IJWIS

Author:

Bao Songlin, Li Tiantian, Cao Bin

Abstract

Purpose

In the era of big data, industries generate large amounts of text data every day. Simplifying and summarizing these data can serve users effectively and improve efficiency. Recently, zero-shot prompting of large language models (LLMs) has demonstrated remarkable performance on various language tasks. However, generating a very “concise” multi-document summary remains difficult for this approach. Even when conciseness is specified in a zero-shot prompt, the generated multi-document summary still contains some unimportant information, and this persists even with few-shot prompting. This paper aims to propose an LLM prompting method for the multi-document summarization task.

Design/methodology/approach

To overcome this challenge, the authors propose chain-of-event (CoE) prompting for the multi-document summarization (MDS) task. This prompting is centered on events and follows a four-step summary reasoning process: specific event extraction; event abstraction and generalization; common event statistics; and summary generation. To further improve the performance of LLMs, the authors extend CoE prompting with an example of summary reasoning.

Findings

Summaries generated by CoE prompting are more abstractive, concise and accurate. The authors evaluate their proposed prompting on two data sets. Experimental results with ChatGLM2-6b show that CoE prompting consistently outperforms other typical prompting methods on both data sets.

Originality/value

This paper proposes CoE prompting for solving MDS tasks with LLMs. CoE prompting not only identifies the key events but also ensures the conciseness of the summary. With this method, users can access the most relevant and important information quickly, improving their decision-making processes.
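The four-step reasoning process named in the abstract could be expressed as a prompt template along the following lines. This is only a minimal sketch: the step names are taken from the abstract, but the instruction wording, the `COE_STEPS` list, and the `build_coe_prompt` helper are illustrative assumptions and are not the prompt used by the authors.

```python
from typing import List

# Step names follow the abstract. The instruction text attached to each step
# is an illustrative assumption, not the wording used in the paper.
COE_STEPS = [
    ("Specific event extraction",
     "List the specific events reported in each document."),
    ("Event abstraction and generalization",
     "Rewrite each specific event as a more general, abstract event."),
    ("Common event statistics",
     "Count how many documents mention each abstract event."),
    ("Summary generation",
     "Write a concise summary that covers only the most common events."),
]


def build_coe_prompt(documents: List[str]) -> str:
    """Assemble a zero-shot, CoE-style prompt (hypothetical wording)."""
    if not documents:
        raise ValueError("at least one document is required")
    # Number the source documents so later steps can refer to them.
    doc_block = "\n\n".join(
        f"Document {i}:\n{text.strip()}"
        for i, text in enumerate(documents, start=1)
    )
    # Render the four reasoning steps as an ordered instruction list.
    step_block = "\n".join(
        f"Step {i}. {name}: {instruction}"
        for i, (name, instruction) in enumerate(COE_STEPS, start=1)
    )
    return (
        f"{doc_block}\n\n"
        "Summarize the documents above by reasoning through these steps in order:\n"
        f"{step_block}\n\n"
        "Summary:"
    )
```

The resulting string would be passed to whichever LLM is being evaluated. The extended variant mentioned in the abstract would presumably prepend a worked example of the four steps to the same template, although the abstract does not give its exact form.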

Publisher

Emerald
