Affiliation:
1. School of Computer Science, Yangtze University, Jingzhou 434000, China
Abstract
Headline generation aims to condense key information from an article or a document into a concise one-sentence summary. The Transformer structure is in general effective for such tasks, yet it suffers from a dramatic increase in training time and GPU consumption as the input text length grows. To address this problem, a hybrid attention mechanism is proposed. Both local and global semantic information among words are modeled in a way that significantly improves training efficiency, especially for long text. Effectiveness is not sacrificed; in fact, fluency and semantic coherence of the generated headlines are enhanced. Experimental results on an open benchmark dataset show that, compared to the baseline model’s best performance, the proposed model obtains a 14.7%, 16.7%, 14.4% and 9.1% increase in the F1 values of the ROUGE-1, the ROUGE-2, the ROUGE-L and the ROUGE-WE metrics, respectively. The semantic coherence of the generated text is also improved, as shown by a 2.8% improvement in the BERTScore’s F1 value. These results show that the effectiveness of the proposed headline generation model with the hybrid attention mechanism is also improved. The hybrid attention mechanism could provide references for relevant text generation tasks.
Funder
2021 Higher Education Research Program of the Educational Commission of Hubei Province of P. R. China
Reference44 articles.
1. Lee, S.-H., Choi, S.-W., and Lee, E.-B. (2023). A Question-Answering Model Based on Knowledge Graphs for the General Provisions of Equipment Purchase Orders for Steel Plants Maintenance. Electronics, 12.
2. Ahmad, P.N., Liu, Y., Khan, K., Jiang, T., and Burhan, U. (2023). BIR: Biomedical Information Retrieval System for Cancer Treatment in Electronic Health Record Using Transformers. Sensors, 23.
3. Lu, Y., Liu, Q., Dai, D., Xiao, X., Lin, H., Han, X., Sun, L., and Wu, H. (2022, January 22–27). Unified Structure Generation for Universal Information Extraction. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, Dublin, Ireland.
4. High Quality Information Extraction and Query-Oriented Summarization for Automatic Query-Reply in Social Network;Peng;Expert Syst. Appl.,2016
5. Sakurai, T., and Utsumi, A. (2004). Query-Based Multidocument Summarization for Information Retrieval. Proceedings of the NTCIR-4, National Institute of Informatics.