Ensemble Text Summarization Model for COVID-19-Associated Datasets-Reference-Cited by-同舟云学术

Ensemble Text Summarization Model for COVID-19-Associated Datasets

Published:2023-12-11 Issue: Volume:2023 Page:1-16
ISSN:1098-111X
Container-title:International Journal of Intelligent Systems
language:en
Short-container-title:International Journal of Intelligent Systems

Author:

Chellatamilan T.¹^ORCID,Narayanasamy Senthil Kumar²^ORCID,Garg Lalit³^ORCID,Srinivasan Kathiravan¹^ORCID,Islam Sardar M. N.⁴^ORCID

Affiliation:

1. School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632014, India

2. School of Computer Science Engineering and Information Systems, Vellore Institute of Technology, Vellore 632014, India

3. Faculty of Information and Communication Technology, University of Malta, Msida MSD2080, Malta

4. ISILC, Decision Sciences and Modelling Program, Victoria University, Footscray, Australia

Abstract

The work of text summarization in question-and-answer systems has gained tremendous popularity recently and has influenced numerous real-world applications for efficient decision-making processes. In this regard, the exponential growth of COVID-19-related healthcare records has necessitated the extraction of fine-grained results to forecast or estimate the potential course of the disease. Machine learning and deep learning models are frequently used to extract relevant insights from textual data sources. However, in order to summarize the textual information relevant to coronavirus, we have concentrated on a number of natural language processing (NLP) models in this research, including Bidirectional Encoder Representations of Transformers (BERT), Sequence-to-Sequence, and Attention models. This ensemble model is built on the previously mentioned models, which primarily concentrate on the segmented context terms included in the textual input. Most crucially, this research has concentrated on two key variations: grouping-related sentences using hierarchical clustering approaches and the distributional semantics of the terms found in the COVID-19 dataset. The gist evaluation (ROUGE) score result shows a significant and respectable accuracy of 0.40 average recalls.

Funder

Victoria University

Publisher

Hindawi Limited

Subject

Artificial Intelligence,Human-Computer Interaction,Theoretical Computer Science,Software

Link

http://downloads.hindawi.com/journals/ijis/2023/3106631.pdf

Reference57 articles.

1. Wikihow: a large scale text summarization dataset;M. Koupaee,2018

2. A deep reinforced model for abstractive summarization;R. Paulus,2017

3. Toward abstractive summarization using semantic representations;F. Liu,2018

4. Fine-tune BERT for extractive summarization;Y. Liu,2019

5. word2vec Explained: deriving Mikolov et al.'s negative-sampling word- embedding method;Y. Goldberg,2014

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploring Recent Advances and Applications Across Sectors: A Natural Language Processing Perspective;Smart Innovation, Systems and Technologies;2024