Single Document Text Summarization Based on the Modified Cat Swarm Optimization (Mcso) Algorithm

Author:

Rautaray Jyotirmayee1,Panigrahi Sangram1,Nayak Ajit Kumar1

Affiliation:

1. Siksha “O” Anusandhan (Deemed to be University)

Abstract

Abstract In today's digital age, the World Wide Web has provided us with an immense volume of online information. Consequently, the challenge of extracting relevant insights from this vast data has emerged. Recently, text summarization has gained recognition as a solution for distilling valuable orderas of extensive credentials. Depending on the number of credentialsmeasured in favor of summarization is partitioned as single-document and multi-document summarization, which is a complex challenge for researchers in achieving accurate summaries. This research methodology, considered single document summarization by using the following steps they are text-pre-processing, Feature extraction, vectorization, and Modified Cat Swarm Optimization (MCSO) algorithm. Initially, the input documents are pre-processed for cleaning the data and they contain sentence segmentation, word tokenization, stop word removal, and lemmatization. In feature extraction, a score of the sentence is computed using Time Frequency-Inverse Domain Frequency (TF-IDF) also then words are formed toward a vector. After that, the generated vectors are post-processed, and Finally, the Modified Cat Swarm Optimization (MCSO) algorithm is utilized toprovideasummary of the single document with its fitness function. The performance of the model was compared with existing approaches such as Spherical, Rastrigin, and Griewank. To assess their effectiveness, we use benchmark datasets of the Document Understanding Conference (DUC) and evaluate algorithms using diverse metrics such as ROUGE score, F score, and summary accuracy, as well as best-case, worst-case, and average-case analyses. The experimental analysis demonstrates that Rosenbork surpasses other summarization methods included in the study.

Publisher

Research Square Platform LLC

Reference45 articles.

1. Telugu text summarization using histo fuzzy c-means and Median Support Based Grasshopper Optimization Algorithm (MSGOA);DURGA CBV;J. Theoretical Appl. Inform. Technol.,2022

2. Mohd, M., Jan, R., Shah, M.: Text document summarization using word embedding. Expert Systems with Applications, 143, p.112958. (2020)

3. Nawaz, A., Bakhtyar, M., Baber, J., Ullah, I., Noor, W., Basit, A.: Extractive text summarization models for Urdu language. Information Processing & Management, 57(6), p.102383. (2020)

4. Manjari, K.U., Rousha, S., Sumanth, D., Devi, J.S.: June. Extractive Text Summarization from Web pages using Selenium and TF-IDF algorithm. In 2020 4th International Conference on Trends in Electronics and Informatics (ICOEI)(48184) (pp. 648–652). IEEE. (2020)

5. Rani, R., Lobiyal, D.K.: A weighted word embedding based approach for extractive text summarization. Expert Systems with Applications, 186, p.115867. (2021)

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3