Long Text Summarization and Key Information Extraction in a Multi-Task Learning Framework

Author:

Lu Ming1,Chen Rongfa1

Affiliation:

1. College of Management and Economy , Tianjin University , Tianjin , , China .

Abstract

Abstract In the context of the rapid advancement of big data and artificial intelligence, there has been an unprecedented surge in text-based information. This proliferation necessitates the development of efficient and accurate techniques for text summarization. This paper addresses this need by articulating the challenges associated with text summarization and key information extraction. We introduce a novel model that integrates multi-task learning with an attention mechanism to enhance the summarization and extraction of long texts. Furthermore, we establish a loss function for the model, calibrated against the discrepancy observed during the training phase. Empirical evaluations were conducted through simulated experiments after pre-processing the data via the proposed extraction model. These evaluations indicate that the model achieves optimal performance in the iterative training range of 55 to 65. When benchmarked against comparative models, our model demonstrates superior performance in extracting long text summaries and key information, evidenced by the metrics on the Daily Mail dataset (mean scores: 40.19, 16.42, 35.48) and the Gigaword dataset (mean scores: 34.38, 16.21, 31.38). Overall, the model developed in this study proves to be highly effective and practical in extracting long text summaries and key information, thereby significantly enhancing the efficiency of processing textual data.

Publisher

Walter de Gruyter GmbH

Reference17 articles.

1. Kamin, S.T.·Lang, F.R.·Beyer, & A. (2017). Subjective technology adaptivity predicts technology use in old age. Gerontology.

2. Mei, B., Brown, G. T. L., & Teo, T. (2018). Toward an understanding of preservice english as a foreign language teachers’ acceptance of computer-assisted language learning 2.0 in the people’s republic of china. Journal of Educational Computing Research, 073563311770014.

3. Annuncy, V., & Joseph, P. (2023). New frontiers in linguistic research: eliminating the challenges of understanding the genetics of language through bioinformatics. Digital Scholarship in the Humanities(4), 4.

4. Mark, B., Raskutti, G., & Willett, R. (2018). Network estimation from point process data. IEEE Transactions on Information Theory, 1-1.

5. Wang, P., Lv, H., Zheng, X., Ma, W., & Wang, W. (2023). Validity analysis of network big data. Journal of web engineering(3), 22.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3