Affiliation:
1. College of Management and Economy , Tianjin University , Tianjin , , China .
Abstract
Abstract
In the context of the rapid advancement of big data and artificial intelligence, there has been an unprecedented surge in text-based information. This proliferation necessitates the development of efficient and accurate techniques for text summarization. This paper addresses this need by articulating the challenges associated with text summarization and key information extraction. We introduce a novel model that integrates multi-task learning with an attention mechanism to enhance the summarization and extraction of long texts. Furthermore, we establish a loss function for the model, calibrated against the discrepancy observed during the training phase. Empirical evaluations were conducted through simulated experiments after pre-processing the data via the proposed extraction model. These evaluations indicate that the model achieves optimal performance in the iterative training range of 55 to 65. When benchmarked against comparative models, our model demonstrates superior performance in extracting long text summaries and key information, evidenced by the metrics on the Daily Mail dataset (mean scores: 40.19, 16.42, 35.48) and the Gigaword dataset (mean scores: 34.38, 16.21, 31.38). Overall, the model developed in this study proves to be highly effective and practical in extracting long text summaries and key information, thereby significantly enhancing the efficiency of processing textual data.
Reference17 articles.
1. Kamin, S.T.·Lang, F.R.·Beyer, & A. (2017). Subjective technology adaptivity predicts technology use in old age. Gerontology.
2. Mei, B., Brown, G. T. L., & Teo, T. (2018). Toward an understanding of preservice english as a foreign language teachers’ acceptance of computer-assisted language learning 2.0 in the people’s republic of china. Journal of Educational Computing Research, 073563311770014.
3. Annuncy, V., & Joseph, P. (2023). New frontiers in linguistic research: eliminating the challenges of understanding the genetics of language through bioinformatics. Digital Scholarship in the Humanities(4), 4.
4. Mark, B., Raskutti, G., & Willett, R. (2018). Network estimation from point process data. IEEE Transactions on Information Theory, 1-1.
5. Wang, P., Lv, H., Zheng, X., Ma, W., & Wang, W. (2023). Validity analysis of network big data. Journal of web engineering(3), 22.