Author:
Al-Maleh Molham, Desouki Said
Abstract
Natural language processing has witnessed remarkable progress with the advent of deep learning techniques. Text summarization, along with other tasks such as machine translation and sentiment analysis, uses deep neural network models to improve results. Recent text summarization methods follow the sequence-to-sequence framework of the encoder–decoder model, which is composed of neural networks trained jointly on both input and output. Deep neural networks take advantage of large datasets to improve their results. These networks are supported by the attention mechanism, which handles long texts more efficiently by identifying focus points in the text. They are also supported by the copy mechanism, which allows the model to copy words directly from the source into the summary. In this research, we re-implement the basic summarization model that applies the sequence-to-sequence framework to the Arabic language, to which this model has not previously been applied for text summarization. First, we build an Arabic dataset of summarized article headlines. This dataset consists of approximately 300 thousand entries, each consisting of an article introduction and its corresponding headline. We then apply baseline summarization models to this dataset and compare the results using the ROUGE metric.
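To make the evaluation step concrete, the following is a minimal sketch of a ROUGE-1 score (unigram overlap between a generated headline and a reference headline). It is an illustration only, not the authors' evaluation code: the function name `rouge_1` is hypothetical, and whitespace tokenization is an assumption that real Arabic text would likely replace with proper normalization and tokenization.

```python
from collections import Counter


def rouge_1(candidate: str, reference: str) -> dict:
    """Compute ROUGE-1 precision, recall, and F1 from unigram overlap.

    Assumption: tokens are obtained by splitting on whitespace.
    """
    cand_tokens = candidate.split()
    ref_tokens = reference.split()

    # Clipped overlap: each unigram counts at most as often as it
    # appears in both the candidate and the reference.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())

    precision = overlap / len(cand_tokens) if cand_tokens else 0.0
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0

    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    scores = rouge_1("the cat sat", "the cat sat on the mat")
    print(scores)
```

Recall measures how much of the reference headline is covered by the generated one, while precision penalizes extra words; reporting both alongside F1 is a common way to compare summarization models.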
Publisher
Springer Science and Business Media LLC
Subject
Information Systems and Management, Computer Networks and Communications, Hardware and Architecture, Information Systems
Cited by
40 articles.
1. A systematic literature review of deep learning-based text summarization: Techniques, input representation, training strategies, mechanisms, datasets, evaluation, and challenges;Expert Systems with Applications;2024-10
2. Stacked Denoising Variational Auto Encoder Model for Extractive Web Text Summarization;Iranian Journal of Science and Technology, Transactions of Electrical Engineering;2024-09-13
3. Review of ambiguity problem in text summarization using hybrid ACA and SLR;Intelligent Systems with Applications;2024-06
4. TOPICScore: Evaluating Automatic Text Summarization Using Embeddings, Occurrences, and Topic Detection;2024 4th International Conference on Innovative Research in Applied Science, Engineering and Technology (IRASET);2024-05-16
5. Extractive Arabic Text Summarization Using PageRank and Word Embedding;Arabian Journal for Science and Engineering;2024-04-18