Abstractive Summarization of Text Document in Malayalam Language: Enhancing Attention Model Using POS Tagging Feature-Reference-Cited by-同舟云学术

Abstractive Summarization of Text Document in Malayalam Language: Enhancing Attention Model Using POS Tagging Feature

Published:2023-02-28 Issue:2 Volume:22 Page:1-14
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

K. Nambiar Sindhya¹^ORCID,Peter S. David²^ORCID,Mary Idicula Sumam³^ORCID

Affiliation:

1. Department of Computer Science, Cochin University of Science and Technology, Kerala, India

2. School of Engineering, Cochin University of Science and Technology, Kerala, India

3. Department of Computer Science and Engineering, Muthoot Institute of Technology and Science, Ernakulam, Kerala, India

Abstract

Over the past few years, researchers are showing huge interest in sentiment analysis and summarization of documents. The primary reason being that huge volumes of information are available in textual format, and this data has proven helpful for real-world applications and challenges. The sentiment analysis of a document will help the user comprehend the content’s emotional intent. Abstractive summarization algorithms generate a condensed version of the text, which can then be used to determine the emotion represented in the text using sentiment analysis. Recent research in abstractive summarization concentrates on neural network-based models, rather than conjunctions-based approaches, which might improve the overall efficiency. Neural network models like attention mechanism are tried out to handle complex works with promising results. The proposed work aims to present a novel framework that incorporates the part of speech tagging feature to the word embedding layer, which is then used as the input to the attention mechanism. With POS feature being part of the input layer, this framework is capable of dealing with words containing contextual and morphological information. The relevance of POS tagging here is due to its strong reliance on the language’s syntactic, contextual, and morphological information. The three main elements in the work are pre-processing, POS tagging feature in the embedding phase, and the incorporation of it into the attention mechanism. The word embedding provides the semantic concept about the word, while the POS tags give an idea about how significant the words are in the context of the content, which corresponds to the syntactic information. The proposed work was carried out in Malayalam, one of the prominent Indian languages. A widely used and accepted dataset from the English language was translated to Malayalam for conducting the experiments. The proposed framework gives a ROUGE score of 28, which outperformed the baseline models.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3561819

Reference28 articles.

1. A. P. Ajees and Sumam Mary Idicula. 2018. A POS tagger for Malayalam using conditional random fields. Int. J. Appl. Eng. Res. 13 3 (2018).

2. Leveraging Linguistic Structure For Open Domain Information Extraction

3. Dzmitry Bahdanau Kyunghyun Cho and Yoshua Bengio. 2014. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473 (2014).

4. Lidong Bing Piji Li Yi Liao Wai Lam Weiwei Guo and Rebecca J. Passonneau. 2015. Abstractive multi-document summarization via phrase selection and merging. arXiv preprint arXiv:1506.01597 (2015).

5. Deep communicating agents for abstractive summarization;Celikyilmaz Asli;arXiv preprint arXiv:1803.10357,2018