Clustering-based Sequence to Sequence Model for Generative Question Answering in a Low-resource Language-Reference-Cited by-同舟云学术

Clustering-based Sequence to Sequence Model for Generative Question Answering in a Low-resource Language

Published:2022-12-27 Issue:2 Volume:22 Page:1-14
ISSN:2375-4699
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
language:en
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.

Author:

Bidgoly Amir Jalaly¹^ORCID,Amirkhani Hossein¹^ORCID,Baradaran Razieh¹^ORCID

Affiliation:

1. Department of Information Technology and Computer Engineering, University of Qom, Qom, Iran

Abstract

Despite the impressive success of sequence to sequence models for generative question answering, they need a vast amount of question-answer pairs during training, which is hard and expensive to obtain, especially for low-resource languages. In this article, we present a framework that exploits the semantic clusters among the question-answer pairs to compensate for the lack of enough training data. In the training phase, the question-answer pairs are clustered, and a cluster predictor is trained to identify the cluster each question belongs to. Then, a sequence to sequence model is trained, where there is a different generator for each cluster in the decoder component. During the test phase, the cluster of the input question is first identified using the trained cluster predictor, and the appropriate decoder is exploited. Our experiments on a Persian religious dataset show that the proposed method outperforms the standard sequence to sequence model by a large margin in terms of ROUGE and BLEU scores. This is traced back to the lower number of words in each cluster, leading to a reduction in the number of effective parameters each generator needs to learn, which help the model learn from fewer training data with less overfitting.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3563036

Reference53 articles.

1. A question answering system in hadith using linguistic knowledge

2. FarsTail: A Persian natural language inference dataset;Amirkhani Hossein;arXiv preprint arXiv:2009.08820,2020

3. Multi-Document Answer Generation for Non-Factoid Questions

4. Incorporating External Knowledge into Machine Reading for Generative Question Answering

5. Latent Dirichlet allocation;Blei David M.;J. Mach. Learn. Res.,2003