Does learning from language family help? A case study on a low-resource question-answering task

Authors:

Hariom A. Pandya, Brijesh S. Bhatt

Abstract

Multilingual pre-trained models make it possible to develop natural language processing (NLP) applications for low-resource languages (LRLs) by transferring from models of resource-rich languages (RRLs). However, the structural characteristics of the target language can affect task-specific learning. In this paper, we investigate the influence of structural diversity among languages on overall system performance. Specifically, we propose a customized approach that leverages task-specific data from a low-resource language family via transfer learning from an RRL. Our findings are based on question-answering experiments with the XLM-R, mBERT, and IndicBERT transformer models on Indic languages (Hindi, Bengali, and Telugu). On the XQuAD-Hindi dataset, few-shot learning with Bengali improves the benchmark mBERT (F1/EM) score by +(10.86/7.87) and the XLM-R score by +(3.84/4.42); few-shot learning with Telugu improves the mBERT score by +(10.42/7.36) and the XLM-R score by +(3.04/2.72). In addition, our model demonstrates benchmark-compatible performance in a zero-shot setup with single-epoch task learning. The approach can be adapted to other NLP tasks for LRLs.
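The recipe behind these numbers is a two-stage fine-tuning pipeline: the multilingual encoder first learns the QA task from the RRL, is then adapted on a small amount of data from a language in the target's family, and is finally evaluated on the target LRL. Below is a minimal sketch of that setup using the Hugging Face Transformers and Datasets libraries, not the authors' exact pipeline; the choice of English SQuAD for stage 1, the TyDiQA Bengali slice and 500-example budget for stage 2, and all hyperparameters are illustrative assumptions.

```python
# A minimal sketch of the two-stage transfer recipe described above.
# Dataset choices and sample sizes are assumptions for illustration,
# not the paper's exact configuration.
from datasets import load_dataset
from transformers import (
    AutoModelForQuestionAnswering,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
    default_data_collator,
)

BASE = "bert-base-multilingual-cased"  # mBERT; use "xlm-roberta-base" for XLM-R

tokenizer = AutoTokenizer.from_pretrained(BASE)
model = AutoModelForQuestionAnswering.from_pretrained(BASE)

# Stage 1 data: the resource-rich language (English SQuAD).
squad = load_dataset("squad", split="train")

# Stage 2 data: a small sample from a related Indic language. The Bengali
# slice of TyDiQA's gold-passage task is an assumption; any SQuAD-format
# Bengali set would do.
bengali = (
    load_dataset("tydiqa", "secondary_task", split="train")
    .filter(lambda ex: ex["id"].startswith("bengali"))
    .select(range(500))  # "few-shot": a few hundred examples
)

def preprocess(examples):
    """Tokenize question+context pairs and map answer spans to token indices."""
    enc = tokenizer(
        examples["question"],
        examples["context"],
        truncation="only_second",
        max_length=384,
        padding="max_length",
        return_offsets_mapping=True,
    )
    starts, ends = [], []
    for i, offsets in enumerate(enc["offset_mapping"]):
        answer = examples["answers"][i]
        start_char = answer["answer_start"][0]
        end_char = start_char + len(answer["text"][0])
        seq_ids = enc.sequence_ids(i)
        ctx_start = seq_ids.index(1)                          # first context token
        ctx_end = len(seq_ids) - 1 - seq_ids[::-1].index(1)   # last context token
        if offsets[ctx_start][0] > start_char or offsets[ctx_end][1] < end_char:
            starts.append(0)  # answer truncated away: point at [CLS]
            ends.append(0)
        else:
            tok = ctx_start
            while tok <= ctx_end and offsets[tok][0] <= start_char:
                tok += 1
            starts.append(tok - 1)
            tok = ctx_end
            while tok >= ctx_start and offsets[tok][1] >= end_char:
                tok -= 1
            ends.append(tok + 1)
    enc["start_positions"] = starts
    enc["end_positions"] = ends
    enc.pop("offset_mapping")
    return enc

def train_on(dataset, epochs, out_dir):
    tokenized = dataset.map(preprocess, batched=True,
                            remove_columns=dataset.column_names)
    args = TrainingArguments(output_dir=out_dir, num_train_epochs=epochs,
                             per_device_train_batch_size=16,
                             learning_rate=3e-5, report_to="none")
    Trainer(model=model, args=args, train_dataset=tokenized,
            data_collator=default_data_collator).train()

train_on(squad, epochs=1, out_dir="stage1-en")    # single-epoch task learning (RRL)
train_on(bengali, epochs=3, out_dir="stage2-bn")  # few-shot family transfer

# Zero-shot evaluation would then run on the target LRL, e.g.:
#   xquad_hi = load_dataset("xquad", "xquad.hi", split="validation")
```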

Publisher

Cambridge University Press (CUP)

