MizBERT: A Mizo BERT Model

Authors:

Robert Lalramhluna (1), Sandeep Dash (1), Partha Pakray (2)

Affiliations:

1. Computer Science & Engineering, National Institute of Technology Mizoram, Aizawl, India

2. Computer Science & Engineering, National Institute of Technology Silchar, Silchar, India

Abstract

This research investigates the use of pre-trained BERT transformers for the Mizo language. BERT (Bidirectional Encoder Representations from Transformers) is Google's transformer-based approach to Natural Language Processing (NLP), renowned for its strong performance across a wide range of NLP tasks. Its effectiveness for low-resource languages such as Mizo, however, remains largely unexplored. In this study, we introduce MizBERT, a specialized Mizo language model. Through extensive pre-training on a corpus collected from diverse online platforms, MizBERT is tailored to the nuances of the Mizo language. We evaluate MizBERT on two intrinsic measures, masked-language-modeling accuracy and perplexity, on which it scores 76.12% and 3.2565, respectively, and additionally examine its performance on a text classification task. Results indicate that MizBERT outperforms both the multilingual BERT (mBERT) model and a Support Vector Machine (SVM) baseline, achieving an accuracy of 98.92%. This underscores MizBERT's proficiency in capturing the intricacies of the Mizo language.
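The two intrinsic metrics above can be illustrated with a short sketch using the Hugging Face Transformers API. The model identifier, helper names, and the pseudo-perplexity formulation below are illustrative assumptions, not the authors' released code: pseudo-perplexity (masking each token in turn and averaging its negative log-likelihood) is one common way to score a masked language model and may differ from the exact perplexity computation used in the paper.

```python
# Illustrative sketch (not the authors' code): probing a Mizo masked language
# model for mask-filling predictions and a pseudo-perplexity score.
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

# Assumed Hugging Face hub id for a released checkpoint; substitute as needed.
MODEL_ID = "robzchhangte/MizBERT"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForMaskedLM.from_pretrained(MODEL_ID)
model.eval()

def top_mask_predictions(text: str, k: int = 5):
    """Return the k most likely fillers for the [MASK] token in `text`."""
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    # Locate the [MASK] token in the input sequence.
    mask_pos = (inputs["input_ids"] == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
    probs = logits[0, mask_pos[0]].softmax(dim=-1)
    top = probs.topk(k)
    return [(tokenizer.convert_ids_to_tokens(int(i)), float(p))
            for i, p in zip(top.indices, top.values)]

def pseudo_perplexity(text: str) -> float:
    """Mask each token in turn, average its negative log-likelihood under
    the model, and exponentiate (one common MLM scoring scheme)."""
    ids = tokenizer(text, return_tensors="pt")["input_ids"][0]
    nlls = []
    for i in range(1, len(ids) - 1):  # skip the [CLS] and [SEP] specials
        masked = ids.clone()
        masked[i] = tokenizer.mask_token_id
        with torch.no_grad():
            logits = model(masked.unsqueeze(0)).logits
        log_probs = logits[0, i].log_softmax(dim=-1)
        nlls.append(-float(log_probs[ids[i]]))
    return float(torch.exp(torch.tensor(sum(nlls) / len(nlls))))

# Usage, with a Mizo sentence containing exactly one [MASK] token:
# top_mask_predictions("... [MASK] ...")
# pseudo_perplexity("...")
```

For the extrinsic comparison with mBERT and the SVM baseline, the same checkpoint could be fine-tuned with a classification head (e.g. via AutoModelForSequenceClassification) on labeled Mizo text; the details of that setup are in the paper itself.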

Publisher

Association for Computing Machinery (ACM)
