Transformer-Based Neural Network Machine Translation Model for the Kurdish Sorani Dialect-Reference-Cited by-同舟云学术

Transformer-Based Neural Network Machine Translation Model for the Kurdish Sorani Dialect

Published:2023-01-15 Issue:1 Volume:7 Page:15-21
ISSN:2521-4217
Container-title:UHD Journal of Science and Technology
language:
Short-container-title:UHD J SCI TECH

Author:

Badawi Soran

Abstract

The transformer model is one of the most recently developed models for translating texts into another language. The model uses the principle of attention mechanism, surpassing previous models, such as sequence-to-sequence, in terms of performance. It performed well with highly resourced English, French, and German languages. Using the model architecture, we investigate training the modified version of the model in a low-resourced language such as the Kurdish language. This paper presents the first-ever transformer-based neural machine translation model for the Kurdish language by utilizing vocabulary dictionary units that share vocabulary across the dataset. For this purpose, we combine all the existing parallel corpora of Kurdish – English by building a large corpus and training it on the proposed transformer model. The outcome indicated that the suggested transformer model works well with Kurdish texts by scoring (0.45) on bilingual evaluation understudy (BLEU). According to the BLEU standard, the score indicates a high-quality translation.

Publisher

University of Human Development

Subject

General Medicine

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. English Translation Technology Based on Transformer Model;2024 International Conference on Machine Intelligence and Digital Applications;2024-05-30

2. KurdiSent: a corpus for kurdish sentiment analysis;Language Resources and Evaluation;2024-01-02

3. KurdSum: A new benchmark dataset for the Kurdish text summarization;Natural Language Processing Journal;2023-12

4. Data Augmentation For Sorani Kurdish News Headline Classification Using Back-Translation And Deep Learning Model;Kurdistan Journal of Applied Research;2023-06-30