Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic-Reference-Cited by-同舟云学术

Exploring the Performance of Tagging for the Classical and the Modern Standard Arabic

Published:2019-01-23 Issue: Volume:2019 Page:1-10
ISSN:1687-7101
Container-title:Advances in Fuzzy Systems
language:en
Short-container-title:Advances in Fuzzy Systems

Author:

AbuZeina Dia¹^ORCID,Abdalbaset Taqieddin Mostafa²

Affiliation:

1. College of Information Technology and Computer Engineering, Palestine Polytechnic University, Hebron, State of Palestine

2. Palestine Technical University–Kadoorie, AL-Aroub Branch, Hebron, State of Palestine

Abstract

The part of speech (PoS) tagging is a core component in many natural language processing (NLP) applications. In fact, the PoS taggers contribute as a preprocessing step in various NLP tasks, such as syntactic parsing, information extraction, machine translation, and speech synthesis. In this paper, we examine the performance of a modern standard Arabic (MSA) based tagger for the classical (i.e., traditional or historical) Arabic. In this work, we employed the Stanford Arabic model tagger to evaluate the imperative verbs in the Holy Quran. In fact, the Stanford tagger contains 29 tags; however, this work experimentally evaluates just one that is the VB ≡ imperative verb. The testing set contains 741 imperative verbs, which appear in 1,848 positions in the Holy Quran. Despite the previously reported accuracy of the Arabic model of the Stanford tagger, which is 96.26% for all tags and 80.14% for unknown words, the experimental results show that this accuracy is only 7.28% for the imperative verbs. This result promotes the need for further research to expose why the tagging is severely inaccurate for classical Arabic. The performance decline might be an indication of the necessity to distinguish between training data for both classical and MSA Arabic for NLP tasks.

Funder

Palestine Polytechnic University

Publisher

Hindawi Limited

Subject

Computational Mathematics,Control and Optimization,Control and Systems Engineering

Link

http://downloads.hindawi.com/journals/afs/2019/6254649.pdf

Reference19 articles.

1. Towards a standard Part of Speech tagset for the Arabic language

2. Toward enhanced Arabic speech recognition using part of speech tagging

3. Automatic Labeling of Semantic Roles

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A BERT Based Approach for Arabic POS Tagging;Advances in Computational Intelligence;2021