Challenges and opportunities for Arabic question-answering systems: current techniques and future directions-Reference-Cited by-同舟云学术

Challenges and opportunities for Arabic question-answering systems: current techniques and future directions

Published:2023-10-20 Issue: Volume:9 Page:e1633
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Alrayzah Asmaa¹²,Alsolami Fawaz¹,Saleh Mostafa¹

Affiliation:

1. Faculty of Computing and Information Technology, King Abdulaziz University, Makkah, Jeddah, Saudi Arabia

2. College of Computer Science and Information Systems, Najran University, Najran, Najran, Saudi Arabia

Abstract

Artificial intelligence-based question-answering (QA) systems can expedite the performance of various tasks. These systems either read passages and answer questions given in natural languages or if a question is given, they extract the most accurate answer from documents retrieved from the internet. Arabic is spoken by Arabs and Muslims and is located in the middle of the Arab world, which encompasses the Middle East and North Africa. It is difficult to use natural language processing techniques to process modern Arabic owing to the language’s complex morphology, orthographic ambiguity, regional variations in spoken Arabic, and limited linguistic and technological resources. Only a few Arabic QA experiments and systems have been designed on small datasets, some of which are yet to be made available. Although several reviews of Arabic QA studies have been conducted, the number of studies covered has been limited and recent trends have not been included. To the best of our knowledge, only two systematic reviews focused on Arabic QA have been published to date. One covered only 26 primary studies without considering recent techniques, while the other covered only nine studies conducted for Holy Qur’an QA systems. Here, the included studies were analyzed in terms of the datasets used, domains covered, types of Arabic questions asked, information retrieved, the mechanism used to extract answers, and the techniques used. Based on the results of the analysis, several limitations, concerns, and recommendations for future research were identified. Additionally, a novel taxonomy was developed to categorize the techniques used based on the domains and approaches of the QA system.

Publisher

PeerJ

Subject

General Computer Science

Link

https://peerj.com/articles/cs-1633.pdf

Reference128 articles.

1. Deep learning-based question answering: a survey;Abdel-Nabi;Knowledge and Information Systems,2022

2. Farasa: a fast and furious segmenter for arabic;Abdelali,2016

3. Pre-training BERT on arabic tweets: practical considerations;Abdelali,2021

4. Al-Bayan: an arabic question answering system for the Holy Qur’an;Abdelnasser,2014

5. ARBERT & MARBERT: deep bidirectional transformers for arabic;Abdul-Mageed,2021

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. AraFast: Developing and Evaluating a Comprehensive Modern Standard Arabic Corpus for Enhanced Natural Language Processing;Applied Sciences;2024-06-19