Extracting the roots of Arabic words without removing affixes-Reference-Cited by-同舟云学术

Extracting the roots of Arabic words without removing affixes

Published:2014-03-10 Issue:3 Volume:40 Page:376-385
ISSN:0165-5515
Container-title:Journal of Information Science
language:en
Short-container-title:Journal of Information Science

Author:

Yaseen Qussai¹,Hmeidi Ismail²

Affiliation:

1. Yarmouk University, Jordan

2. Jordan University of Science and Technology, Jordan

Abstract

Most research in Arabic roots extraction focuses on removing affixes from Arabic words. This process adds processing overhead and may remove non-affix letters, which leads to the extraction of incorrect roots. This paper advises a new approach to dealing with this issue by introducing a new algorithm for extracting Arabic words’ roots. The proposed algorithm, which is called the Word Substring Stemming Algorithm, does not remove affixes during the extraction process. Rather, it is based on producing the set of all substrings of an Arabic word, and uses the Arabic roots file, the Arabic patterns file and a concrete set of rules to extract correct roots from substrings. The experiments have shown that the proposed approach is competitive and its accuracy is 83.9%, Furthermore, its accuracy can be enhanced more in the sense that, for about 9.9% of the tested words, the WSS algorithm retrieves two candidates (in most cases) for the correct root.

Publisher

SAGE Publications

Subject

Library and Information Sciences,Information Systems

Link

http://journals.sagepub.com/doi/pdf/10.1177/0165551514526348

Reference22 articles.

1. Arabic stemming without a root dictionary

2. A Novel Algorithm to Extract Tri-Literal Arabic Roots

3. Towards an error-free Arabic stemming

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Tasaheel: An Arabic Automative Textual Analysis Tool—All in One;IEEE Access;2023

2. Socio-Semantic Information Retrieval of Structured Arabic Texts;2022 IEEE/ACS 19th International Conference on Computer Systems and Applications (AICCSA);2022-12

3. Meta-search based approach for Arabic information retrieval;Online Information Review;2022-02-25

4. Title-Based Document Classification for Arabic Theses and Dissertations;Advances in Data and Information Sciences;2022

5. Text mining: A survey of Arabic root extraction algorithms;International Journal of ADVANCED AND APPLIED SCIENCES;2021-01