A Deep Learning Approach for Arabic Manuscripts Classification

Published:2023-09-28 Issue:19 Volume:23 Page:8133
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Al-homed Lutfieh S.¹, Jambi Kamal M.¹, Al-Barhamtoshy Hassanin M.²

Affiliation:

1. Department of Computer Science, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia

2. Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia

Abstract

For centuries, libraries worldwide have preserved ancient manuscripts because of their immense historical and cultural value. Over time, however, natural and human-made factors have degraded many ancient Arabic manuscripts, causing the loss of significant information such as authorship, titles, or subjects and leaving them as unknown manuscripts. The catalog cards attached to these manuscripts might contain some of the missing details, but the cards themselves have degraded significantly over decades of library storage. This paper presents a framework for identifying these unknown ancient Arabic manuscripts by processing the catalog cards associated with them. Because the cards are degraded, simple optical character recognition (OCR) is often insufficient. The proposed framework uses a deep learning architecture to identify unknown manuscripts within a collection of ancient Arabic documents. It locates, extracts, and classifies the text on the catalog cards through region-of-interest identification, rotation correction, feature extraction, and classification. The proposed method achieves an accuracy of 92.5%, compared with 83.5% for classical image classification and 81.5% for OCR alone.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/19/8133/pdf

References: 46 articles.

1. Comprehensive synthetic Arabic database for on/off-line script recognition research;Saabni;Int. J. Doc. Anal. Recognit. (IJDAR),2013

2. Automatic processing of Historical Arabic Documents: A comprehensive survey;Khedher;Pattern Recognit.,2020

3. Two hitherto unknown Arabic Euclid manuscripts;Hist. Math.,2015

4. Al-homed, L.S., Jambi, K.M., and Al-Barhamtoshy, H.M. (2022, January 12–13). A Novel Dataset for Known and Unknown Ancient Arabic Manuscripts. Proceedings of the 2022 20th International Conference on Language Engineering (ESOLEC), Cairo, Egypt.

5. Al-Maadeed, S., AIKadiry, M., Shaar, M., and Alja’am, J.M. (2018, January 25–26). A Mobile System for Historical Manuscripts Capturing, Recognition and Classification. Proceedings of the 2018 International Conference on Computer and Applications (ICCA), Beirut, Lebanon.

Cited by 1 article.

1. End-to-End Deep Learning Framework for Arabic Handwritten Legal Amount Recognition and Digital Courtesy Conversion;Mathematics;2024-07-19