Applying Deep Learning Algorithms for Automatic Recognition and Transcription of Texts in Oracle Bones and Golden Texts
Author:
Qiao Yingjie1, Xing Lizhi2
Affiliation:
1. School of Art and Design, Luoyang University of Science and Technology, Research Center of Chinese Historical Civilization Inheritance and Innovation, Luoyang University of Science and Technology , Luoyang , Henan , , China . 2. The Yellow River Civilization and the Sustainable Development of Henan University Research Center, Research Center of Chinese Historical Civilization Inheritance and Innovation , Luoyang University of Science and Technology , Luoyang , Henan , , China .
Abstract
Abstract
This paper explores applying deep learning techniques for automatically recognizing and transcribing oracle bone and gold texts. We significantly enhance model recognition efficiency by leveraging the power of Generative Adversarial Networks (GANs) for image data enhancement and the Pix2Pix model for text repair. Our approach integrates the ResNet50 model for robust feature extraction with unsupervised domain adaptation, utilizing multiple pseudo labels to achieve efficient text recognition and transcription. We improve the model’s repair capabilities by generating hard-to-distinguish sample data through GANs and employing a U-Net-based text repair model enhanced with dense connectivity and spectral normalization. Further, combining ResNet50 for feature extraction and advanced domain adaptation techniques strengthens the model’s generalization. Our results on the Oracle dataset show an increase in recognition accuracy from 82% to 94.5%, highlighting the effectiveness of our image enhancement strategies. The ResNet50 extractor outperforms others across various Intersection over Union (IoU) metrics, establishing its feature extraction superiority. In real-world scenarios, testing with a combined Oracle and Jinwen dataset yields a recognition accuracy above 80%, demonstrating our model’s ability to effectively fulfill the recognition task. This research underscores the potential of deep learning algorithms in automating the recognition and transcription of ancient texts, offering a novel solution that significantly boosts recognition accuracy through a synergistic blend of image enhancement, feature extraction, and domain adaptation techniques.
Publisher
Walter de Gruyter GmbH
Reference18 articles.
1. Liu, K., Wu, X., Guo, Z., Yuan, S., Ding, X., & Fu, D., et al. (2021). Radiocarbon dating of oracle bones of late shang period in ancient china. Radiocarbon(1), 63. 2. Deng, F., & Wen, X. (2017). New interpretations of yi in a group of the oracle-bone inscriptions. Journal of Chinese Linguistics, 45(2), 394-422. 3. Jones, A. (2021). Three gallo-roman bronze disks with astral inscriptions:. Journal for the History of Astronomy, 52(4), 381-396. 4. Xie, Z., Huang, Y., Jin, L., Liu, Y., Zhu, Y., & Gao, L., et al. (2019). Weakly supervised precise segmentation for historical document images. Neurocomputing, 350(JUL.20), 271-281. 5. Gao, J., & Liang, X. (2020). Distinguishing oracle variants based on the isomorphism and symmetry invariances of oracle-bone inscriptions. IEEE Access, PP(99), 1-1.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Research on Oracle Bone Inscription Segmentation and Recognition Model Based on Deep Learning;2024 IEEE 4th International Conference on Electronic Technology, Communication and Information (ICETCI);2024-05-24
|
|