How to Tell Ancient Signs Apart? Recognizing and Visualizing Maya Glyphs with CNNs-Reference-Cited by-同舟云学术

How to Tell Ancient Signs Apart? Recognizing and Visualizing Maya Glyphs with CNNs

Published:2018-12-29 Issue:4 Volume:11 Page:1-25
ISSN:1556-4673
Container-title:Journal on Computing and Cultural Heritage
language:en
Short-container-title:J. Comput. Cult. Herit.

Author:

Can Gülcan¹,Odobez Jean-Marc¹,Gatica-Perez Daniel¹

Affiliation:

1. Idiap Research Institute and École Polytechnique Fédérale de Lausanne (EPFL), Switzerland

Abstract

Thanks to the digital preservation of cultural heritage materials, multimedia tools (e.g., based on automatic visual processing) considerably ease the work of scholars in the humanities and help them to perform quantitative analysis of their data. In this context, this article assesses three different Convolutional Neural Network (CNN) architectures along with three learning approaches to train them for hieroglyph classification, which is a very challenging task due to the limited availability of segmented ancient Maya glyphs. More precisely, the first approach, the baseline, relies on pretrained networks as feature extractor. The second one investigates a transfer learning method by fine-tuning a pretrained network for our glyph classification task. The third approach considers directly training networks from scratch with our glyph data. The merits of three different network architectures are compared: a generic sequential model (i.e., LeNet), a sketch-specific sequential network (i.e., Sketch-a-Net), and the recent Residual Networks. The sketch-specific model trained from scratch outperforms other models and training strategies. Even for a challenging 150-class classification task, this model achieves 70.3% average accuracy and proves itself promising in case of a small amount of cultural heritage shape data. Furthermore, we visualize the discriminative parts of glyphs with the recent Grad-CAM method, and demonstrate that the discriminative parts learned by the model agree, in general, with the expert annotation of the glyph specificity (diagnostic features). Finally, as a step toward systematic evaluation of these visualizations, we conduct a perceptual crowdsourcing study. Specifically, we analyze the interpretability of the representations from Sketch-a-Net and ResNet-50. Overall, our article takes two important steps toward providing tools to scholars in the digital humanities: increased performance for automation and improved interpretability of algorithms.

Funder

Hasler Foundation through the DCrowdLens project

Swiss National Science Foundation through the MAAYA project

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design,Computer Science Applications,Information Systems,Conservation

Link

https://dl.acm.org/doi/pdf/10.1145/3230670

Reference46 articles.

1. Multi-Task CNN Model for Attribute Prediction

2. Evaluating Shape Representations for Maya Glyph Classification

3. Maya Codical Glyph Segmentation: A Crowdsourcing Approach

4. Shape Representations for Maya Codical Glyphs

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The State of Pilot Study Reporting in Crowdsourcing: A Reflection on Best Practices and Guidelines;Proceedings of the ACM on Human-Computer Interaction;2024-04-17

2. LanT: finding experts for digital calligraphy character restoration;Multimedia Tools and Applications;2024-01-18

3. Digital Restoration of Cultural Heritage With Data-Driven Computing: A Survey;IEEE Access;2023

4. Deep Segmentation of Corrupted Glyphs;Journal on Computing and Cultural Heritage;2022-01-22

5. A deep neural network based framework for restoring the damaged persian pottery via digital inpainting;Journal of Computational Science;2021-11