Machine Learning Based Assembly of Fragments of Ancient Papyrus-Reference-Cited by-同舟云学术

Machine Learning Based Assembly of Fragments of Ancient Papyrus

Published:2021-07 Issue:3 Volume:14 Page:1-21
ISSN:1556-4673
Container-title:Journal on Computing and Cultural Heritage
language:en
Short-container-title:J. Comput. Cult. Herit.

Author:

Abitbol Roy¹^ORCID,Shimshoni Ilan¹,Ben-Dov Jonathan²

Affiliation:

1. Department of Information Systems, University of Haifa, Israel

2. Department of Biblical Studies, Tel Aviv University, Israel

Abstract

The task of assembling fragments in a puzzle-like manner into a composite picture plays a significant role in the field of archaeology as it supports researchers in their attempt to reconstruct historic artifacts. In this article, we propose a method for matching and assembling pairs of ancient papyrus fragments containing mostly unknown scriptures. Papyrus paper is manufactured from papyrus plants and therefore portrays typical thread patterns resulting from the plant’s stems. The proposed algorithm is founded on the hypothesis that these thread patterns contain unique local attributes such that nearby fragments show similar patterns reflecting the continuations of the threads. We posit that these patterns can be exploited using image processing and machine learning techniques to identify matching fragments. The algorithm and system which we present support the quick and automated classification of matching pairs of papyrus fragments as well as the geometric alignment of the pairs against each other. The algorithm consists of a series of steps and is based on deep-learning and machine learning methods. The first step is to deconstruct the problem of matching fragments into a smaller problem of finding thread continuation matches in local edge areas (squares) between pairs of fragments. This phase is solved using a convolutional neural network ingesting raw images of the edge areas and producing local matching scores. The result of this stage yields very high recall but low precision. Thus, we utilize these scores in order to conclude about the matching of entire fragments pairs by establishing an elaborate voting mechanism. We enhance this voting with geometric alignment techniques from which we extract additional spatial information. Eventually, we feed all the data collected from these steps into a Random Forest classifier in order to produce a higher order classifier capable of predicting whether a pair of fragments is a match. Our algorithm was trained on a batch of fragments which was excavated from the Dead Sea caves and is dated circa the 1st century BCE. The algorithm shows excellent results on a validation set which is of a similar origin and conditions. We then tried to run the algorithm against a real-life set of fragments for which we have no prior knowledge or labeling of matches. This test batch is considered extremely challenging due to its poor condition and the small size of its fragments. Evidently, numerous researchers have tried seeking matches within this batch with very little success. Our algorithm performance on this batch was sub-optimal, returning a relatively large ratio of false positives. However, the algorithm was quite useful by eliminating 98% of the possible matches thus reducing the amount of work needed for manual inspection. Indeed, experts that reviewed the results have identified some positive matches as potentially true and referred them for further investigation.

Funder

Deutsche-Israelische Projektkooperation

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design,Computer Science Applications,Information Systems,Conservation

Link

https://dl.acm.org/doi/pdf/10.1145/3460961

Reference46 articles.

1. Pairwise matching of 3D fragments using fast fourier transform

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Computational techniques for virtual reconstruction of fragmented archaeological textiles;Heritage Science;2023-12-13

2. Machine Learning for Ancient Languages: A Survey;Computational Linguistics;2023

3. Assembling Fragments of Ancient Papyrus via Artificial Intelligence;Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering;2023

4. A hybrid 3D object auto-completion approach with self-supervised data augmentation for fragments of archaeological objects;Journal of Cultural Heritage;2022-07

5. A Comparative Study on Reassembly of Image Fragments;Algorithms for Intelligent Systems;2022