Affiliation:
1. Department of Mathematics, Computer Science and Physics, Università degli Studi di Udine, Via delle Scienze 206, 33100 Udine, Italy
2. Department of Humanities and Cultural Heritage, Università degli Studi di Udine, Vicolo Florio 2/b, 33100 Udine, Italy
Abstract
Over the years, the humanities community has increasingly requested the creation of artificial intelligence frameworks to help the study of cultural heritage. Document Layout segmentation, which aims at identifying the different structural components of a document page, is a particularly interesting task connected to this trend, specifically when it comes to handwritten texts. While there are many effective approaches to this problem, they all rely on large amounts of data for the training of the underlying models, which is rarely possible in a real-world scenario, as the process of producing the ground truth segmentation task with the required precision to the pixel level is a very time-consuming task and often requires a certain degree of domain knowledge regarding the documents at hand. For this reason, in this paper, we propose an effective few-shot learning framework for document layout segmentation relying on two novel components, namely a dynamic instance generation and a segmentation refinement module. This approach is able of achieving performances comparable to the current state of the art on the popular Diva-HisDB dataset, while relying on just a fraction of the available data.
Funder
Piano Nazionale di Ripresa e Resilienza
Publisher
World Scientific Pub Co Pte Ltd
Subject
Computer Networks and Communications,General Medicine
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献