Affiliation:
1. Institute of Informatics and Telematics, National Research Council, 56124 Pisa, Italy
2. Department of Civilisation and Forms of Knowledge, University of Pisa, 56100 Pisa, Italy
Abstract
The Jewish community archive in Pisa owns a vast collection of documents and manuscripts that date back centuries. These documents contain valuable genealogical information, including birth, marriage, and death records. This paper describes the preliminary results of the Archivio Storico della Comunita Ebraica di Pisa (ASCEPI) project, focusing on the extraction of data from the Nati, Morti e Ballottati (NMB) Registry held in the archive. The NMB Registry contains about 1900 records of births, deaths, and balloted individuals within the Jewish community in Pisa. The study uses a semiautomatic pipeline of digitization, transcription, and Natural Language Processing (NLP) techniques to extract personal data such as names, surnames, birth and death dates, and parental names from each record. The extracted data are then used to build a knowledge base and a genealogical tree for a representative family, Supino. This study demonstrates the potential of NLP and rule-based techniques for extracting valuable information from historical documents and constructing genealogical trees.
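As a rough illustration of the rule-based extraction step mentioned in the abstract, the following minimal sketch parses transcribed records with a regular expression and groups children under a parent. It is not the authors' pipeline: the record layout, the `RECORD_PATTERN` rule, the `Person` fields, and the placeholder names are all assumptions made for this example, and real NMB Registry entries may be structured quite differently.

```python
import re
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional

# Hypothetical record layout, invented for illustration only; it is not
# taken from the NMB Registry.
RECORD_PATTERN = re.compile(
    r"(?P<name>[A-Z][a-z]+) (?P<surname>[A-Z][a-z]+), "
    r"child of (?P<father>[A-Z][a-z]+) and (?P<mother>[A-Z][a-z]+), "
    r"born (?P<birth>\d{4}-\d{2}-\d{2})"
)


@dataclass(frozen=True)
class Person:
    name: str
    surname: str
    father: str
    mother: str
    birth: str


def parse_record(text: str) -> Optional[Person]:
    """Return a Person if the rule matches, otherwise None."""
    match = RECORD_PATTERN.search(text)
    if match is None:
        return None
    return Person(**match.groupdict())


def build_children_index(people):
    """Map (father's name, surname) to the names of that father's children."""
    index = defaultdict(list)
    for person in people:
        index[(person.father, person.surname)].append(person.name)
    return dict(index)


# Placeholder records with made-up names, not real registry data.
records = [
    "Beta Example, child of Alpha and Gamma, born 1800-01-02",
    "Delta Example, child of Alpha and Gamma, born 1802-03-04",
    "a line the rule cannot parse",
]

# Keep only the records the rule could parse.
people = [p for p in map(parse_record, records) if p is not None]
index = build_children_index(people)
print(index)
```

Records that the rule cannot parse are skipped here rather than guessed at; in a semiautomatic setting such records would presumably be passed to a human for review.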
Subject
Computer Networks and Communications, Human-Computer Interaction, Communication