Abstract
This paper addresses how artificial intelligence can transform museum provenance records into structured, machine-readable data, a critical first step toward large-scale, cross-institutional analysis of object histories. Drawing on research in natural language processing (NLP), we identify sentence boundary disambiguation and span categorization as highly effective techniques for extracting and structuring information from provenance texts. We focus on a provenance-specific annotation scheme that retains historical nuance when constructing provenance linked open data (PLOD).