Computing similarity between items in a digital library of cultural heritage-Reference-Cited by-同舟云学术

Computing similarity between items in a digital library of cultural heritage

Published:2012-12 Issue:4 Volume:5 Page:1-19
ISSN:1556-4673
Container-title:Journal on Computing and Cultural Heritage
language:en
Short-container-title:J. Comput. Cult. Herit.

Author:

Aletras Nikolaos¹,Stevenson Mark¹,Clough Paul¹

Affiliation:

1. The University of Sheffield

Abstract

Large amounts of cultural heritage content have now been digitized and are available in digital libraries. However, these are often unstructured and difficult to navigate. Automatic techniques for identifying similar items in these collections could be used to improve navigation since it would allow items that are implicitly connected to be linked together and allow sets of similar items to be clustered. Europeana is a large digital library containing more than 20 million digital objects from a set of cultural heritage providers throughout Europe. The diverse nature of this collection means that the items do not have standard metadata to assist navigation. A range of methods for computing the similarity between pairs of texts are applied to metadata records in Europeana in order to estimate the similarity between items. Various methods for computing similarity have been proposed and can be classified into two main approaches: (1) knowledge-based, which make use of external knowledge sources and (2) corpus-based approaches, which rely on analyzing the frequency distributions of words in documents. Both techniques are evaluated against manual judgements obtained for this study and a multiple-choice test created from manually generated categories in cultural heritage collections. We find that a combination of corpus and knowledge-based approaches provide the best results in both experiments.

Funder

Seventh Framework Programme

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design,Computer Science Applications,Information Systems,Conservation

Link

https://dl.acm.org/doi/pdf/10.1145/2399180.2399184

Reference56 articles.

1. Proceedings of the 1st Joint Conference on Lexical and Computational Semantics --;Agirre E.

2. Understanding cultural heritage experts' information seeking needs

3. Inter-Coder Agreement for Computational Linguistics

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Context-Aware Querying, Geolocalization, and Rephotography of Historical Newspaper Images;Applied Sciences;2022-11-01

2. Exploring digital cultural heritage through browsing;DIGIT RES ARTS HUM;2022

3. Object Spotting in Historical Documents;Digital Techniques for Heritage Presentation and Preservation;2021

4. Survey and Analysis of Interactive Art Documentation, 1979–2017;Leonardo;2019-06

5. Figure spotting in Indian heritage image;Journal of Cultural Heritage;2018-07