The Archaeotools project: faceted classification and natural language processing in an archaeological context

Published:2009-06-28 Issue:1897 Volume:367 Page:2507-2519
ISSN:1364-503X
Container-title:Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences
language:en
Short-container-title:Phil. Trans. R. Soc. A.

Author:

Jeffrey S.¹, Richards J.¹, Ciravegna F.², Waller S.¹, Chapman S.², Zhang Z.²

Affiliation:

1. Archaeology Data Service, Department of Archaeology, The King's Manor, University of York, York YO1 7EP, UK

2. Web Intelligence Technologies Laboratory, Natural Language Processing Group, Department of Computer Science, University of Sheffield, Sheffield S1 4DP, UK

Abstract

This paper describes ‘Archaeotools’, a major e-Science project in archaeology. The aim of the project is to use faceted classification and natural language processing to create an advanced infrastructure for archaeological research. The project aims to integrate over 1×10⁶ structured database records referring to archaeological sites and monuments in the UK, with information extracted from semi-structured grey literature reports, and unstructured antiquarian journal accounts, in a single faceted browser interface. The project has illuminated the variable level of vocabulary control and standardization that currently exists within national and local monument inventories. Nonetheless, it has demonstrated that the relatively well-defined ontologies and thesauri that exist in archaeology mean that a high level of success can be achieved using information extraction techniques. This has great potential for unlocking and making accessible the information held in grey literature and antiquarian accounts, and has lessons for allied disciplines.

Publisher

The Royal Society

Subject

General Physics and Astronomy, General Engineering, General Mathematics

Link

https://royalsocietypublishing.org/doi/pdf/10.1098/rsta.2009.0038

Reference: 19 articles.

1. Amrani A. Abajian V. Kodratoff Y. & Matte-Tailliez O. 2008 A chain of text-mining to extract information in archaeology. In Information and communication technologies: from theory to applications ICTTA 2008 3rd Int. Conf. pp. 1–5.

2. Appelt D. E. & Israel D. 1999 Introduction to information extraction technology. IJCAI-99 tutorial Stockholm. See http://www.ai.sri.com/~appelt/ie-tutorial/IJCAI99.pdf.

3. Bridging the Two Cultures – Commercial Archaeology and the Study of Prehistoric Britain

4. Ciravegna F. Lanfrachi V. Moore P. Baghdev R. & Iria J. 2006 Automatically annotating jet engine event reports using information extraction. In Proc. Knowledge and Information Management: the Challenge of Through Life Support Seminar Institution of Mechanical Engineers London 26 September 2006.

Cited by 33 articles.

1. Text Mining Oral Histories in Historical Archaeology;International Journal of Historical Archaeology;2023-01-13

2. Information Extraction and Machine Learning for Archaeological Texts;Discourse and Argumentation in Archaeology: Conceptual and Computational Approaches;2023

3. NLP and Archaeology: A View from a Digital Archive;Discourse and Argumentation in Archaeology: Conceptual and Computational Approaches;2023

4. Can BERT Dig It? Named Entity Recognition for Information Retrieval in the Archaeology Domain;Journal on Computing and Cultural Heritage;2022-09-16

5. Same text, same discourse? Empirical validation of a discourse analysis methodology for cultural heritage;Digital Scholarship in the Humanities;2022-07-09