Affiliation:
1. University of Genova, Italy
Abstract
Understanding and describing past or present societies is a complex task, as it involves a multi-faceted analysis of the norms, interactions, and evolutions that characterize them. This serves as the motivation for developing a tool, named Herodotus, aiming at supporting domain experts, such as historians or archaeologists, in the reasoning tasks over complex interactions characterizing a society in order to explain why some event took place and, possibly, to predict what could happen when some factors change. An important part of Herodotus is the text mining module that is responsible for the extraction of knowledge from written sources, such as books and scientific papers. Machines cannot always help users in dealing with natural language, because of the variety, ambiguity and non-rigidity that language shows in its use; they can only try to process information in a meaningful way for users. Information Extraction (IE) is the technology that pulls specific information from large volumes of unstructured texts and stores this information in structured forms. Users can then consult, compose, and analyze them. Domain-based IE should focus on an analysis of a specific state of affairs and, in this way, it can obtain more precise and detailed results. This helps domain experts to deal with the complexity of their everyday objects and environments. This chapter is centered on the Interactive Summary Extractor tool, whose scope is to organize, in a partially automated but substantially interactive way, text summaries for archaeological and historical documental sources. The texts so analyzed will help domain experts to collect data, viewing a synthesized version of it, compose such summaries in units of sense for the particular archaeological study or research that is in place, and so on. Summaries can then be modified, stored, retrieved and managed for later elaboration.
Reference48 articles.
1. Ancona, M., Scagliola, N., & Traverso, A. (2005). Application of 3G cellular phones to cultural heritage: The agamemnon project. Paper presented at the International Workshop on Recording, Modeling and Visualization of Cultural Heritage. New York, NY.
2. Antikytera Mechanism. (n.d.). Wikipedia. Retrieved from http://en.wikipedia.org/wiki/Antikythera_mechanism
3. Baker, C. F., & Fellbaum, C. (2009). WordNet and FrameNet as complementary resources for annotation. In Proceedings of the Third Linguistic Annotation Workshop, ACL-IJCNLP, (pp. 125–129). ACL.
4. Berryman, M. J., Allison, A., & Abbott, D. (2002). Signal processing and statistical methods in analysis of text and DNA. In Proceedings of SPIE: Biomedical Applications of Micro and Nanoengineering. SPIE.
5. Bliki Engine. (n.d.). Retrieved from http://code.google.com/p/gwtwiki/