Abstract
This paper describes work on the morphological and syntactic annotation of Sumerian cuneiform as a model for low resource languages in general. Cuneiform texts are invaluable sources for the study of history, languages, economy, and cultures of Ancient Mesopotamia and its surrounding regions. Assyriology, the discipline dedicated to their study, has vast research potential, but lacks the modern means for computational processing and analysis. Our project, Machine Translation and Automated Analysis of Cuneiform Languages, aims to fill this gap by bringing together corpus data, lexical data, linguistic annotations and object metadata. The project’s main goal is to build a pipeline for machine translation and annotation of Sumerian Ur III administrative texts. The rich and structured data is then to be made accessible in the form of (Linguistic) Linked Open Data (LLOD), which should open them to a larger research community. Our contribution is two-fold: in terms of language technology, our work represents the first attempt to develop an integrative infrastructure for the annotation of morphology and syntax on the basis of RDF technologies and LLOD resources. With respect to Assyriology, we work towards producing the first syntactically annotated corpus of Sumerian.
Funder
Deutsche Forschungsgemeinschaft
National Endowment for the Humanities
Social Sciences and Humanities Research Council of Canada
Reference33 articles.
1. Towards Open Data for Linguistics: Linguistic Linked Data
2. SPARQL 1.1 Overviewhttps://www.w3.org/TR/sparql11-overview/
3. Ur_Namma, an OWL Ontology of a Sumerian Grammarhttp://www.epistematica.com/2012/05/ur_namma-an-owl-ontology-of-a-sumerian-grammar/
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献