The Chekhov Digital project: semantic markup of a parallel corpus of translations of Chekhov's fiction into German

Author:

Severina Elena Mikhailovna,Fyodorov Nikita Aleksandrovich

Abstract

The article discusses the issues of developing the principles of a semantically marked parallel corpus of translations of Chekhov's fiction into German within the framework of the Chekhov Digital project, a digital academic publication of the writer's collected works in TEI (Text Encoding Initiative) format. The parallel corpus project is focused on creating a digital infrastructure for studying the writer's works, allowing researchers to analyze and compare original texts with their translations. Difficulties were identified related to the interpretation of significant elements of the writer's works, the specifics of their translation into German and the semantic markup of translations of fiction, for example, difficulties arose with defining the boundaries and relationships between the elements of semantic markup. Ways to overcome them are proposed, including the use of digital methods and natural language processing technologies. The project uses digital methods and technologies of natural language processing, the standard of digital publication Text Encoding Initiative (TEI). The text markup structure based on the TEI standard makes documents machine-readable, which allows to develop tools for complex semantic information retrieval. The inclusion in the Chekhov Digital project of parallel corpora of translations of A. P. Chekhov's works into different languages makes it possible to expand research tools in the field of translation studies, making it possible to compare texts of translations and originals, detect similarities and differences in vocabulary, grammar, style and cultural references, as well as automate routine research processes, which makes search and analysis much more effective information on large volumes of texts. The results of the project will contribute to the development of the digital humanitarian environment, contributing to the preservation and popularization of the literary heritage of A.P. Chekhov. The creation of a semantically marked parallel corpus of translations will be important for literary critics, linguists and translators, allowing them to study the specifics of translations of Chekhov's works and develop new forms of text analysis and interpretation. The experience gained during the project will be valuable for future research and practical applications, demonstrating the effectiveness of digital technologies in humanitarian research and education.

Publisher

Aurora Group, s.r.o

Reference14 articles.

1. Chekhov A. P. Polnoe sobranie sochinenii i pisem: V 30 t. / AN SSSR. In-t mirovoi lit. im. A. M. Gor'kogo. M.: Nauka, 1974-1983.

2. TEI Consortium, eds. TEI P5: Guidelines for Electronic Text Encoding and Interchange. Version 4.7.0. Last updated on 16th November 2023. TEI Consorti-um. URL: http://www.tei-c.org/Guidelines/P5/ (data obrashcheniya: 24.03.2024).

3. Wikidata // URL: https://www.wikidata.org/wiki/Wikidata: Main_Page (data obrashcheniya: 10.04.2024).

4. Severina E. M., Bonch-Osmolovskaya A. A., Kudin A. M. Tsifrovye filologicheskie praktiki: proekt "Chekhov Digital" // Aktual'nye problemy filologii i pedagogicheskoi lingvistiki. 2022. №2. S. 153-165.

5. Dobrovol'skii D. O., Kretov A. A., Sharov S. A. Korpus parallel'nykh tekstov: arkhitektura i vozmozhnosti ispol'zovaniya // Natsional'nyi korpus russkogo yazyka: 2003–2005. M.: Indrik, 2005, 263-296.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3