The Chekhov Digital project: tasks and problems of implementing semantic markup of texts (using A. P. Chekhov's story "The Death of an Official" as an example)


Severina Elena Mikhailovna, Larionova Marina Chengarovna


The article considers a model for preparing machine-readable (semantic) markup of texts for the Chekhov Digital project. The model is illustrated by a philological interpretation of individual significant elements of A. P. Chekhov's story "The Death of an Official" and by the explicit representation of this information according to the Text Encoding Initiative (TEI/XML) standard for digital editions. Drawing on the work of literary scholars, significant entities have been identified for marking up the corpus of the writer's texts, but the question of how to represent them in the text remains complex. Aspects such as "properties, states and events; character features" were examined philologically in an excerpt from the story, with a view to the capabilities that TEI markup offers for preserving philological knowledge in a machine-readable format. One of the objectives of the Chekhov Digital project is to go beyond a simple digitized text and provide the researcher with useful digital tools. The article presents elements of machine-readable markup that make it possible to mark up significant entities in Chekhov's texts so that semantic search can be organized across the corpus of the writer's texts. It also considers the problems and research tasks that arise in such interdisciplinary projects, which require combining the efforts of specialists from different fields of knowledge. The project implements the principle of open research data, whose most important task is to build scientific communities around data. Work on the project has led to scientific cooperation between researchers of the Higher School of Economics, the UNC RAS and the SFU.
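As a rough illustration of how semantic markup of this kind could support search, the sketch below parses a small TEI-style fragment and retrieves marked-up entities by type. It is a minimal sketch under stated assumptions, not the project's actual encoding: the sample sentence is a paraphrase rather than a quotation from the story, and the choice of `<persName>` and `<seg>` with `type` and `ana` attributes, the identifier values, and the helper `find_segments` are all hypothetical.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
NS = {"tei": TEI_NS}

# Hypothetical fragment: a paraphrase of the plot, not a quotation,
# and not the markup scheme actually used by the Chekhov Digital project.
SAMPLE = f"""
<TEI xmlns="{TEI_NS}">
  <text>
    <body>
      <p>
        <persName ref="#chervyakov">Chervyakov</persName>
        <seg type="event" ana="#sneeze">sneezed</seg> and then felt
        <seg type="state" ana="#embarrassment">embarrassed</seg>.
      </p>
    </body>
  </text>
</TEI>
"""


def find_segments(root, seg_type):
    """Return (ana, text) pairs for every <seg> whose @type equals seg_type."""
    hits = []
    for seg in root.iterfind(f".//tei:seg[@type='{seg_type}']", NS):
        hits.append((seg.get("ana"), "".join(seg.itertext()).strip()))
    return hits


root = ET.fromstring(SAMPLE)

# Query the fragment by the (assumed) semantic categories.
events = find_segments(root, "event")
states = find_segments(root, "state")
persons = [p.get("ref") for p in root.iterfind(".//tei:persName", NS)]

print("events:", events)
print("states:", states)
print("persons:", persons)
```

The same query function could be run over every file in a corpus to collect all passages tagged with a given category, which is the sense in which explicit markup makes semantic search possible.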


Aurora Group, s.r.o









