Affiliation:
1. Kyiv National Linguistic University
Abstract
The research of the European Media comprises plenty of precious documents compiled into our educational corpora. This study represents how the corpus has been compiled. Also, we find it necessary to show the tools for this compilation which were described and analyzed. The first step of our compilation was collecting raw data in the library. The second step was selecting the format of the selection to compile by the chosen tool (the Sketch Engine). Then we made the following selection of files containing European Media content allowing it to go into the EU collection. The selected files were from the library of the popular media of Europe. It has been selected to have comprised highly cited articles from British sources: the BBC, the Sun, the Daily Mail, the Guardian, the Times, and the Economist. All the mentioned newspapers are known for their investigative journalism and critical analysis of current affairs. Our selected tools for our media subcorpus were the web-based tool for creating corpora Sketch Engine. Also, we used the offline corpus manager AntConc, the open-source software program for corpora analysis.
Reference12 articles.
1. Vasko, R., Korolyova, A., Hryshchuk, Y., & Kapranov, Y. (2021, September). Transfer of Mathematical Formulas and Computer Algorithms into Macrocomparative Studies. In 2021 11th International Conference on Advanced Computer Information Technologies (ACIT) IEEE, 2021. p. 642-647.
2. Liashko, O., Bober, N., Kapranov, Y., Cherkhava, O., & Meleshkevych, L. (2022). Interpretation of Keywords as Indicators of Intertextuality in English New Testament Texts (Antconc Corpus Manager Toolkit). WISDOM, 22(2), 193-207.
3. Zhukovska V. English detached adjectival constructions with an explicit subject: A quantitative corpus-based analysis. Journal of Linguistics (Jazykovedný časopis), ROČNÍK 72 (2), 2021. P. 465–477.
4. Zhukovska V. Quantitative Corpus-Driven Approach to Disambiguation of Synonymous Grammatical Constructions. Proceedings of the 4th International Conference on Computational Linguistics and Intelligent Systems (COLINS 2020). Volume I: Main Conference, Lviv, Ukraine, April 23-24, 2020. CEUR Workshop Proceedings 2604, CEUR-WS.org 2020. P. 507–522.
5. Zhukovska V.V., Mosiyuk O. O. Statistical software R in corpus-driven research and machine learning. Information Technologies and Learning Tools. 2021. Vol. 86, № 6. P. 1–18.