1. Alatrash, R., Schlechtweg, D., Kuhn, J., & Schulte im Walde, S. (2020). CCOHA: Clean Corpus of Historical American English. In Proceedings of the Twelfth Language Resources and Evaluation Conference, 6958–6966. Marseille, France: European Language Resources Association. URL: https://aclanthology.org/2020.lrec-1.859/
2. Alves, D., Thakkar, G., & Tadić, M. (2022). Building and Evaluating Universal Named-Entity Recognition English corpus, 1–15. https://doi.org/10.48550/arXiv.2212.07162
3. Anthony, L. (2023). Corpus AI: Integrating Large Language Models (LLMs) into a Corpus Analysis Toolkit. Presentation given at the 49th Annual Conference of the Japan Association for English Corpus Studies, Kansai University, Osaka, Japan. URL: https://osf.io/srtyd/
4. Burnard, L. (2004). Metadata for corpus work. In M. Wynne (Ed.), Developing linguistic corpora: A guide to good practice (pp. 40–57). Oxford: Oxbow Books. URL: https://users.ox.ac.uk/~martinw/dlc/chapter3.htm
5. Chaplynskyi, D. (2023). Introducing UberText 2.0: A Corpus of Modern Ukrainian at Scale. Proceedings of the Second Ukrainian Natural Language Processing Workshop, 1–10, Dubrovnik. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.unlp-1.1