1. Bollmann, M., Dipper, S., Krasselt, J., and Petran, F. (2012). Manual and semi-automatic normalization of historical spelling - case studies from Early New High German. In KONVENS, pp. 342–350.
2. Fišer (2016). JANES v0.4: korpus slovenskih spletnih uporabniških vsebin (JANES v0.4: a corpus of Slovene user generated content). Slovenščina 2.0.
3. TnT
4. Eisenstein, J. (2013). What to do about bad language on the Internet. In Proceedings of North American Chapter of the Association for Computational Linguistics (NAACL), pp. 359–369.
5. Rayson, P., Archer, D., Baron, A., Culpeper, J., and Smith, N. (2007). Tagging the Bard: Evaluating the accuracy of a modern POS tagger on Early Modern English corpora. In Proceedings of the Corpus Linguistics Conference: CL 2007. UCREL.