1. Airio, E. (2006). Word normalization and decompounding in mono- and bilingual IR. Information Retrieval, 9, 249–271.
2. Barnbrook, G. (1996). Language and computers. Edinburgh University Press.
3. Beale, A. D. (1987). Towards a distributional lexicon. In R. Garside, G. Leech, & G. Sampson (Eds.), The computational analysis of English: A corpus-based approach (pp. 149–162). Longman.
4. Biber, D., Conrad, S., & Reppen, R. (1998). Corpus linguistics: Investigating language structure and use. Cambridge University Press.
5. Creutz, M. (2003). Unsupervised segmentation of words using prior distributions of morph length and frequency. In Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, July 2003 (pp. 280–287).