1. Brown, R.D.: Finding and Identifying Text in 900+ Languages. Digital Investigation 9, S34–S43 (2012)
2. Cavnar, W.B., Trenkle, J.M.: N-Gram-Based Text Categorization. In: Proceedings of SDAIR 1994, 3rd Annual Symposium on Document Analysis and Information Retrieval, UNLV Publications/Reprographics, pp. 161–175 (April 1994)
3. Ljubešić, N., Mikelić, N., Boras, D.: Language Identification: How to Distinguish Similar Languages. In: Lužar-Stifter, V., Hljuz Dobrić, V. (eds.) Proceedings of the 29th International Conference on Information Technology Interfaces, Zagreb, pp. 541–546. SRCE University Computing Centre (2007)
4. Ahmed, B., Cha, S.H., Tappert, C.: Language Identification from Text Using N-gram Based Cumulative Frequency Addition. In: Proceedings of Student/Faculty Research Day, CSIS, Pace University (May 2004)
5. Carter, S., Tsagkias, M., Weerkamp, W.: Semi-Supervised Priors for Microblog Language Identification. In: Proceedings of the Dutch-Belgian Information Retrieval Workshop (DIR 2011), Amsterdam (February 2011)