1. T. Arts, Y. Belinkov, N. Habash, A. Kilgarriff, and V. Suchomel. 2014. arTenTen: Arabic Corpus and Word Sketches. Journal of King Saud University - Computer and Information Sciences, 26.4, 357--371.
2. T. McEnery, and A. Hardie. 2012. Corpus Linguistics: Method, Theory and Practice. Cambridge: Cambridge University Press.
3. V. Clark-Sánchez. 2013. Review: Gries (2009) Quantitative Corpus Linguistics with R. London and New York: Routledge. Corpora, 8.2, 269--272.
4. N. Habash, and O. Rambow. 2005. Arabic tokenization, part-of-speech tagging and morphological disambiguation in one fell swoop. In Proceedings of the Association for Computational Linguistics (ACL'05). Ann Arbor, Michigan, 573--580.
5. N. Habash, O. Rambow, and R. Roth. 2009. MADA+TOKAN: a toolkit for Arabic tokenization, diacritization, morphological disambiguation, POS tagging, stemming and lemmatization. In Proceedings of the International Conference on Arabic Language Resources and Tools. Cairo, Egypt, 102--109.