1. Baldwin, T., Lui, M.: Language identification: the long and the short of the matter. In: Human Language Technologies: The 2010 Annual Conference of the NAACL, Los Angeles, CA, pp. 229–237 (June 2010)
2. Bird, S., Dale, R., Dorr, B., Gibson, B., Joseph, M., Kan, M.Y., Lee, D., Powley, B., Radev, D., Tan, Y.F.: The ACL anthology reference corpus: a reference dataset for bibliographic research in computational linguistics. In: Proceedings of the Language Resources and Evaluation Conference (LREC 2008), Marrakesh, Morocco, May 2008
3. Choueka, Y.: Looking for needles in a haystack, or locating interesting collocational expressions in large textual databases. In: Proceedings of the RIAO Conference on User-Oriented Content-Based Text and Image Handling, pp. 21–24. Cambridge, MA (1988)
4. Daudaravicius, V., Marcinkeviciene, R.: Gravity counts for the boundaries of collocations. Int. J. Corpus Linguist. 9(2), 321–348 (2004)
5. Lecture Notes in Computer Science;V Daudaravicius,2010