1. Cavnar, W., Trenkle, J.: N-Gram Based Text Categorization. In: 3rd Annual Symposium on Document Analysis and Information Retrieval, Las Vegas, NV, pp. 161–175 (1994)
2. Dunning, T.: Statistical Identification of Language, Technical report, Computing Research Laboratory, New Mexico State University (1994)
3. Lee, D.S., Nohl, C.R., Baird, H.S.: Language Identification in Complex, Unoriented, and Degraded Document Images. In: International Workshop on Document Analysis Systems, Malvern, Penn-sylvania, pp. 76–98 (1996)
4. Hochberg, J., Kerns, L., Kelly, P., Thomas, T.: Automatic Script Identification from Images Using Cluster-based Templates. IEEE PAMI 19(2), 176–181 (1997)
5. Spitz, A.L.: Determination of the Script and Language Content of Document Images. IEEE PAMI 19(3), 235–245 (1997)