1. Zobia R, Waqas A, Usama IB (2011) Challenges in Urdu text tokenization and sentence boundary disambiguation. Proceedings of the IJCNLP Workshop on South and Southeast Asian Natural Language Processing 40–45.
2. A character net based Chinese text segmentation method;L Zhou;Workshop on Building and Using Semantic Networks,2002
3. Kaplan RM (2005) A method for tokenizing text. CSLI Publications, Stanford. pp. 55–63.
4. A heuristic method based on a statistical approach for Chinese text segmentation;CC Yang;Journal of the American Society for Information Science and Technology,2005
5. Intelligent processing system;AS Shahabi;IFIP International Federation of Information Processing Springer Boston,2007