Publisher
Springer Science and Business Media LLC
Reference29 articles.
1. Webster JJ, Kit C. Tokenization as the initial phase in NLP. In: COLING 1992 volume 4 pp-23-28: The 14th international conference on computational linguistics. 1992.
2. Hansdah RC, Murmu NC. Encoding of Ol Chiki in Universal Character Set. First week of September; 2002. https://wesanthals.tripod.com/sitebuildercontent/sitebuilderfiles/uni_olchiki.pdf.
3. Raghunath Murmu (2000, 7th ed.), pa.rsi poha(Pasi Poha, A Santali Primer), ASECA, Rairangpur, Orissa, India. http://debracollege.dspaces.org/bitstream/123456789/982/1/SANTALI_Learning_BOOk_Sanatli_prsi_Poha_compressed.pdf.
4. Toraman C, et al. Impact of tokenization on language models: An analysis for turkish. ACM Trans Asian Low-Resour Lang Inf Process. 2023;22(4):1–21.
5. Kuhail MA, et al. Interacting with educational chatbots: a systematic review. Educ Inf Technol. 2023;28(1):973–1018.