Publisher
Springer Nature Switzerland
Reference11 articles.
1. Zhao, W., et al.: A Survey of Large Language Models. arXiv abs:2303.18223 [cs.CL] (2023)
2. Gage, P.: A new algorithm for data compression. In: The C Users Journal, vol. 12, issue 201, pp. 23–38 (1994)
3. Provilkov, I., Emelianenko, D., Voita, E.: BPE-dropout: simple and effective subword regularization. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 1882–1892, Online. Association for Computational Linguistics (2020)
4. He, X., Haffari, C., Norouzi, M.: Dynamic programming encoding for subword segmentation in neural machine translation. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 3042–3051, Online. Association for Computational Linguistics (2020)
5. Vepstas, V., Goertzel, B.: Learning language from a large (unannotated) corpus. In: Computing Research Repository, arXiv:1401.3372 [cs.CL] (2014)