Affiliation:
1. Key Laboratory of Artificial Intelligence, Information Engineering and Automation, Kunming University of Science and Technology, Kunming, Yunnan, China
Abstract
The smallest semantic unit of the Burmese language is called the syllable. In the present study, it is intended to propose the first neural joint learning model for Burmese syllable segmentation, word segmentation, and
part-of-speech
(
POS
) tagging with the BERT. The proposed model alleviates the error propagation problem of the syllable segmentation. More specifically, it extends the neural joint model for Vietnamese word segmentation, POS tagging, and dependency parsing [28] with the pre-training method of the Burmese character, syllable, and word embedding with BiLSTM-CRF-based neural layers. In order to evaluate the performance of the proposed model, experiments are carried out on Burmese benchmark datasets, and we fine-tune the model of multilingual BERT. Obtained results show that the proposed joint model can result in an excellent performance.
Funder
Key Program of National Natural Science Foundation of China
National Natural Science Foundation of China
Key Project of Natural Science Foundation of Yunnan Province
Candidates of the Young and Middle Aged Academic and Technical Leaders of Yunnan Province
Publisher
Association for Computing Machinery (ACM)
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Indian Language Analysis with XLM-RoBERTa: Enhancing Parts of Speech Tagging for Effective Natural Language Preprocessing;2023 Seventh International Conference on Image Information Processing (ICIIP);2023-11-22
2. The Comparison of Language Models with a Novel Text Filtering Approach for Turkish Sentiment Analysis;ACM Transactions on Asian and Low-Resource Language Information Processing;2022-12-27
3. A BiLSTM-CRF Based Approach to Word Segmentation in Chinese;2022 IEEE Intl Conf on Dependable, Autonomic and Secure Computing, Intl Conf on Pervasive Intelligence and Computing, Intl Conf on Cloud and Big Data Computing, Intl Conf on Cyber Science and Technology Congress (DASC/PiCom/CBDCom/CyberSciTech);2022-09-12