Prompt-based for Low-Resource Tibetan Text Classification
-
Published:2023-08-24
Issue:8
Volume:22
Page:1-13
-
ISSN:2375-4699
-
Container-title:ACM Transactions on Asian and Low-Resource Language Information Processing
-
language:en
-
Short-container-title:ACM Trans. Asian Low-Resour. Lang. Inf. Process.
Affiliation:
1. Institute of Ethnology and Anthropology, Chinese Academy of Social Sciences, China
Abstract
Text classification is a critical and foundational task in Tibetan natural language processing, it plays a crucial role in various applications, such as sentiment analysis and information extraction. However, the limited availability of annotated data poses a significant challenge to Tibetan natural language processing. This paper proposes a prompt learning-based method for low-resource Tibetan text classification to overcome this challenge. This method utilizes pre-trained language models to learn text representation and generation capabilities on a large-scale unsupervised Tibetan corpus, enabling few-shot Tibetan text classification. Experimental results demonstrate that the proposed method significantly improves the performance of Tibetan text classification in low-resource scenarios. This work provides a new research idea and method for low-resource language processing, such as Tibetan natural language processing. Hopefully, it will inspire subsequent work on low-resource language processing.
Funder
National Social Science Foundation of China
National Natural Science Foundation of China
Innovation Project major research of Chinese Academy of Social Sciences
Publisher
Association for Computing Machinery (ACM)
Subject
General Computer Science
Reference46 articles.
1. Li Ailin. 2014. Research on Tibetan text classification algorithm for Web public opinion analysis. Master’s thesis. Northwest University for Nationalities.
2. Yuan Bin. 2016. Research and Implementation of Tibetan Weibo Emotion Classification. Master’s Thesis . Northwest University for Nationalities.
3. Tibetan text classification based on pre-trained language model;An Bo;Journal of Chinese Information Processing,2022
4. A Survey on Aspect-Based Sentiment Classification
5. Deeplearning Model Used in Text Classification
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献