When Low Resource NLP Meets Unsupervised Language Model: Meta-Pretraining then Meta-Learning for Few-Shot Text Classification (Student Abstract)-Reference-Cited by-同舟云学术

When Low Resource NLP Meets Unsupervised Language Model: Meta-Pretraining then Meta-Learning for Few-Shot Text Classification (Student Abstract)

Published:2020-04-03 Issue:10 Volume:34 Page:13773-13774
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Deng Shumin,Zhang Ningyu,Sun Zhanlin,Chen Jiaoyan,Chen Huajun

Abstract

Text classification tends to be difficult when data are deficient or when it is required to adapt to unseen classes. In such challenging scenarios, recent studies have often used meta-learning to simulate the few-shot task, thus negating implicit common linguistic features across tasks. This paper addresses such problems using meta-learning and unsupervised language models. Our approach is based on the insight that having a good generalization from a few examples relies on both a generic model initialization and an effective strategy for adapting this model to newly arising tasks. We show that our approach is not only simple but also produces a state-of-the-art performance on a well-studied sentiment classification dataset. It can thus be further suggested that pretraining could be a promising solution for few-shot learning of many other NLP tasks. The code and the dataset to replicate the experiments are made available at https://github.com/zxlzr/FewShotNLP.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 18 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-task convex combination interpolation for meta-learning with fewer tasks;Knowledge-Based Systems;2024-07

2. IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification;Proceedings of the 17th ACM International Conference on Web Search and Data Mining;2024-03-04

3. Metric-Free Learning Network with Dual Relations Propagation for Few-Shot Aspect Category Sentiment Analysis;Transactions of the Association for Computational Linguistics;2024

4. Cross-Lingual Zero-Shot and Few-Shot Learning to Hate Speech Detection;2024

5. Boosting Text Classification Performance for Unlabeled Data with Semi-Supervised Learning;2023 IEEE 9th International Women in Engineering (WIE) Conference on Electrical and Computer Engineering (WIECON-ECE);2023-11-25