Author:
LIU Zhiwei,HUANG Bo,XIA Chunming,XIONG Yujie,ZANG Zhensen,ZHANG Yongqiang
Abstract
The few-shot named entity recognition (NER) task aims to train a robust model in the source domain and transfer it to the target domain with very few annotated data. Currently, some approaches rely on the prototypical network for NER. However, these approaches often overlook the spatial relations in the span boundary matrix because entity words tend to depend more on adjacent words. We propose using a multidimensional convolution module to address this limitation to capture short-distance spatial dependencies. Additionally, we utilize an improved prototypical network and assign different weights to different samples that belong to the same class, thereby enhancing the performance of the few-shot NER task. Further experimental analysis demonstrates that our approach has significantly improved over baseline models across multiple datasets.
Reference38 articles.
1. Lafferty J, McCallum A, Pereira F C N. Conditional random fields: Probabilistic models for segmenting and labeling sequence data[C]//Proceedings of the 18th Conference of the International Conference on Machine Learning. Washington D C: AAAI Press, 2001: 282-289.
2. Fritzler A, Logacheva V, Kretov M. Few-shot classification in named entity recognition task[C]//Proceedings of the 34th ACM/SIGAPP Symposium on Applied Computing. New York: ACM SIGGRAPH, 2019: 993-1000.
3. Yang Y, Katiyar A. Simple and effective few-shot named entity recognition with structured nearest neighbor learning[C]//Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. Stroudsburg: EMNLP, 2020: 6365-6375.
4. Das S S S, Katiyar A, Passonneau R J, et al. CONTaiNER: Few-shot named entity recognition via contrastive learning[C]//Proceedings of the 60th Annual Meeting of the Association for Computational Linguistic. Stroudsburg: ACL, 2022: 6338-6353.
5. Snell J, Swersky K, Zemel R. Prototypical networks for few-shot learning[C]//Proceedings of the 2020 Conference in Neural Information Processing Systems. Cambridge: NIPS, 2020: 4077-4087.