Abstract
Named Entity Recognition (NER) is at the core of natural language understanding. The quality and quantity of the available datasets determine the performance of deep-learning-based NER models. Because NER datasets require token-level or word-level labels, annotating them is expensive and time-consuming. To reduce the effort of manual annotation, many prior studies have used weak supervision for NER tasks. However, using weak supervision directly hinders the training of deep networks because the automatically annotated labels contain a lot of noise. In this study, we propose a framework for better training deep NER models on weakly labeled data. The framework stems from the idea that mixup, recently adopted as a data augmentation strategy, can be an obstacle to deep model training when applied directly to NER tasks. Inspired by this idea, we use mixup as a perturbation function for consistency regularization, one of the semi-supervised learning strategies. To support this idea, we conducted several experiments on NER benchmarks. The experimental results show that applying mixup directly to NER tasks hinders deep model training, and that the proposed framework improves performance compared with training on only a small amount of human-annotated data.
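As a rough illustration of what "mixup as a perturbation function for consistency regularization" can look like, the sketch below interpolates two token embeddings and penalizes the gap between the prediction on the mixed input and the mix of the two individual predictions. This follows the general interpolation-consistency pattern and is an assumption, not necessarily the formulation used in the paper. The names `toy_tagger` and `consistency_loss`, the linear-plus-softmax classifier, the mean-squared-error distance, and the Beta(0.4, 0.4) mixing coefficient are all hypothetical choices made for the example.

```python
import math
import random


def mixup(a, b, lam):
    """Interpolate two equal-length vectors: lam * a + (1 - lam) * b."""
    return [lam * x + (1.0 - lam) * y for x, y in zip(a, b)]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_tagger(embedding, weights):
    """Hypothetical stand-in for a token classifier: linear layer + softmax."""
    logits = [sum(w * x for w, x in zip(row, embedding)) for row in weights]
    return softmax(logits)


def consistency_loss(emb_a, emb_b, weights, lam):
    """MSE between the prediction on the mixed input and the mixed predictions.

    No labels are used, so the term can be computed on unlabeled or
    weakly labeled tokens (assumed setup, for illustration only).
    """
    pred_on_mixed_input = toy_tagger(mixup(emb_a, emb_b, lam), weights)
    mixed_predictions = mixup(
        toy_tagger(emb_a, weights), toy_tagger(emb_b, weights), lam
    )
    squared = [(p - q) ** 2 for p, q in zip(pred_on_mixed_input, mixed_predictions)]
    return sum(squared) / len(squared)


if __name__ == "__main__":
    rng = random.Random(0)
    lam = rng.betavariate(0.4, 0.4)  # mixing coefficient, assumed Beta(0.4, 0.4)
    weights = [[0.5, -0.2, 0.1], [-0.3, 0.8, 0.0], [0.1, 0.1, -0.6]]
    emb_a = [1.0, 0.0, 2.0]   # toy embedding of one token
    emb_b = [0.0, 1.5, -1.0]  # toy embedding of another token
    print(consistency_loss(emb_a, emb_b, weights, lam))
```

In this sketch the mixed input is never paired with a mixed label, so noisy weak labels are not interpolated into the training target; mixup only perturbs the input, and the model is asked to behave consistently under that perturbation.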
Funder
Technology development Program
Ministry of SMEs and Startups
Subject
Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science