Authors:
Long Jun, Yin Zhuoying, Liu Chao, Huang Wenti
Abstract
Prompt-tuning has emerged as a promising approach for improving the performance of classification tasks by converting them into masked language modeling problems through the insertion of text templates. Despite its considerable success, applying this approach to relation extraction is challenging. The relation is often expressed as a specific word or phrase between two entities, and predicting it usually requires mapping these terms onto an existing lexicon and introducing extra learnable parameters, which reduces the coherence between the pre-training and fine-tuning tasks. To address this issue, we propose a novel prompt-tuning method for relation extraction that enhances the coherence between fine-tuning and pre-training. Specifically, we avoid the need for a single suitable relation word by converting the relation into relational semantic keywords, representative phrases that encapsulate the essence of the relation. Moreover, we employ a composite loss function that optimizes the model at both the token and relation levels. At the token level, our approach combines the masked language modeling (MLM) loss with an entity-pair constraint loss on the predicted tokens; at the relation level, it uses both the cross-entropy loss and a TransE-style translation loss. Extensive experiments on four datasets demonstrate that our method significantly improves relation extraction performance, with an average gain of approximately 1.6 F1 points over the current state-of-the-art model. Code is released at https://github.com/12138yx/TCohPrompt.
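The composite objective described in the abstract can be illustrated with a minimal PyTorch sketch. This is not the paper's implementation: the function names (transe_score, composite_loss), the loss weights, and the margin are hypothetical, and the entity-pair constraint term on predicted tokens is omitted for brevity; the sketch only shows the general shape of combining a token-level MLM loss, a relation-level cross-entropy loss, and a TransE-style translation term.

```python
import torch
import torch.nn.functional as F

def transe_score(head, rel, tail, p=2):
    # TransE (Bordes et al., 2013): a relation holds when head + rel ≈ tail,
    # so the score is the distance ||head + rel - tail||_p (lower is better).
    return torch.norm(head + rel - tail, p=p, dim=-1)

def composite_loss(mlm_logits, mlm_labels,
                   rel_logits, rel_labels,
                   head_emb, rel_emb, tail_emb,
                   margin=1.0, weights=(1.0, 1.0, 1.0)):
    """Hypothetical combination of token-level and relation-level objectives.

    The margin, weights, and the omission of the entity-pair constraint
    term are assumptions; the paper's exact formulation may differ.
    """
    # Token level: masked language modeling loss over the vocabulary
    # (unlabeled positions are marked with -100, as in Hugging Face models).
    loss_mlm = F.cross_entropy(
        mlm_logits.view(-1, mlm_logits.size(-1)),
        mlm_labels.view(-1),
        ignore_index=-100,
    )

    # Relation level: standard cross-entropy over relation classes.
    loss_rel = F.cross_entropy(rel_logits, rel_labels)

    # Relation level: TransE-style hinge term pushing head + rel toward tail.
    loss_transe = F.relu(transe_score(head_emb, rel_emb, tail_emb) - margin).mean()

    w_mlm, w_rel, w_transe = weights
    return w_mlm * loss_mlm + w_rel * loss_rel + w_transe * loss_transe
```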
Funder
Department of Education of Hunan Province
National Natural Science Foundation of China
Publisher
Springer Science and Business Media LLC