Author:
Liu Xiaodong, Rzepka Rafal, Araki Kenji
Abstract
There are many types of approaches for Paraphrase Identification (PI), an NLP task of determining whether a sentence pair has equivalent semantics. Traditional approaches mainly consist of unsupervised learning and feature engineering, which are computationally inexpensive; however, their task performance is moderate nowadays. To find a method that preserves the low computational cost of traditional approaches while yielding better task performance, we investigate neural network-based transfer learning approaches. We find that this goal can be accomplished by using parameters more efficiently in feature-based transfer. To this end, we propose a pre-trained task-specific architecture. The fixed parameters of the pre-trained architecture can be shared by multiple classifiers with small additional parameters. As a result, the only computational cost involving parameter updates comes from classifier tuning: the features output by the architecture, combined with lexical overlap features, are fed into a single classifier for tuning. Furthermore, the pre-trained task-specific architecture can be applied to natural language inference and semantic textual similarity tasks as well. This technical novelty leads to only slight consumption of computational and memory resources per task and is also conducive to power-efficient continual learning. The experimental results show that our proposed method is competitive with adapter-BERT (a parameter-efficient fine-tuning approach) on some tasks while using only 16% of its trainable parameters and saving 69-96% of the time for parameter updates.
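The feature-based transfer setup described above (a frozen feature extractor shared across tasks, with only a small classifier head updated per task) can be illustrated with a minimal sketch. The toy embeddings, the Jaccard-style lexical overlap feature, and the logistic-regression head below are illustrative assumptions, not the paper's actual architecture:

```python
import numpy as np

def lexical_overlap(s1, s2):
    """Jaccard word overlap between two sentences (a common lexical feature)."""
    w1, w2 = set(s1.lower().split()), set(s2.lower().split())
    if not w1 or not w2:
        return 0.0
    return len(w1 & w2) / len(w1 | w2)

def build_features(pair_embeddings, sentence_pairs):
    """Concatenate frozen (fixed) encoder features with lexical overlap scores."""
    overlap = np.array([[lexical_overlap(a, b)] for a, b in sentence_pairs])
    return np.hstack([pair_embeddings, overlap])

def train_classifier(X, y, lr=0.1, epochs=500):
    """Logistic-regression head: the only parameters that get updated."""
    rng = np.random.default_rng(0)
    w = rng.normal(scale=0.01, size=X.shape[1])
    b = 0.0
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))  # sigmoid predictions
        w -= lr * (X.T @ (p - y)) / len(y)       # gradient step on weights
        b -= lr * np.mean(p - y)                 # gradient step on bias
    return w, b

# Toy demo: these vectors stand in for the frozen pre-trained features.
pairs = [("a cat sat", "a cat sat"), ("dogs bark loudly", "the sky is blue")]
emb = np.array([[0.9, 0.8], [0.1, 0.2]])
X = build_features(emb, pairs)
y = np.array([1.0, 0.0])  # 1 = paraphrase, 0 = not
w, b = train_classifier(X, y)
preds = (1.0 / (1.0 + np.exp(-(X @ w + b))) > 0.5).astype(int)
```

Because the extractor is never updated, adding a new task (e.g. natural language inference) would only require training another small head like `train_classifier`, which is where the reported parameter and time savings come from.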
Publisher
Cambridge University Press (CUP)
Subject
Artificial Intelligence, Linguistics and Language, Language and Linguistics, Software
References (78 articles)
1. SemEval-2014 Task 1: Evaluation of Compositional Distributional Semantic Models on Full Sentences through Semantic Relatedness and Textual Entailment
2. SciTaiL: A Textual Entailment Dataset from Science Question Answering
3. de Masson d'Autume, C., Ruder, S., Kong, L. and Yogatama, D. (2019). Episodic memory in lifelong language learning. In Wallach, H., Larochelle, H., Beygelzimer, A., d'Alché-Buc, F., Fox, E. and Garnett, R. (eds), Advances in Neural Information Processing Systems, vol. 32, Curran Associates, Inc.