PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains

Published: 2022 Volume: 10 Pages: 414-433
ISSN: 2307-387X
Container-title: Transactions of the Association for Computational Linguistics
Language: en

Author:

Ben-David Eyal¹, Oved Nadav², Reichart Roi³

Affiliation:

1. Technion - Israel Institute of Technology, Israel. eyalbd12@campus.technion.ac.il

2. Technion - Israel Institute of Technology, Israel. nadavo@campus.technion.ac.il

3. Technion - Israel Institute of Technology, Israel. roiri@technion.ac.il

Abstract

Natural Language Processing algorithms have made incredible progress, but they still struggle when applied to out-of-distribution examples. We address a challenging and underexplored version of this domain adaptation problem, where an algorithm is trained on several source domains and then applied to examples from unseen domains that are unknown at training time. In particular, no examples, labeled or unlabeled, and no other knowledge about the target domain are available to the algorithm at training time. We present PADA: an example-based autoregressive Prompt learning algorithm for on-the-fly Any-Domain Adaptation, based on the T5 language model. Given a test example, PADA first generates a unique prompt for it and then, conditioned on this prompt, labels the example with respect to the NLP prediction task. PADA is trained to generate a prompt that is a token sequence of unrestricted length, consisting of Domain Related Features (DRFs) that characterize each of the source domains. Intuitively, the generated prompt is a unique signature that maps the test example to a semantic space spanned by the source domains. In experiments with 3 tasks (text classification and sequence tagging), for a total of 14 multi-source adaptation scenarios, PADA substantially outperforms strong baselines.
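
The two-step inference flow described in the abstract (generate an example-specific prompt, then predict conditioned on that prompt) can be sketched as follows. This is only an illustrative sketch, not the authors' implementation: the names `SOURCE_DRFS`, `toy_generate_prompt`, `toy_label_with_prompt`, and `pada_style_predict` are hypothetical, the DRF lists are invented for the example, and both toy functions are simple keyword-based stand-ins for what PADA does with a T5 model.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical Domain Related Features (DRFs) per source domain.
# These lists are invented for illustration; they are not from the paper.
SOURCE_DRFS: Dict[str, List[str]] = {
    "books": ["author", "plot", "chapter"],
    "electronics": ["battery", "screen", "charger"],
}


def toy_generate_prompt(text: str) -> List[str]:
    """Stand-in for step 1: return the source-domain DRFs found in the text.

    PADA generates the prompt autoregressively with T5; this keyword
    match is only an assumption made to keep the sketch self-contained.
    """
    words = set(text.lower().split())
    return [drf for drfs in SOURCE_DRFS.values() for drf in drfs if drf in words]


def toy_label_with_prompt(prompt: List[str], text: str) -> str:
    """Stand-in for step 2: label the example given the prompt and the text."""
    model_input = " ".join(prompt) + " | " + text
    return "negative" if "dies" in model_input.lower() else "positive"


def pada_style_predict(
    text: str,
    generate_prompt: Callable[[str], List[str]],
    label_with_prompt: Callable[[List[str], str], str],
) -> Tuple[List[str], str]:
    """Run the two steps in order: prompt generation, then conditioned labeling."""
    prompt = generate_prompt(text)           # step 1: example-specific prompt
    label = label_with_prompt(prompt, text)  # step 2: prediction given the prompt
    return prompt, label


# An example from a domain (e-readers) that is not among the toy source domains.
prompt, label = pada_style_predict(
    "The battery dies before I finish one chapter",
    toy_generate_prompt,
    toy_label_with_prompt,
)
print(prompt, label)  # ['chapter', 'battery'] negative
```

Passing the two steps in as callables keeps the pipeline independent of any particular model, so the toy stand-ins could be swapped for real model calls without changing `pada_style_predict`.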

Publisher

MIT Press - Journals

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Human-Computer Interaction, Communication

Link

https://direct.mit.edu/tacl/article-pdf/doi/10.1162/tacl_a_00468/2008061/tacl_a_00468.pdf

Cited by 19 articles.

1. A Comprehensive Survey on Test-Time Adaptation Under Distribution Shifts;International Journal of Computer Vision;2024-07-18

2. A prompt construction method for the reverse dictionary task of large-scale language models;Engineering Applications of Artificial Intelligence;2024-07

3. Graph-Enhanced Prompt Learning for Personalized Review Generation;Data Science and Engineering;2024-06-18

4. ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction;Proceedings of the ACM Web Conference 2024;2024-05-13

5. Incremental Accumulation of Linguistic Context in Artificial and Biological Neural Networks;2024-01-16