Affiliation:
1. Technion - Israel Institute of Technology, Israel. eyalbd12@campus.technion.ac.il
2. Technion - Israel Institute of Technology, Israel. nadavo@campus.technion.ac.il
3. Technion - Israel Institute of Technology, Israel. roiri@technion.ac.il
Abstract
Abstract
Natural Language Processing algorithms have made incredible progress, but they still struggle when applied to out-of-distribution examples. We address a challenging and underexplored version of this domain adaptation problem, where an algorithm is trained on several source domains, and then applied to examples from unseen domains that are unknown at training time. Particularly, no examples, labeled or unlabeled, or any other knowledge about the target domain are available to the algorithm at training time. We present PADA: An example-based autoregressive Prompt learning algorithm for on-the-fly Any-Domain Adaptation, based on the T5 language model. Given a test example, PADA first generates a unique prompt for it and then, conditioned on this prompt, labels the example with respect to the NLP prediction task. PADA is trained to generate a prompt that is a token sequence of unrestricted length, consisting of Domain Related Features (DRFs) that characterize each of the source domains. Intuitively, the generated prompt is a unique signature that maps the test example to a semantic space spanned by the source domains. In experiments with 3 tasks (text classification and sequence tagging), for a total of 14 multi-source adaptation scenarios, PADA substantially outperforms strong baselines.1
Subject
Artificial Intelligence,Computer Science Applications,Linguistics and Language,Human-Computer Interaction,Communication
Reference73 articles.
1. Invariant risk
minimization;Arjovsky;CoRR,2019
2. Perl: Pivot-based domain adaptation for
pre-trained deep contextualized embedding models;Ben-David;Transactions of the Association for Computational
Linguistics,2020
3. Biographies, Bollywood, boom-boxes and
blenders: Domain adaptation for sentiment classification;Blitzer,2007
4. Zero-shot domain adaptation: A multi-view
approach;Blitzer,2009
5. Domain adaptation with structural
correspondence learning;Blitzer,2006
Cited by
19 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献