A taxonomy and review of generalization research in NLP-Reference-Cited by-同舟云学术

A taxonomy and review of generalization research in NLP

Published:2023-10-19 Issue:10 Volume:5 Page:1161-1174
ISSN:2522-5839
Container-title:Nature Machine Intelligence
language:en
Short-container-title:Nat Mach Intell

Author:

Hupkes Dieuwke,Giulianelli Mario^ORCID,Dankers Verna,Artetxe Mikel,Elazar Yanai,Pimentel Tiago^ORCID,Christodoulopoulos Christos^ORCID,Lasri Karim,Saphra Naomi,Sinclair Arabella,Ulmer Dennis,Schottmann Florian,Batsuren Khuyagbaatar^ORCID,Sun Kaiser,Sinha Koustuv,Khalatbari Leila,Ryskina Maria^ORCID,Frieske Rita^ORCID,Cotterell Ryan,Jin Zhijing^ORCID

Abstract

AbstractThe ability to generalize well is one of the primary desiderata for models of natural language processing (NLP), but what ‘good generalization’ entails and how it should be evaluated is not well understood. In this Analysis we present a taxonomy for characterizing and understanding generalization research in NLP. The proposed taxonomy is based on an extensive literature review and contains five axes along which generalization studies can differ: their main motivation, the type of generalization they aim to solve, the type of data shift they consider, the source by which this data shift originated, and the locus of the shift within the NLP modelling pipeline. We use our taxonomy to classify over 700 experiments, and we use the results to present an in-depth analysis that maps out the current state of generalization research in NLP and make recommendations for which areas deserve attention in the future.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Computer Networks and Communications,Computer Vision and Pattern Recognition,Human-Computer Interaction,Software

Link

https://www.nature.com/articles/s42256-023-00729-y.pdf

Reference61 articles.

1. Marcus, G. F. Rethinking eliminative connectionism. Cogn. Psychol. 37, 243–282 (1998).

2. Kirk, R., Zhang, A., Grefenstette, E. & Rocktäschel, T. A survey of generalisation in deep reinforcement learning. J. Artif. Intell. Res. https://doi.org/10.1613/jair.1.14174 (2023).

3. Chowdhery, A. et al. PaLM: scaling language modeling with pathways. J. of Mach. Learn. Res. 24, 1–113 (2023).

4. Devlin, J., Chang, M.-W., Lee, K. & Toutanova, K. BERT: pre-training of deep bidirectional transformers for language understanding. In Proc. 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers) (Burstein, J. et al eds) 4171–4186 (Association for Computational Linguistics, 2019); https://doi.org/10.18653/v1/N19-1423

5. Blodgett, S. L., Green, L. & O’Connor, B. Demographic dialectal variation in social media: a case study of African-American English. Jian Su, Kevin Duh, Xavier Carreras (eds). In Proc. 2016 Conference on Empirical Methods in Natural Language Processing (Su, J. et al eds) 1119–1130 (Association for Computational Linguistics, 2016); https://doi.org/10.18653/v1/D16-1120. https://aclanthology.org/D16-1120

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Adaptive Evolutionary Computing Ensemble Learning Model for Sentiment Analysis;Applied Sciences;2024-08-04

2. COMI: COrrect and MItigate Shortcut Learning Behavior in Deep Neural Networks;Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval;2024-07-10

3. STIRNet: A Spatio-Temporal Network for Air Formation Targets Intention Recognition;IEEE Access;2024