Probing Natural Language Inference Models through Semantic Fragments

Published:2020-04-03 Issue:05 Volume:34 Page:8713-8721
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title:AAAI

Author:

Richardson Kyle, Hu Hai, Moss Lawrence, Sabharwal Ashish

Abstract

Do state-of-the-art models for language understanding already have, or can they easily learn, abilities such as boolean coordination, quantification, conditionals, comparatives, and monotonicity reasoning (i.e., reasoning about word substitutions in sentential contexts)? While such phenomena are involved in natural language inference (NLI) and go beyond basic linguistic understanding, it is unclear the extent to which they are captured in existing NLI benchmarks and effectively learned by models. To investigate this, we propose the use of semantic fragments—systematically generated datasets that each target a different semantic phenomenon—for probing, and efficiently improving, such capabilities of linguistic models. This approach to creating challenge datasets allows direct control over the semantic diversity and complexity of the targeted linguistic phenomena, and results in a more precise characterization of a model's linguistic behavior. Our experiments, using a library of 8 such semantic fragments, reveal two remarkable findings: (a) State-of-the-art models, including BERT, that are pre-trained on existing NLI benchmark datasets perform poorly on these new fragments, even though the phenomena probed here are central to the NLI task; (b) On the other hand, with only a few minutes of additional fine-tuning—with a carefully selected learning rate and a novel variation of “inoculation”—a BERT-based model can master all of these logic and monotonicity fragments while retaining its performance on established NLI benchmarks.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 28 articles.

1. Mixture of Prompt Experts for Natural Language Inference;2024 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE);2024-08-06

2. GANLI: Elevating Natural Language Inference through Advanced Gated Attention Mechanisms;2024-05-02

3. Transformer Hybrid Neural Network Model for Natural Language Reasoning;2024 IEEE 7th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC);2024-03-15

4. Scope Ambiguities in Large Language Models;Transactions of the Association for Computational Linguistics;2024

5. From Pre-Training to Fine-Tuning: An In-Depth Analysis of Large Language Models in the Biomedical Domain;2024