Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

Published: 2021-05-18 Issue: 15 Volume: 35 Pages: 13834-13842
ISSN: 2374-3468
Container-title: Proceedings of the AAAI Conference on Artificial Intelligence
Short-container-title: AAAI

Author:

Staliūnaitė Ieva, Gorinski Philip John, Iacobacci Ignacio

Abstract

Determining the plausibility of causal relations between clauses is a commonsense reasoning task that requires complex inference ability. The general approach to this task is to train a large pretrained language model on a specific dataset. However, the available training data for the task is often scarce, which leads to instability of model training or reliance on the shallow features of the dataset. This paper presents a number of techniques for making models more robust in the domain of causal reasoning. Firstly, we perform adversarial training by generating perturbed inputs through synonym substitution. Secondly, based on a linguistic theory of discourse connectives, we perform data augmentation using a discourse parser for detecting causally linked clauses in large text, and a generative language model for generating distractors. Both methods boost model performance on the Choice of Plausible Alternatives (COPA) dataset, as well as on a Balanced COPA dataset, which is a modified version of the original data that has been developed to avoid superficial cues, leading to a more challenging benchmark. We show a statistically significant improvement in performance and robustness on both datasets, even with only a small number of additionally generated data points.
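To make the first technique in the abstract concrete, the following is a minimal Python sketch of generating perturbed inputs through synonym substitution. It is an illustration under stated assumptions, not the authors' implementation: the `SYNONYMS` table, the function names `perturb` and `augment`, the `rate` parameter, and the example item are all hypothetical, and the words to replace are picked at random rather than chosen to maximise model loss as a full adversarial training setup might do.

```python
import random

# Hypothetical toy synonym table (an assumption for illustration only);
# a real system would draw candidates from a lexical resource or embeddings.
SYNONYMS = {
    "man": ["guy", "fellow"],
    "broke": ["fractured", "shattered"],
    "toe": ["digit"],
    "hammer": ["mallet"],
}


def perturb(sentence, synonyms=SYNONYMS, rate=0.5, rng=None):
    """Return `sentence` with some known words swapped for a synonym."""
    if rng is None:
        rng = random.Random(0)
    out = []
    for word in sentence.split():
        core = word.rstrip(".,!?")   # separate trailing punctuation
        tail = word[len(core):]
        candidates = synonyms.get(core.lower())
        if candidates and rng.random() < rate:
            out.append(rng.choice(candidates) + tail)
        else:
            out.append(word)
    return " ".join(out)


def augment(example, n=2, seed=0):
    """Create n perturbed variants of a premise/two-alternative item."""
    rng = random.Random(seed)
    variants = []
    for _ in range(n):
        variants.append({
            "premise": perturb(example["premise"], rng=rng),
            "choice1": perturb(example["choice1"], rng=rng),
            "choice2": perturb(example["choice2"], rng=rng),
            # Assumption: synonym substitution leaves the correct answer unchanged.
            "label": example["label"],
        })
    return variants


# Illustrative item in a premise/alternatives layout (hypothetical field names).
example = {
    "premise": "The man broke his toe.",
    "choice1": "He got a hole in his sock.",
    "choice2": "He dropped a hammer on his foot.",
    "label": 2,
}

for variant in augment(example, n=2):
    print(variant)
```

The perturbed variants would be added to the training data alongside the original item; keeping the label fixed relies on the substitutions being meaning-preserving, which a toy table like this one cannot guarantee.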

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 6 articles.

1. GaitSCM: Causal representation learning for gait recognition; Computer Vision and Image Understanding; 2024-06

2. Semantic Augmentation in Chinese Adversarial Corpus for Discourse Relation Recognition Based on Internal Semantic Elements; Electronics; 2024-05-15

3. Interpretability for reliable, efficient, and self-cognitive DNNs: From theories to applications; Neurocomputing; 2023-08

4. A comparative study of adversarial training methods for neural models of source code; Future Generation Computer Systems; 2023-05

5. 基本イベントに基づく常識推論データセットの構築と利用 [translated title: Construction and Use of a Commonsense Reasoning Dataset Based on Basic Events]; Journal of Natural Language Processing; 2023