Neural Network Acceptability Judgments

Published: 2019-11 Volume: 7 Pages: 625-641
ISSN: 2307-387X
Container-title: Transactions of the Association for Computational Linguistics
Language: en
Short-container-title: Transactions of the Association for Computational Linguistics

Author:

Warstadt Alex¹, Singh Amanpreet², Bowman Samuel R.¹

Affiliation:

1. New York University.

2. New York University, Facebook AI Research.

Abstract

This paper investigates the ability of artificial neural networks to judge the grammatical acceptability of a sentence, with the goal of testing their linguistic competence. We introduce the Corpus of Linguistic Acceptability (CoLA), a set of 10,657 English sentences drawn from published linguistics literature and labeled as grammatical or ungrammatical. As baselines, we train several recurrent neural network models on acceptability classification, and find that our models outperform unsupervised models by Lau et al. (2016) on CoLA. Error analysis on specific grammatical phenomena reveals that both Lau et al.'s models and ours learn systematic generalizations like subject-verb-object order. However, all models we test perform far below human level on a wide range of grammatical constructions.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence, Computer Science Applications, Linguistics and Language, Human-Computer Interaction, Communication

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/tacl_a_00290

References: 60 articles.

1. The Handbook of Contemporary Syntactic Theory

2. Poverty of the Stimulus Revisited

Cited by 179 articles.

1. SNN-BERT: Training-efficient Spiking Neural Networks for energy-efficient BERT;Neural Networks;2024-12

2. A systematic review of machine learning methods in software testing;Applied Soft Computing;2024-09

3. Comparative analysis of paraphrasing performance of ChatGPT, GPT‐3, and T5 language models using a new ChatGPT generated dataset: ParaGPT;Expert Systems;2024-08-15

4. A flexible BERT model enabling width- and depth-dynamic inference;Computer Speech & Language;2024-08

5. Modelling child comprehension: A case of suffixal passive construction in Korean;Computer Speech & Language;2024-08