Neural Lattice Language Models

Published:2018-12 Volume:6 Page:529-541
ISSN:2307-387X
Container-title:Transactions of the Association for Computational Linguistics
language:en
Short-container-title:TACL

Author:

Buckman Jacob¹, Neubig Graham¹

Affiliation:

1. Language Technologies Institute, Carnegie Mellon University

Abstract

In this work, we propose a new language modeling paradigm that has the ability to perform both prediction and moderation of information flow at multiple granularities: neural lattice language models. These models construct a lattice of possible paths through a sentence and marginalize across this lattice to calculate sequence probabilities or optimize parameters. This approach allows us to seamlessly incorporate linguistic intuitions — including polysemy and the existence of multiword lexical items — into our language model. Experiments on multiple language modeling tasks show that English neural lattice language models that utilize polysemous embeddings are able to improve perplexity by 9.95% relative to a word-level baseline, and that a Chinese model that handles multi-character tokens is able to improve perplexity by 20.94% relative to a character-level baseline.
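The marginalization step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes each lattice edge is a chunk of at most `max_len` adjacent tokens and that a chunk's score is independent of context, whereas the paper conditions predictions on a neural network state. The names `lattice_log_prob`, `toy_chunk_log_prob`, and `TOY_PROBS` are hypothetical, and the toy scores are not a normalized distribution.

```python
import math


def logsumexp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def lattice_log_prob(tokens, chunk_log_prob, max_len=2):
    """Log-probability of `tokens`, summed over every segmentation into
    chunks of at most `max_len` tokens (a forward pass over the lattice)."""
    n = len(tokens)
    # alpha[j] = log of the total score of all lattice paths covering tokens[:j]
    alpha = [-math.inf] * (n + 1)
    alpha[0] = 0.0
    for j in range(1, n + 1):
        scores = []
        for i in range(max(0, j - max_len), j):
            # Edge from position i to position j emits the chunk tokens[i:j].
            scores.append(alpha[i] + chunk_log_prob(tuple(tokens[i:j])))
        alpha[j] = logsumexp(scores)
    return alpha[n]


# Hypothetical context-independent chunk scores (assumption, for illustration only).
TOY_PROBS = {("new",): 0.2, ("york",): 0.1, ("new", "york"): 0.05}


def toy_chunk_log_prob(chunk):
    p = TOY_PROBS.get(chunk, 0.0)
    return math.log(p) if p > 0 else -math.inf


# Two paths: "new" + "york" (0.2 * 0.1) and the single chunk "new york" (0.05).
total = math.exp(lattice_log_prob(["new", "york"], toy_chunk_log_prob))
```

Here `total` sums the two-token path and the single multiword chunk, which is the sense in which the sequence probability is marginalized over lattice paths. The same recursion yields a differentiable training objective when the chunk scores come from a model.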

Publisher

MIT Press - Journals

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/tacl_a_00036

References: 6 articles.

1. Stored Word Sequences in Language Learning

2. Long Short-Term Memory

3. Adding more fuel to the fire: An eye-tracking study of idiom processing by native and non-native speakers

Cited by 4 articles.

1. DiverSeg: Leveraging Diverse Segmentations with Cross-granularity Alignment for Neural Machine Translation;Journal of Natural Language Processing;2024

2. Canine: Pre-training an Efficient Tokenization-Free Encoder for Language Representation;Transactions of the Association for Computational Linguistics;2022

3. Increasing Context for Estimating Confidence Scores in Automatic Speech Recognition;IEEE/ACM Transactions on Audio, Speech, and Language Processing;2022

4. Confusion2Vec: towards enriching vector space word representations with representational ambiguities;PeerJ Computer Science;2019-06-10