Half-Context Language Models-Reference-Cited by-同舟云学术

Half-Context Language Models

Published:2011-12 Issue:4 Volume:37 Page:843-865
ISSN:0891-2017
Container-title:Computational Linguistics
language:en
Short-container-title:Computational Linguistics

Author:

Schütze Hinrich¹,Walsh Michael¹

Affiliation:

1. University of Stuttgart

Abstract

This article investigates the effects of different degrees of contextual granularity on language model performance. It presents a new language model that combines clustering and half-contextualization, a novel representation of contexts. Half-contextualization is based on the half-context hypothesis that states that the distributional characteristics of a word or bigram are best represented by treating its context distribution to the left and right separately and that only directionally relevant distributional information should be used. Clustering is achieved using a new clustering algorithm for class-based language models that compares favorably to the exchange algorithm. When interpolated with a Kneser-Ney model, half-context models are shown to have better perplexity than commonly used interpolated n-gram models and traditional class-based approaches. A novel, fine-grained, context-specific analysis highlights those contexts in which the model performs well and those which are better treated by existing non-class-based models.

Publisher

MIT Press - Journals

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics

Link

https://www.mitpressjournals.org/doi/pdf/10.1162/COLI_a_00078

Reference10 articles.

1. Long distance bigram models applied to word clustering

2. 10.1162/153244303322533223

3. "Schema abstraction" in a multiple-trace memory model.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Tooth Sensitivity After Dental Bleaching With a Desensitizer-containing and a Desensitizer-free Bleaching Gel: A Systematic Review and Meta-analysis;Operative Dentistry;2019-03-01