Neural Embedding Allocation: Distributed Representations of Topic Models-Reference-Cited by-同舟云学术

Neural Embedding Allocation: Distributed Representations of Topic Models

Published:2022 Issue:4 Volume:48 Page:1021-1052
ISSN:0891-2017
Container-title:Computational Linguistics
language:en
Short-container-title:

Author:

Keya Kamrun Naher¹,Papanikolaou Yannis²,Foulds James R.³

Affiliation:

1. University of Maryland, Baltimore County Department of Information Systems kkeya1@umbc.edu

2. Healx Department of Research and Development yannis.papanikolaou@healx.io

3. University of Maryland, Baltimore County Department of Information Systems jfoulds@umbc.edu

Abstract

Abstract We propose a method that uses neural embeddings to improve the performance of any given LDA-style topic model. Our method, called neural embedding allocation (NEA), deconstructs topic models (LDA or otherwise) into interpretable vector-space embeddings of words, topics, documents, authors, and so on, by learning neural embeddings to mimic the topic model. We demonstrate that NEA improves coherence scores of the original topic model by smoothing out the noisy topics when the number of topics is large. Furthermore, we show NEA’s effectiveness and generality in deconstructing and smoothing LDA, author-topic models, and the recent mixed membership skip-gram topic model and achieve better performance with the embeddings compared to several state-of-the-art models.

Publisher

MIT Press

Subject

Artificial Intelligence,Computer Science Applications,Linguistics and Language,Language and Linguistics

Link

https://direct.mit.edu/coli/article-pdf/48/4/1021/2061857/coli_a_00457.pdf

Reference64 articles.

1. Big data’s disparate impact;Barocas;California Law Review,2016

2. On the dangers of stochastic parrots: Can language models be too big?;Bender,2021

3. A neural probabilistic language model;Bengio;Journal of Machine Learning Research,2003

4. Latent Dirichlet allocation;Blei;Journal of Machine Learning Research,2003

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Explainable paper classification system using topic modeling and SHAP;Intelligent Data Analysis;2024-08-01

2. CoTE: A Flexible Method for Joint Learning of Topic and Embedding Models;Lecture Notes in Computer Science;2024