Discovering research topics from library electronic references using latent Dirichlet allocation

Published: 2018-02-19 Issue: 3 Volume: 36 Page: 400-410
ISSN: 0737-8831
Container-title: Library Hi Tech
Language: en
Short-container-title: LHT

Author:

Fang Debin, Yang Haixia, Gao Baojun, Li Xiaojun

Abstract

Purpose: Discovering research topics and trends from a large quantity of library electronic references is essential for scientific research. Current research of this kind depends mainly on human judgment. The purpose of this paper is to demonstrate how to identify research topics and their evolving trends from library electronic references efficiently and effectively by employing automatic text analysis algorithms.

Design/methodology/approach: The authors used latent Dirichlet allocation (LDA), a probabilistic generative topic model, to extract latent topics from a large quantity of research abstracts. The authors then conducted a regression analysis on the document-topic distributions generated by LDA to identify hot and cold topics.

Findings: First, this paper discovers 32 significant research topics from the abstracts of 3,737 articles published in six top accounting journals between 1992 and 2014. Second, based on the document-topic distributions generated by LDA, the authors identified seven hot topics and six cold topics among the 32 topics.

Originality/value: The topics discovered by LDA are highly consistent with the topics identified by human experts, indicating the validity and effectiveness of the methodology. This paper therefore provides novel knowledge to the accounting literature and demonstrates a methodology and process for topic discovery with lower cost and higher efficiency than current methods.
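The abstract does not give the regression specification, so the following is only a minimal sketch of how hot and cold topics might be flagged from document-topic distributions. It assumes the distributions have already been produced by some LDA implementation, that each topic's yearly mean proportion is regressed on publication year with ordinary least squares, and that a topic counts as hot or cold when the slope's t-statistic exceeds a threshold. The function names and the `t_crit` parameter are hypothetical and are not taken from the paper.

```python
import math
from collections import defaultdict


def yearly_topic_means(doc_topics, years):
    """Average each topic's proportion over the documents published in each year."""
    by_year = defaultdict(list)
    for theta, year in zip(doc_topics, years):
        by_year[year].append(theta)
    n_topics = len(doc_topics[0])
    return {
        year: [sum(t[k] for t in thetas) / len(thetas) for k in range(n_topics)]
        for year, thetas in by_year.items()
    }


def slope_and_t(xs, ys):
    """OLS slope of ys on xs and its t-statistic (needs at least 3 points)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    # Residuals are computed on centered data for numerical stability.
    rss = sum(((y - my) - slope * (x - mx)) ** 2 for x, y in zip(xs, ys))
    se = math.sqrt(rss / (n - 2) / sxx)
    if se == 0.0:
        t = 0.0 if slope == 0.0 else math.copysign(math.inf, slope)
    else:
        t = slope / se
    return slope, t


def classify_topics(doc_topics, years, t_crit=2.0):
    """Label each topic 'hot', 'cold', or 'stable' from its yearly trend.

    t_crit is an assumed cutoff; a stricter analysis would take the critical
    value from a t-distribution with (number of years - 2) degrees of freedom.
    """
    means = yearly_topic_means(doc_topics, years)
    xs = sorted(means)
    labels = []
    for k in range(len(doc_topics[0])):
        _, t = slope_and_t(xs, [means[y][k] for y in xs])
        labels.append("hot" if t > t_crit else "cold" if t < -t_crit else "stable")
    return labels


# Toy data (invented for illustration): one document per year, three topics.
toy_years = [2000, 2001, 2002, 2003, 2004]
toy_doc_topics = [
    [0.10, 0.60, 0.30],
    [0.22, 0.50, 0.28],
    [0.29, 0.40, 0.31],
    [0.41, 0.30, 0.29],
    [0.50, 0.20, 0.30],
]
toy_labels = classify_topics(toy_doc_topics, toy_years)
```

In this toy input the first topic's share rises steadily, the second falls, and the third fluctuates around 0.30, so `toy_labels` separates them accordingly. Averaging by year before regressing is one design choice among several; regressing per-document proportions directly on year would be an equally plausible reading of the abstract.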

Publisher

Emerald

Subject

Library and Information Sciences, Information Systems

References: 45 articles.

1. Multilevel image coding with hyperfeatures; International Journal of Computer Vision, 2008

2. Co-word analysis of the trends in stem cells field based on subject heading weighting; Scientometrics, 2011

3. Simultaneously discovering and quantifying risk types from textual risk disclosures; Management Science, 2014

4. Probabilistic topic models; Communications of the ACM, 2012

5. Dynamic topic models, 2006

Cited by 19 articles.

1. Temporal analysis of topic modeling output by machine learning techniques; International Journal of Data Science and Analytics; 2024-07-02

2. Latent Dirichlet Allocation (LDA) topic models for Space Syntax studies on spatial experience; City, Territory and Architecture; 2024-01-09

3. Research-practice gap in accounting journals? A topic modeling approach; Journal of Accounting Literature; 2023-08-18

4. Research on the Automatic Subject-Indexing Method of Academic Papers Based on Climate Change Domain Ontology; Sustainability; 2023-02-21

5. Identifying Hot Information Security Topics Using LDA and Multivariate Mann-Kendall Test; IEEE Access; 2023