Detection of Temporal Shifts in Semantics Using Local Graph Clustering

Author:

Hwang NeilORCID,Chatterjee ShirshenduORCID,Di Yanming,Bhattacharyya Sharmodeep

Abstract

Many changes in our digital corpus have been brought about by the interplay between rapid advances in digital communication and the current environment characterized by pandemics, political polarization, and social unrest. One such change is the pace with which new words enter the mass vocabulary and the frequency at which meanings, perceptions, and interpretations of existing expressions change. The current state-of-the-art algorithms do not allow for an intuitive and rigorous detection of these changes in word meanings over time. We propose a dynamic graph-theoretic approach to inferring the semantics of words and phrases (“terms”) and detecting temporal shifts. Our approach represents each term as a stochastic time-evolving set of contextual words and is a count-based distributional semantic model in nature. We use local clustering techniques to assess the structural changes in a given word’s contextual words. We demonstrate the efficacy of our method by investigating the changes in the semantics of the phrase “Chinavirus”. We conclude that the term took on a much more pejorative meaning when the White House used the term in the second half of March 2020, although the effect appears to have been temporary. We make both the dataset and the code used to generate this paper’s results available.

Funder

NSF DMS

PSC-CUNY Enhanced Research Award

Publisher

MDPI AG

Subject

General Economics, Econometrics and Finance

Reference55 articles.

1. Liebeskind, C., Dagan, I., and Schler, J. (2012, January 7–8). Statistical thesaurus construction for a morphologically rich language. Proceedings of the Sixth International Workshop on Semantic Evaluation, Montréal, QC, Canada.

2. Zaragoza, M.Q., Torres, L.S., and Basdevant, J. (2020, January 11–16). Translating Knowledge Representations with Monolingual Word Embeddings: The Case of a Thesaurus on Corporate Non-Financial Reporting. Proceedings of the 6th International Workshop on Computational Terminology, Marseille, France.

3. Loukachevitch, N., and Parkhomenko, E. (2019, January 23–27). Thesaurus Verification Based on Distributional Similarities. Proceedings of the 10th Global Wordnet Conference, Wroclaw, Poland.

4. Improving distributional similarity with lessons learned from word embeddings;Levy;Trans. Assoc. Comput. Linguist.,2015

5. Baroni, M., Dinu, G., and Kruszewski, G. (2014, January 22–27). Don’t count, predict! a systematic comparison of context-counting vs. context-predicting semantic vectors. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Baltimore, MD, USA.

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Lifelong Machine Learning for Topic Modeling Based on Hellinger Distance;2023 International Joint Conference on Neural Networks (IJCNN);2023-06-18

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3