Graph-based exploration and clustering analysis of semantic spaces-Reference-Cited by-同舟云学术

Graph-based exploration and clustering analysis of semantic spaces

Published:2019-11-13 Issue:1 Volume:4 Page:
ISSN:2364-8228
Container-title:Applied Network Science
language:en
Short-container-title:Appl Netw Sci

Author:

Veremyev Alexander,Semenov Alexander,Pasiliao Eduardo L.,Boginski Vladimir

Abstract

Abstract The goal of this study is to demonstrate how network science and graph theory tools and concepts can be effectively used for exploring and comparing semantic spaces of word embeddings and lexical databases. Specifically, we construct semantic networks based on word2vec representation of words, which is “learnt” from large text corpora (Google news, Amazon reviews), and “human built” word networks derived from the well-known lexical databases: WordNet and Moby Thesaurus. We compare “global” (e.g., degrees, distances, clustering coefficients) and “local” (e.g., most central nodes and community-type dense clusters) characteristics of considered networks. Our observations suggest that human built networks possess more intuitive global connectivity patterns, whereas local characteristics (in particular, dense clusters) of the machine built networks provide much richer information on the contextual usage and perceived meanings of words, which reveals interesting structural differences between human built and machine built semantic networks. To our knowledge, this is the first study that uses graph theory and network science in the considered context; therefore, we also provide interesting examples and discuss potential research directions that may motivate further research on the synthesis of lexicographic and machine learning based tools and lead to new insights in this area.

Publisher

Springer Science and Business Media LLC

Subject

Computational Mathematics,Computer Networks and Communications,Multidisciplinary

Link

http://link.springer.com/content/pdf/10.1007/s41109-019-0228-y.pdf

Reference76 articles.

1. Abbott, JT, Austerweil JL, Griffiths TL (2015) Random walks on semantic networks can resemble optimal foraging. Psychol Rev 122(3):558–569.

2. Abello, J, Pardalos PM, Resende MGC (1999) On maximum clique problems in very large graphs. In: Abello J Vitter J (eds)External Memory Algorithms and Visualization, 119–130.. American Mathematical Society, Boston.

3. Abello, J, Resende MGC, Sudarsky S (2002) Massive quasi-clique detection. In: Rajsbaum S (ed)LATIN 2002: Theoretical Informatics, 598–612.. Springer-Verlag, London.

4. Altuncu, MT, Mayer E, Yaliraki SN, Barahona M (2019) From free text to clusters of content in health records: an unsupervised graph partitioning approach. Applied Network Science 4(1):2.

5. Amazon Reviews dataset (2017) Unlocked Mobile Phones. https://www.kaggle.com/PromptCloudHQ/amazon-reviews-unlocked-mobile-phones. Last accessed 15 Feb 2019.

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Using dynamic knowledge graphs to detect emerging communities of knowledge;Knowledge-Based Systems;2024-06

2. Ranking influential nodes in complex network using edge weight degree based shell decomposition;Journal of Computational Science;2023-12

3. Feature-rich multiplex lexical networks reveal mental strategies of early language learning;Scientific Reports;2023-01-26

4. Socio-Semantic Network Motifs Framework for Discourse Analysis;LAK22: 12th International Learning Analytics and Knowledge Conference;2022-03-21

5. Sustainable development goals: conceptualization, communication and achievement synergies in a complex network framework;Applied Network Science;2022-03-14