Abstract
Despite the wide diffusion and use of embeddings generated through Word2Vec, many open questions remain about the reasons for its results and about its real capabilities. In particular, to our knowledge, no author seems to have analysed in detail how learning is affected by the various choices of hyperparameters. In this work, we try to shed some light on these issues, focusing on a typical dataset. We show that the learning rate prevents the exact mapping of the co-occurrence matrix, that Word2Vec is unable to learn syntactic relationships, and that it does not suffer from overfitting. Furthermore, by building an ad hoc network, we show that Word2Vec can be improved directly on the analogies, obtaining very high accuracy without damaging the pre-existing embedding. This analogy-enhanced Word2Vec may be convenient in various NLP scenarios, but it is used here as an optimal starting point for evaluating the limits of Word2Vec.
Funder
Università degli Studi della Campania Luigi Vanvitelli
Publisher
Springer Science and Business Media LLC
Subject
Hardware and Architecture, Information Systems, Theoretical Computer Science, Software
Cited by
40 articles.