WORD2VEC NOT DEAD: PREDICTING HYPERNYMS OF CO-HYPONYMS IS BETTER THAN READING DEFINITIONS

Published: 2020
ISSN: 2075-7182
Container-title: Computational Linguistics and Intellectual Technologies

Author:

Arefyev N. V., Fedoseev M. V., Kabanov A. V., Zizov V. S.

Abstract

Expert-built lexical resources are known to provide information of good quality at the cost of low coverage. This property limits their applicability in modern NLP applications. Manually building descriptions of lexical-semantic relations in sufficient volume requires a huge amount of qualified human labour. However, given that some initial version of a taxonomy is already built, automatic or semi-automatic taxonomy enrichment systems can greatly reduce the required effort. We propose and experiment with two approaches to taxonomy enrichment, one utilizing information from word definitions and another from word usages, as well as a combination of the two. The first method retrieves co-hyponyms for the target word from distributional semantic models (word2vec) or language models (XLM-R), then looks for hypernyms of these co-hyponyms in the taxonomy. The second method tries to extract hypernyms directly from Wiktionary definitions. The proposed methods were evaluated on the Dialogue-2020 shared task on taxonomy enrichment. We found that predicting hypernyms of co-hyponyms achieves better results in this task. The combination of both methods improves results further and is among the 3 best-performing systems for verbs. An important part of the work is a detailed qualitative and error analysis of the proposed methods, which provides interesting observations about their behaviour and ideas for future work.
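
The co-hyponym approach described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the toy vectors stand in for word2vec embeddings, the small `HYPERNYMS` dictionary stands in for the taxonomy, and the similarity-weighted voting used to rank candidates is one plausible choice, since the abstract does not say how candidate hypernyms are scored.

```python
from collections import Counter
from math import sqrt

# Hypothetical toy embeddings standing in for word2vec vectors.
VECTORS = {
    "poodle": [0.9, 0.1, 0.0],    # target word, absent from the taxonomy
    "beagle": [0.8, 0.2, 0.1],
    "terrier": [0.85, 0.15, 0.05],
    "sparrow": [0.1, 0.9, 0.2],
    "oak": [0.0, 0.2, 0.9],
}

# Hypothetical toy taxonomy: word -> its direct hypernyms.
HYPERNYMS = {
    "beagle": ["dog", "hound"],
    "terrier": ["dog"],
    "sparrow": ["bird"],
    "oak": ["tree"],
}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def predict_hypernyms(target, k=2, top_n=3):
    """Rank hypernyms of the k nearest in-taxonomy neighbours of `target`."""
    target_vec = VECTORS[target]
    # Step 1: treat the nearest taxonomy words as candidate co-hyponyms.
    scored = sorted(
        ((cosine(target_vec, VECTORS[w]), w)
         for w in HYPERNYMS if w != target and w in VECTORS),
        reverse=True,
    )[:k]
    # Step 2: collect their hypernyms, weighting each vote by similarity
    # (an assumed scoring scheme, not the paper's).
    votes = Counter()
    for sim, word in scored:
        for hypernym in HYPERNYMS[word]:
            votes[hypernym] += sim
    return [h for h, _ in votes.most_common(top_n)]


print(predict_hypernyms("poodle"))  # ['dog', 'hound']
```

Here "dog" ranks first because both nearest neighbours of the target share it as a hypernym, which is the intuition behind looking up hypernyms of co-hyponyms rather than of the target word itself.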

Publisher

Russian State University for the Humanities

Cited by 3 articles.

1. Taxonomy enrichment with text and graph vector representations;Semantic Web;2022-04-06

2. Taxonomy Enrichment with Text and Graph Vector Representation;Lecture Notes in Computer Science;2022

3. Using Embedding-Based Similarities to Improve Lexical Resources;Lobachevskii Journal of Mathematics;2021-07