A Sememe Prediction Method Based on the Central Word of a Semantic Field-Reference-Cited by-同舟云学术

A Sememe Prediction Method Based on the Central Word of a Semantic Field

Published:2024-01-19 Issue:2 Volume:13 Page:413
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Luo Guanran¹^ORCID,Cui Yunpeng¹^ORCID

Affiliation:

1. Agriculture Information Institute, Chinese Academy of Agricultural Sciences, Beijing 100081, China

Abstract

A “sememe” is an indivisible minimal unit of meaning in linguistics. Manually annotating sememes in words requires a significant amount of time, so automated sememe prediction is often used to improve efficiency. Semantic fields serve as crucial mediators connecting the semantics between words. This paper proposes an unsupervised method for sememe prediction based on the common semantics between words and semantic fields. In comparison to methods based on word vectors, this approach demonstrates a superior ability to align the semantics of words and sememes. We construct various types of semantic fields through ChatGPT and design a semantic field selection strategy to adapt to different scenario requirements. Subsequently, following the order of word–sense–sememe, we decompose the process of calculating the semantic sememe similarity between semantic fields and target words. Finally, we select the word with the highest average semantic sememe similarity as the central word of the semantic field, using its semantic primes as the predicted result. On the BabelSememe dataset constructed based on the sememe knowledge base HowNet, the method of semantic field central word (SFCW) achieved the best results for both unstructured and structured sememe prediction tasks, demonstrating the effectiveness of this approach. Additionally, we conducted qualitative and quantitative analyses on the sememe structure of the central word.

Funder

NSTL

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/2/413/pdf

Reference36 articles.

1. A Set of Postulates for the Science of Language;Bloomfield;Language,1926

2. Dong, Z., and Dong, Q. (2003, January 21–23). HowNet—A hybrid language and knowledge resource. Proceedings of the International Conference on Natural Language Processing and Knowledge Engineering, Beijing, China.

3. Barzilay, R., and Kan, M.Y. (August, January 30). Improved Word Representation Learning with Sememes. Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Vancouver, BC, Canada.

4. Fan, M., Zhang, Y., and Li, J. (2015, January 15–17). Word similarity computation based on HowNet. Proceedings of the 2015 12th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), Zhangjiajie, China.

5. Hu, F.S., and Guo, Y. (2012, January 25–27). An improved algorithm of word similarity computation based on HowNet. Proceedings of the 2012 IEEE International Conference on Computer Science and Automation Engineering (CSAE), Zhangjiajie, China.