Affiliation:
1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, China
Abstract
Supervised keyword extraction methods usually require a large human-annotated corpus to train the model. Expensive manual labeling has made unsupervised technology using word graph networks attractive. Traditional word graph networks simply consider the co-occurrence relationship of words or the topological structure of the network, ignoring the influence of semantic relations between words on keyword extraction. To solve these problems, an unsupervised keyword extraction method based on word graph networks for both Chinese and English is proposed. This method uses word embedding to applying a “word attraction score” to semantic relevance between words in a document. Combination of the bias weight of the node and a weighted PageRank algorithm is used to compute the final scores of words. The experimental results demonstrate that the method is more effective than the traditional methods.
Reference39 articles.
1. NE-Rank: A Novel Graph-Based Keyphrase Extraction in Twitter
2. Simple Unsupervised Keyphrase Extraction using Sentence Embeddings
3. A graph based keyword extraction model using collective node weight
4. Bougouin, A., Boudin, F., & Daille, B. (2013). Topicrank: Graph-based topic ranking for keyphrase extraction. In Proceedings of the International Joint Conference on Natural Language Processing (IJCNLP) (pp. 543-551). Academic Press.
5. Brin, S., & Page, L. (1998). The anatomy of a large-scale hypertextual web search engine. Computer networks and ISDN systems, 30(1-7), 107-117.
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献