Using the Ship-Gram Model for Japanese Keyword Extraction Based on News Reports-Reference-Cited by-同舟云学术

Using the Ship-Gram Model for Japanese Keyword Extraction Based on News Reports

Published:2021-04-16 Issue: Volume:2021 Page:1-9
ISSN:1099-0526
Container-title:Complexity
language:en
Short-container-title:Complexity

Author:

Teng Miao¹^ORCID

Affiliation:

1. College of Foreign Languages, Anyang Normal University, Anyang, Henan 455001, China

Abstract

In this paper, we conduct an in-depth study of Japanese keyword extraction from news reports, train external computer document word sets from text preprocessing into word vectors using the Ship-gram model in the deep learning tool Word2Vec, and calculate the cosine distance between word vectors. In this paper, the sliding window in TextRank is designed to connect internal document information to improve the in-text semantic coherence. The main idea is to use not only the statistical and structural features of words but also the semantic features of words extracted through word-embedding techniques, i.e., multifeature fusion, to obtain the importance weights of words themselves and the attraction weights between words and then iteratively calculate the final weight of each word through the graph model algorithm to determine the extracted keywords. To verify the performance of the algorithm, extensive simulation experimental studies were conducted on three different types of datasets. The experimental results show that the proposed keyword extraction algorithm can improve the performance by a maximum of 6.45% and 20.36% compared with the existing word frequency statistics and graph model methods, respectively; MF-Rank can achieve a maximum performance improvement of 1.76% compared with PW-TF.

Publisher

Hindawi Limited

Subject

Multidisciplinary,General Computer Science

Link

http://downloads.hindawi.com/journals/complexity/2021/9965843.pdf

Reference26 articles.

1. News audience fragmentation in the Japanese Twittersphere

2. An Analysis of Web Coverage on the 2018 West Japan Heavy Rain Disaster

3. A hybrid two-stage financial stock forecasting algorithm based on clustering and ensemble learning

4. Event mining and timeliness analysis from heterogeneous news streams

5. Conceptual extraction of compound Korean keywords;S. S. Lee;Journal of Information Processing Systems,2020

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Retracted: Using the Ship-Gram Model for Japanese Keyword Extraction Based on News Reports;Complexity;2024-01-24

2. Keyword Extraction Method Based on Graph Attention Network;2023 IEEE 5th International Conference on Civil Aviation Safety and Information Technology (ICCASIT);2023-10-11