Research on Keyword Extraction Algorithm in English Text Based on Cluster Analysis-Reference-Cited by-同舟云学术

Research on Keyword Extraction Algorithm in English Text Based on Cluster Analysis

Published:2022-03-28 Issue: Volume:2022 Page:1-8
ISSN:1687-5273
Container-title:Computational Intelligence and Neuroscience
language:en
Short-container-title:Computational Intelligence and Neuroscience

Author:

Ma Jingxia¹^ORCID

Affiliation:

1. School of Western Languages and Cultures, Harbin Normal University, Harbin 150025, China

Abstract

How to facilitate users to quickly and accurately search for the text information they need is a current research hotspot. Text clustering can improve the efficiency of information search and is an effective text retrieval method. Keyword extraction and cluster center point selection are key issues in text clustering research. Common keyword extraction algorithms can be divided into three categories: semantic-based algorithms, machine learning-based algorithms, and statistical model-based algorithms. There are three common methods for selecting cluster centers: randomly selecting the initial cluster center point, manually specifying the cluster center point, and selecting the cluster center point according to the similarity between the points to be clustered. The randomly selected initial cluster center points may contain “outliers,” and the clustering results are locally optimal. Manually specifying the cluster center points will be very subjective because each person’s understanding of the text set is different, and it is not suitable for the case of a large number of text sets. Selecting the cluster center points according to the similarity between the points to be clustered can make the selected cluster center points distributed in each class and be as close as possible to the class center points, but it takes a long time to calculate the cluster centers. Aiming at this problem, this paper proposes a keyword extraction algorithm based on cluster analysis. The results show that the algorithm does not rely on background knowledge bases, dictionaries, etc., and obtains statistical parameters and builds models through training. Experiments show that the keyword extraction algorithm has high accuracy and can quickly extract the subject content of an English translation.

Publisher

Hindawi Limited

Subject

General Mathematics,General Medicine,General Neuroscience,General Computer Science

Link

http://downloads.hindawi.com/journals/cin/2022/4293102.pdf

Reference37 articles.

1. Effective approaches for extraction of keywords;J. Kaur;International Journal of Computer Science Issues (IJCSI),2010

2. Automatic keyword extraction from documents using conditional random fields;C. Zhang;Journal of Computational Information Systems,2008

3. Keyword Extraction Using Support Vector Machine

4. Artificial intelligence for decision making in the era of Big Data – evolution, challenges and research agenda

5. Keyword and Keyphrase Extraction Techniques: A Literature Review

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Web of Science Veri Tabanında Bibliyometrik Bir Araştırma: İş Güvenliği Makaleleri;Journal of Turkish Operations Management;2024-07-18

2. Recent applications and prospects of omega-3 fatty acids: A bibliometric study and visualization analysis in 2014–2023;Prostaglandins, Leukotrienes and Essential Fatty Acids;2024-02

3. Transesophageal echocardiography: A bibliometric analysis from 1979 to 2022;Echocardiography;2024-02

4. Deep-KeywordNet: automated english keyword extraction in documents using deep keyword network based ranking;Multimedia Tools and Applications;2024-01-30

5. Trends and hotspots of acupuncture for allergic rhinitis: A bibliometric analysis from 2002 to 2022;Complementary Therapies in Medicine;2023-11