An Improved Similarity Matching based Clustering Framework for Short and Sentence Level Text

Published:2017-02-01 Issue:1 Volume:7 Page:551
ISSN:2088-8708
Container-title:International Journal of Electrical and Computer Engineering (IJECE)
Short-container-title:IJECE

Author:

Basha M. John,Kaliyamurthie K.P.

Abstract

Text clustering plays a key role in navigation and browsing. Efficient text clustering groups a large amount of information into meaningful clusters. Many text clustering techniques do not address issues such as high time and space complexity, inability to capture the relational and contextual attributes of words, low robustness, and risks related to privacy exposure. To address these issues, an efficient text-based clustering framework is proposed. The Reuters dataset is chosen as the input dataset. Once the input dataset is preprocessed, the similarity between words is computed using cosine similarity. The similarities between the components are compared and the vector data is created. From the vector data, the clustering particle is computed. To optimize the clustering results, mutation is applied to the vector data. The performance of the proposed text-based clustering framework is analyzed using metrics such as Mean Square Error (MSE), Peak Signal-to-Noise Ratio (PSNR), and processing time. The experimental results show that the proposed framework produced better MSE, PSNR, and processing time than the existing Fuzzy C-Means (FCM) and Pairwise Random Swap (PRS) methods.
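The cosine similarity step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say how the texts are turned into vectors, so the whitespace tokenization and raw term counts below are assumptions, and the helper names `term_vector` and `cosine_similarity` and the three sample texts are hypothetical. This is not the authors' implementation.

```python
import math
from collections import Counter


def term_vector(text):
    """Lowercase, split on whitespace, and count term occurrences.

    Assumption: a plain bag-of-words representation; the paper's own
    preprocessing is not described in the abstract.
    """
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse term-count vectors."""
    # Counter returns 0 for missing terms, so only shared terms contribute.
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical short texts, not taken from the Reuters dataset.
docs = [
    "oil prices rise",
    "oil prices fall",
    "wheat harvest grows",
]
vectors = [term_vector(d) for d in docs]
sim_oil = cosine_similarity(vectors[0], vectors[1])    # two shared terms
sim_mixed = cosine_similarity(vectors[0], vectors[2])  # no shared terms
```

Texts that share most of their terms score close to 1 and texts with no terms in common score 0, which is the kind of pairwise signal a similarity-based clustering step can group on.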

Publisher

Institute of Advanced Engineering and Science

Subject

Electrical and Electronic Engineering,General Computer Science

Cited by 2 articles.

1. A comparative study of lexical similarity approaches for Malay short text;AIP Conference Proceedings;2023

2. A Survey of Text Matching Techniques;Engineering, Technology & Applied Science Research;2021-02-06