A Fast Weighted Fuzzy C-Medoids Clustering for Time Series Data Based on P-Splines-Reference-Cited by-同舟云学术

A Fast Weighted Fuzzy C-Medoids Clustering for Time Series Data Based on P-Splines

Published:2022-08-17 Issue:16 Volume:22 Page:6163
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Xu Jiucheng,Hou Qinchen,Qu Kanglin,Sun Yuanhao^ORCID,Meng Xiangru

Abstract

The rapid growth of digital information has produced massive amounts of time series data on rich features and most time series data are noisy and contain some outlier samples, which leads to a decline in the clustering effect. To efficiently discover the hidden statistical information about the data, a fast weighted fuzzy C-medoids clustering algorithm based on P-splines (PS-WFCMdd) is proposed for time series datasets in this study. Specifically, the P-spline method is used to fit the functional data related to the original time series data, and the obtained smooth-fitting data is used as the input of the clustering algorithm to enhance the ability to process the data set during the clustering process. Then, we define a new weighted method to further avoid the influence of outlier sample points in the weighted fuzzy C-medoids clustering process, to improve the robustness of our algorithm. We propose using the third version of mueen’s algorithm for similarity search (MASS 3) to measure the similarity between time series quickly and accurately, to further improve the clustering efficiency. Our new algorithm is compared with several other time series clustering algorithms, and the performance of the algorithm is evaluated experimentally on different types of time series examples. The experimental results show that our new method can speed up data processing and the comprehensive performance of each clustering evaluation index are relatively good.

Funder

National Natural Science Foundation of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/16/6163/pdf

Reference43 articles.

1. Clustering of time series data—a survey

2. A review on time series data mining

3. Time-series clustering – A decade review

4. Time series clustering;Caiado,2015

5. Unsupervised Curve Clustering using B-Splines

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Perspective Chapter: Enhancing Regression Analysis with Splines and Machine Learning – Evaluation of How to Capture Complex Non-Linear Multidimensional Variables;Nonlinear Systems and Matrix Analysis - Recent Advances in theory and Applications [Working Title];2024-09-11

2. Equivalence partition based morphological similarity clustering for large-scale time series;Scientific Reports;2023-04-11