Affiliation:
1. Intelligent Power Equipment Technology Research Center, Wuhan University, Wuhan 430072, China
2. School of Power and Mechanical Engineering, Wuhan University, Wuhan 430072, China
3. School of Water Resources and Hydropower Engineering, Wuhan University, Wuhan 430072, China
Abstract
To address quality problems such as anomalies and missing values in the condition monitoring data of hydropower units, this paper proposes a monitoring data quality enhancement method based on HDBSCAN-WSGAIN-GP, which combines the strengths of density-based clustering and generative adversarial networks to improve the quality and usability of the monitoring data. First, the monitoring data are grouped by density level using the HDBSCAN clustering method in combination with the operating conditions, and the anomalies in the dataset are adaptively detected, identified and cleaned. Then, drawing on the strength of the WSGAIN-GP model in data imputation, the missing values in the cleaned data are generated automatically through unsupervised learning of the features and distribution of the real monitoring data. The method is validated on an online monitoring dataset from units in actual operation. Comparison experiments show that the silhouette coefficient (SCI) of the HDBSCAN-based anomaly detection model reaches 0.4935, higher than that of the other models compared, indicating that the proposed model is better at separating valid samples from anomalous samples. The probability density distribution of the data imputed by the WSGAIN-GP model is similar to that of the measured data, and the KL divergence, JS divergence and Hellinger distance between the distributions of the imputed data and the original data are close to 0. Compared with imputation methods such as SGAIN, GAIN and KNN, WSGAIN-GP has the lowest RMSE under the different missing rates tested, indicating that the proposed imputation model has good accuracy and generalization. The results provide a high-quality data basis for subsequent trend prediction and state warning.
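The evaluation measures named in the abstract (KL, JS and Hellinger measures between the filled and measured distributions, and RMSE on the missing entries) can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes both distributions have already been reduced to histograms over the same bins, and the function names, the `eps` smoothing term and the sample counts are hypothetical choices made for this example.

```python
import math


def normalize(counts):
    """Turn histogram counts into a discrete probability distribution."""
    total = float(sum(counts))
    return [c / total for c in counts]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for discrete distributions on the same bins.

    eps is an assumed smoothing term that avoids log(0) when q has empty bins.
    """
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric, bounded above by ln(2)."""
    m = [(pi + qi) / 2 for pi, qi in zip(p, q)]
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)


def hellinger_distance(p, q):
    """Hellinger distance: 0 for identical distributions, 1 for disjoint ones."""
    s = sum((math.sqrt(pi) - math.sqrt(qi)) ** 2 for pi, qi in zip(p, q))
    return math.sqrt(s / 2)


def masked_rmse(truth, filled, missing_mask):
    """RMSE computed only over the entries that were missing and then filled."""
    errs = [(t - f) ** 2 for t, f, m in zip(truth, filled, missing_mask) if m]
    return math.sqrt(sum(errs) / len(errs))


# Hypothetical histogram counts for measured and filled data (same bins).
measured = normalize([10, 20, 40, 20, 10])
filled = normalize([11, 19, 41, 19, 10])

print("KL:", kl_divergence(measured, filled))
print("JS:", js_divergence(measured, filled))
print("Hellinger:", hellinger_distance(measured, filled))
print("RMSE:", masked_rmse([1.0, 2.0, 3.0, 4.0],
                           [1.0, 2.5, 3.0, 3.5],
                           [False, True, False, True]))
```

All three distribution measures are 0 when the two histograms coincide, which is why values close to 0 are read as evidence that the filled data follow the measured distribution. Restricting the RMSE to the masked entries keeps the observed values from diluting the error.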
Funder
National Key Research and Development Program of China
Subject
Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry