Affiliation:
1. Yunnan Electric Power Dispatching and Control Center, Kunming 650011, China
2. School of Energy and Power Engineering, Huazhong University of Science and Technology, Wuhan 430074, China
3. China-EU Institute for Clean and Renewable, Huazhong University of Science and Technology, Wuhan 430074, China
Abstract
The development in the fields of clean energy, particularly wind and photovoltaic power, generates a large amount of data streams, and how to mine valuable information from these data to improve the efficiency of power generation has become a hot spot of current research. Traditional classification algorithms cannot cope with dynamically changing data streams, so data stream classification techniques are particularly important. The current data stream classification techniques mainly include decision trees, neural networks, Bayesian networks, and other methods, which have been applied to wind power and photovoltaic power data processing in existing research. However, the data drift problem is gradually highlighted due to the dynamic change in data, which significantly impacts the performance of classification algorithms. This paper reviews the latest research on data stream classification technology in wind power and photovoltaic applications. It provides a detailed introduction to the data drift problem in machine learning, which significantly affects algorithm performance. The discussion covers covariate drift, prior probability drift, and concept drift, analyzing their potential impact on the practical deployment of data stream classification methods in wind and photovoltaic power sectors. Finally, by analyzing examples for addressing data drift in energy-system data stream classification, the article highlights the future prospects of data drift research in this field and suggests areas for improvement. Combined with the systematic knowledge of data stream classification techniques and data drift handling presented, it offers valuable insights for future research.
Funder
Science and Technology Program of China Southern Power Grid Co., Ltd.
Reserve Talents Program for Middle-Aged and Young Leaders of Disciplines in Science and Technology of Yunnan Province, China
Reference105 articles.
1. IRENA (2024, August 01). Renewable Electricity Capacity and Generation Statistics. Available online: https://www.irena.org/-/media/Files/IRENA/Agency/Publication/2023/Mar/IRENA_RE_Capacity_Statistics_2023.pdf.
2. Data Streams Mining Techniques and its Application in Simulation System;Bin;Comput. Sci.,2009
3. Ensemble learning for data stream analysis: A survey;Krawczyk;Inf. Fusion,2017
4. Hossain, M.L., Abu-Siada, A., and Muyeen, S.M. (2018). Methods for Advanced Wind Turbine Condition Monitoring and Early Diagnosis: A Literature Review. Energies, 11.
5. Ozturk, S. (2019). Forecasting Wind Turbine Failures and Associated Costs: Investigating Failure Causes, Effects and Criticalities, Modeling Reliability and Predicting Time-to-Failure, Time-to-Repair and Cost of Failures for Wind Turbines Using Reliability Methods and Machine Learning Techniques, ProQuest LLC.