Affiliation:
1. Department of Computer Engineering, Adıyaman University, Adıyaman 02040, Türkiye
2. MOBILERS Team, Sivas Cumhuriyet University, Sivas 58140, Türkiye
Abstract
This study conducted a detailed comparative analysis of various machine learning models to enhance wind energy forecasts, including linear regression, decision tree, random forest, gradient boosting machine, XGBoost, LightGBM, and CatBoost. Furthermore, it developed an end-to-end MLOps pipeline leveraging SCADA data from a wind turbine in Türkiye. This research not only compared models using the RMSE metric for selection and optimization but also explored in detail the impact of integrating machine learning with MLOps on the precision of energy production forecasts. It investigated the suitability and efficiency of ML models in predicting wind energy with MLOps integration. The study explored ways to improve LightGBM algorithm performance through hyperparameter tuning and Docker utilization. It also highlighted challenges in speeding up MLOps development and deployment processes. Model performance was assessed using the RMSE metric, conducting a comparative evaluation across different models. The findings revealed that the RMSE values among the regression models ranged from 460 kW to 192 kW. Focusing on enhancing LightGBM, the research decreased the RMSE value to 190.34 kW. Despite facing technical and operational hurdles, the implementation of MLOps was proven to enhance the speed (latency of 9 ms), reliability (through Docker encapsulation), and scalability (using Docker swarm) of machine learning endeavors.
Funder
European Union’s Horizon Europe research and innovation program
Reference42 articles.
1. (2024, March 18). Internet: Republic of Türkiye Ministry of Energy and Natural Resources, Available online: https://enerji.gov.tr/eigm-yenilenebilir-enerji-kaynaklar-ruzgar.
2. McKinnon, C., Carroll, J., McDonald, A., Koukoura, S., Infield, D., and Soraghan, C. (2020). Comparison of New Anomaly Detection Technique for Wind Turbine Condition Monitoring Using Gearbox SCADA Data. Energies, 13.
3. Alla, S., and Adari, S.K. (2021). Beginning MLOps with MLFlow: Deploy Models in AWS SageMaker, Google Cloud, and Microsoft Azure, Apress.
4. Pendyala, V. (2020). Tools and Techniques for Software Development in Large Organizations: Emerging Research and Opportunities, IGI Global.
5. The machine learning life cycle and the cloud: Implications for drug discovery;Spjuth;Expert Opin. Drug Discov.,2021
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Analysis of SARIMA Models for Forecasting Electricity Demand;2024 12th International Conference on Smart Grid (icSmartGrid);2024-05-27