Author:
Aziz Abdul,Mahmood Danish,Qureshi Muhammad Shuaib,Qureshi Muhammad Bilal,Kim Kyungsup
Abstract
The economy of a country is directly proportional to the power sector of that country. An unmanaged power sector causes instability in the country. Pakistan is also facing this phenomenon due to uncontrolled power outage and circular debt. Pakistan’s power sector is analyzed as a case study to find out the root cause for the unmanaged power sector and for proposing the most effective data-driven solution. After a literature review and discussion with domain experts, it was found that inaccurate power demand forecast is one of the main reasons for power crisis in Pakistan. Under-forecasting caused load shedding, and over-forecasting increased circular debt due to idle capacity payments. Previously, traditional statistical methods were used for power demand forecasting. The multiple linear regression model that is being used since 2018 (IGCEP) uses features such as previous year load and demographic and economic variables for long-term peak power demand forecasting till 2030. The problem is that the independent variables used in existing models are manipulated and cause a gap between actual and forecasted power demand. Moreover, even yearly peak power demand is not absolutely linear in nature; hence, it is necessary to apply AI-based techniques that can handle nonlinearity effectively. Not using system-generated data, not using the most appropriate features, not using an appropriate forecasting time horizon, and not using the appropriate forecasting model are main reasons for inaccurate peak power demand forecasting. The issue can be resolved by forecasting monthly peak power demand for the next 5 years by using the National Power Control Center’s (NPCC) system-generated data. Accurate monthly peak load forecasting leads to accurate yearly peak power demand. The monthly peak load forecasting strategy not only helps in managing operational issues of the power sector such as fuel scheduling and power plant maintenance scheduling but also guides decision-makers toward power and transmission expansion or contraction in the long term. More accurate monthly peak power demand forecasting can be achieved by applying nonlinear AI models in a comprehensive dataset comprising new engineered features, climate features, and the number of consumers. All these features are mostly system-generated and cannot be manipulated. As a result, the accuracy is improved and the results are more reliable than those of the existing models. The new features can be engineered from recent monthly peak load data generated by the system operator (NPCC). Climate features are collected from the Meteorological Department of Pakistan through sensors or database connectivity. The number of electricity consumers can be extracted from NEPRA’s state-of-industry report. All three datasets are combined on a common key (month–year) to a comprehensive dataset, which is passed through different AI models. In the experimental setup, it is found that support vector regression (SVR) produces the most accurate results, with an R-square of 99%, RMSE of 28, and MAPE of 0.1355, which are the best results compared to the literature reviewed.