Abstract
Panel data models are commonly applied to numerical response variables, whereas literature on forecasting categorical variables in a panel data structure is still scarce. Forecasting is important because it can inform government policy. This study aimed to forecast multiclass (categorical) variables in a panel data structure. The proposed forecasting models were autoregressive multinomial logit and autoregressive C5.0. To make the two models usable for forecasting, autoregressive effects and fixed predictor variables such as location, time, strata, and month of observation were added. The autoregressive effect was assumed to be a fixed effect and treated as a dummy variable. The data were the categories of land conditions recorded in the Area Sampling Frame (ASF) survey conducted by BPS-Statistics Indonesia. Both models were evaluated on classification and forecasting performance. Classification performance was obtained by dividing the dataset into 75% training data for modeling and 25% test data for validation, repeated 200 times. The classification results showed an accuracy of 86.48% for autoregressive C5.0 and 83.97% for autoregressive multinomial logit. Forecasting performance was compared by dividing the data into training and testing sets based on the time sequence. Forecasting performance was worse than classification performance: autoregressive C5.0 had an accuracy of 77.43%, while autoregressive multinomial logit had 77.77%.
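The data preparation described in the abstract can be illustrated with a minimal sketch. It assumes a toy panel with made-up category labels and hypothetical helper names (`make_panel`, `add_lag_dummies`, `random_split`, `time_split`); none of these come from the paper. It shows only how a lagged category could be encoded as dummy variables and how the two evaluation splits differ. Fitting the multinomial logit or C5.0 models is left out.

```python
import random

# Hypothetical category labels; the real ASF land-condition categories differ.
CATEGORIES = ["A", "B", "C"]

def make_panel(n_segments=4, n_months=6, seed=0):
    """Build a toy panel: one category per segment per month (synthetic data)."""
    rng = random.Random(seed)
    return [
        {"segment": s, "month": t, "category": rng.choice(CATEGORIES)}
        for s in range(n_segments)
        for t in range(1, n_months + 1)
    ]

def add_lag_dummies(panel):
    """Attach the previous month's category of the same segment as 0/1 dummies."""
    by_key = {(r["segment"], r["month"]): r["category"] for r in panel}
    rows = []
    for r in panel:
        prev = by_key.get((r["segment"], r["month"] - 1))
        if prev is None:  # the first month has no lag, so it is dropped
            continue
        row = dict(r)
        for c in CATEGORIES:
            row[f"lag_{c}"] = 1 if prev == c else 0
        rows.append(row)
    return rows

def random_split(rows, train_frac=0.75, seed=0):
    """Random split for classification performance (repeat with new seeds)."""
    rng = random.Random(seed)
    shuffled = rows[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * train_frac)
    return shuffled[:cut], shuffled[cut:]

def time_split(rows, last_train_month):
    """Time-ordered split for forecasting: train on the past, test on the future."""
    train = [r for r in rows if r["month"] <= last_train_month]
    test = [r for r in rows if r["month"] > last_train_month]
    return train, test

rows = add_lag_dummies(make_panel())
train_r, test_r = random_split(rows)                      # 75% / 25%
train_t, test_t = time_split(rows, last_train_month=5)    # final month held out
```

In the random split, test rows can come from any month, so the model sees neighbouring time points during training. The time-ordered split withholds the latest period entirely, which is one plausible reason forecasting accuracy comes out lower than classification accuracy.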
Publisher
Pakistan Journal of Statistics and Operation Research
Subject
Management Science and Operations Research; Statistics, Probability and Uncertainty; Modeling and Simulation; Statistics and Probability
Cited by
1 article.