Abstract
PurposeThe state of Mato Grosso represents the largest producer and exporter of soybeans in Brazil; given this importance, it was aimed to propose to use the univariate imputation tool for time series, through applications of splines interpolations, in 46 of its municipalities that had missing data in the variables soybean production in thousand tons, production value and soy derivatives in R$ thousand, and also to assess the differences between the observed series and those with imputed values, in each of these municipalities, in these variables.Design/methodology/approachThe proposed methodology was based on the use of the univariate imputation method through the application of cubic spline interpolation in each of the 46 municipalities, for each of the 3 variables. Then, for each municipality, the original series were compared with each observed series plus the values imputed in these variables by the Quenouille test of correlation of time series.FindingsIt was observed that, after imputation, all series were compared with those observed and are equal by the Queinouille test in the 46 municipalities analyzed, and the Wilcoxon test also showed equality for the accumulated total of the three variables involved with the production of soybeans. And there were increases of 5.92%, 3.58% and 2.84% for soy production, soy production value and soy derivatives value accumulated in the state after imputation in the 46 municipalities.Originality/valueThe present research and its results facilitate the process of estimates and monitoring the total soy production in the state of Mato Grosso and its municipalities from 1990 to 2018.
Subject
Economics and Econometrics,Agricultural and Biological Sciences (miscellaneous)
Reference58 articles.
1. The treatment of missing values and its effect in the classifier accuracy, classification, clustering, and data mining applications,2004
2. Study of cubic splines and fourier series as interpolation techniques for filling in short periods of missing building energy use and weather data;Journal of Solar Energy Engineering Transactions of the ASME,2006
3. Gerenciamento de sistemas agroindustriais: definições e correntes metodológicas;Gestão Agroindustrial,2007
4. An analysis of four missing data treatment methods for supervised learning;Applied Artificial Intelligence,2003
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献