GPRChinaTemp1km: a high-resolution monthly air temperature data set for China (1951–2020) based on machine learning
-
Published:2022-07-15
Issue:7
Volume:14
Page:3273-3292
-
ISSN:1866-3516
-
Container-title:Earth System Science Data
-
language:en
-
Short-container-title:Earth Syst. Sci. Data
Author:
He Qian,Wang Ming,Liu Kai,Li Kaiwen,Jiang Ziyu
Abstract
Abstract. An accurate spatially continuous air temperature data set is crucial for
multiple applications in the environmental and ecological sciences. Existing
spatial interpolation methods have relatively low accuracy, and the
resolution of available long-term gridded products of air temperature for
China is coarse. Point observations from meteorological stations can provide
long-term air temperature data series but cannot represent spatially
continuous information. Here, we devised a method for spatial interpolation
of air temperature data from meteorological stations based on powerful
machine learning tools. First, to determine the optimal method for
interpolation of air temperature data, we employed three machine learning
models: random forest, support vector machine, and Gaussian process
regression. A comparison of the mean absolute error, root mean square error,
coefficient of determination, and residuals revealed that a Gaussian process
regression had high accuracy and clearly outperformed the other two models
regarding the interpolation of monthly maximum, minimum, and mean air
temperatures. The machine learning methods were compared with three
traditional methods used frequently for spatial interpolation: inverse
distance weighting, ordinary kriging, and ANUSPLIN (Australian
National University Spline). Results showed that the Gaussian process
regression model had higher accuracy and greater robustness than the
traditional methods regarding interpolation of monthly maximum, minimum, and
mean air temperatures in each month. A comparison with the TerraClimate
(Monthly Climate and Climatic Water Balance for Global Terrestrial
Surfaces), FLDAS (Famine Early Warning Systems Network (FEWS NET) Land Data
Assimilation System), and ERA5 (ECMWF, European Centre for Medium-Range Weather Forecasts, Climate Reanalysis) data sets revealed
that the accuracy of the temperature data generated using the Gaussian
process regression model was higher. Finally, using the Gaussian process
regression method, we produced a long-term (January 1951 to December 2020)
gridded monthly air temperature data set, with 1 km resolution and high
accuracy for China, which we named GPRChinaTemp1km. The data set consists of
three variables: monthly mean air temperature, monthly maximum air
temperature, and monthly minimum air temperature. The obtained
GPRChinaTemp1km data were used to analyse the spatiotemporal variations of
air temperature using Theil–Sen median trend analysis in combination with
the Mann–Kendall test. It was found that the monthly mean and minimum air
temperatures across China were characterised by a significant trend of
increase in each month, whereas monthly maximum air temperatures showed a
more spatially heterogeneous pattern, with significant increase,
non-significant increase, and non-significant decrease. The GPRChinaTemp1km
data set is publicly available at https://doi.org/10.5281/zenodo.5112122 (He
et al., 2021a) for monthly maximum air temperature, at
https://doi.org/10.5281/zenodo.5111989 (He et al., 2021b) for monthly mean
air temperature, and at https://doi.org/10.5281/zenodo.5112232 (He et al.,
2021c) for monthly minimum air temperature.
Funder
National Key Research and Development Program of China
Publisher
Copernicus GmbH
Subject
General Earth and Planetary Sciences
Reference119 articles.
1. Abatzoglou, J. T., Dobrowski, S. Z., Parks, S. A., and Hegewisch, K. C.:
TerraClimate, a high-resolution global data set of monthly climate and
climatic water balance from 1958–2015, Scientific Data, 5, 170191,
https://doi.org/10.1038/sdata.2017.191, 2018. 2. Alizamir, M., Kisi, O., Ahmed, A. N., Mert, C., Fai, C. M., Kim, S., Kim, N.
W., and El-Shafie, A.: Advanced machine learning model for better prediction
accuracy of soil temperature at different depths, PLOS ONE, 15, e0231055,
https://doi.org/10.1371/journal.pone.0231055, 2020. 3. Alvarez, O., Guo, Q., Klinger, R. C., Li, W., and Doherty, P.: Comparison of elevation and remote sensing derived products as auxiliary data for climate surface interpolation, Int. J. Climatol., 34, 2258–2268, https://doi.org/10.1002/joc.3835, 2014. 4. Amini, M. A., Torkan, G., Eslamian, S., Zareian, M. J., and Adamowski, J.
F.: Analysis of deterministic and geostatistical interpolation techniques
for mapping meteorological variables at large watershed scales, Acta
Geophys., 67, 191–203, https://doi.org/10.1007/s11600-018-0226-y, 2019. 5. Appelhans, T., Mwangomo, E., Hardy, D. R., Hemp, A., and Nauss, T.:
Evaluating machine learning approaches for the interpolation of monthly air
temperature at Mt. Kilimanjaro, Tanzania, Spatial Statistics, 14, 91–113,
https://doi.org/10.1016/j.spasta.2015.05.008, 2015.
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|