Himawari-8-derived diurnal variations in ground-level PM<sub>2.5</sub> pollution across China using the fast space-time Light Gradient Boosting Machine (LightGBM)
-
Published:2021-05-25
Issue:10
Volume:21
Page:7863-7880
-
ISSN:1680-7324
-
Container-title:Atmospheric Chemistry and Physics
-
language:en
-
Short-container-title:Atmos. Chem. Phys.
Author:
Wei JingORCID, Li ZhanqingORCID, Pinker Rachel T., Wang JunORCID, Sun Lin, Xue Wenhao, Li RunzeORCID, Cribb Maureen
Abstract
Abstract. Fine particulate matter with a diameter of less than 2.5 µm (PM2.5) has been used as an important atmospheric
environmental parameter mainly because of its impact on human health. PM2.5 is affected by both natural and anthropogenic factors that
usually have strong diurnal variations. Such information helps toward understanding the causes of air pollution, as well as our adaptation to
it. Most existing PM2.5 products have been derived from polar-orbiting satellites. This study exploits the use of the next-generation
geostationary meteorological satellite Himawari-8/AHI (Advanced Himawari Imager) to document the diurnal variation in PM2.5. Given the huge volume of satellite data,
based on the idea of gradient boosting, a highly efficient tree-based Light Gradient Boosting Machine (LightGBM) method by involving the
spatiotemporal characteristics of air pollution, namely the space-time LightGBM (STLG) model, is developed. An hourly PM2.5 dataset for
China (i.e., ChinaHighPM2.5) at a 5 km spatial resolution is derived based on Himawari-8/AHI aerosol products with additional
environmental variables. Hourly PM2.5 estimates (number of data samples = 1 415 188) are well correlated with ground measurements in
China (cross-validation coefficient of determination, CV-R2 = 0.85), with a root-mean-square error (RMSE) and mean absolute error (MAE) of
13.62 and 8.49 µg m−3, respectively. Our model captures well the PM2.5 diurnal variations showing that pollution increases
gradually in the morning, reaching a peak at about 10:00 LT (GMT+8), then decreases steadily until sunset. The proposed approach outperforms most
traditional statistical regression and tree-based machine-learning models with a much lower computational burden in terms of speed and memory, making
it most suitable for routine pollution monitoring.
Funder
National Key Research and Development Program of China National Natural Science Foundation of China
Publisher
Copernicus GmbH
Subject
Atmospheric Science
Reference66 articles.
1. An, Z., Huang, R. J., Zhang, R., Tie, X., Li, G., Cao, J., Zhou, W., Shi, Z., Han, Y., Gu, Z., and Ji, Y.:
Severe haze in northern China: a synergy of anthropogenic emissions and atmospheric processes,
P. Natl. Acad. Sci. USA,
116, 8657–8666, 2019. 2. Baez-Villanueva, O., Zambrano-Bigiarini, M., Beck, H., Mcnamara, I., and Thinh, N.:
RF-MEP: a novel random forest method for merging gridded precipitation products and ground-based measurements,
Remote Sens. Environ.,
239, 111606, https://doi.org/10.1016/j.rse.2019.111606, 2020. 3. Behrens, T., Schmidt, K., Viscarra, R., Gries, P., Scholten, T., and Macmillan, R.:
Spatial modelling with Euclidean distance fields and machine learning,
Eur. J. Soil Sci.,
69, 757–770, 2018. 4. Bessho, K., Date, K., Hayashi, M., Ikeda, A., and Yoshida, R.:
An introduction to Himawari-8/9 – Japan's new-generation geostationary meteorological satellites,
J. Meteorol. Soc. Jpn.,
2016, 94, 151–183, 2016. 5. Breiman, L.: Random forests,
Mach. Learn.,
45, 5–32, 2001.
Cited by
105 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|