Diagnosing drivers of PM2.5 simulation biases in China from meteorology, chemical composition, and emission sources using an efficient machine learning method
-
Published:2024-05-06
Issue:9
Volume:17
Page:3617-3629
-
ISSN:1991-9603
-
Container-title:Geoscientific Model Development
-
language:en
-
Short-container-title:Geosci. Model Dev.
Author:
Wang Shuai, Zhang Mengyuan, Gao Yueqi, Wang PengORCID, Fu QingyanORCID, Zhang HongliangORCID
Abstract
Abstract. Chemical transport models (CTMs) are widely used for air pollution modeling, which suffer from significant biases due to uncertainties in simplified parameterization, meteorological fields, and emission inventories. Accurate diagnosis of simulation biases is critical for the improvement of models, interpretation of results, and management of air quality, especially for the simulation of fine particulate matter (PM2.5). In this study, an efficient method with high speed and a low computational resource requirement based on the tree-based machine learning (ML) method, the light gradient boosting machine (LightGBM), was designed to diagnose CTM simulation biases. The drivers of the Community Multiscale Air Quality (CMAQ) model biases are compared to observations obtained by simulating PM2.5 concentrations from the perspectives of meteorology, chemical composition, and emission sources. The source-oriented CMAQ was used to diagnose the influences of different emission sources on PM2.5 biases. The model can capture the complex relationship between input variables and simulation bias well; meteorology, PM2.5 components, and source sectors can partially explain the simulation bias. The CMAQ model underestimates PM2.5 by −19.25 to −2.66 µg m−3 in 2019, especially in winter and spring and during high-PM2.5 events. Secondary organic components showed the largest contribution to the PM2.5 simulation bias for different regions and seasons (13.8 %–22.6 %) of all components. Relative humidity, cloud cover, and soil surface moisture were the main meteorological factors contributing to PM2.5 bias in the North China Plain, Pearl River Delta, and northwestern China, respectively. Primary and secondary inorganic components from residential sources showed the two largest contributions to this bias (12.05 % and 12.78 %), implying large uncertainties in this sector. The ML-based methods provide valuable complements to traditional-mechanism-based methods for model improvement, with high efficiency and low reliance on prior information.
Funder
National Natural Science Foundation of China
Publisher
Copernicus GmbH
Reference79 articles.
1. Aleksankina, K., Reis, S., Vieno, M., and Heal, M. R.: Advanced methods for uncertainty assessment and global sensitivity analysis of an Eulerian atmospheric chemistry transport model, Atmos. Chem. Phys., 19, 2881–2898, https://doi.org/10.5194/acp-19-2881-2019, 2019. 2. Bai, K., Li, K., Ma, M., Li, K., Li, Z., Guo, J., Chang, N.-B., Tan, Z., and Han, D.: LGHAP: the Long-term Gap-free High-resolution Air Pollutant concentration dataset, derived via tensor-flow-based multimodal data fusion, Earth Syst. Sci. Data, 14, 907–927, https://doi.org/10.5194/essd-14-907-2022, 2022. 3. Beekmann, M. and Derognat, C.: Monte Carlo uncertainty analysis of a regional-scale transport chemistry model constrained by measurements from the atmospheric pollution over the Paris area (ESQUIF) campaign, J. Geophys. Res.-Atmos., 108, 8859, 10.1029/2003JD003391, 2003. 4. Bei, N., Wu, J., Elser, M., Feng, T., Cao, J., El-Haddad, I., Li, X., Huang, R., Li, Z., Long, X., Xing, L., Zhao, S., Tie, X., Prévôt, A. S. H., and Li, G.: Impacts of meteorological uncertainties on the haze formation in Beijing–Tianjin–Hebei (BTH) during wintertime: a case study, Atmos. Chem. Phys., 17, 14579–14591, https://doi.org/10.5194/acp-17-14579-2017, 2017. 5. Belgiu, M. and Drăguţ, L.: Random forest in remote sensing: A review of applications and future directions, ISPRS J. Photogramm., 114, 24–31, 2016.
|
|