A Comparison of Machine Learning Methods to Forecast Tropospheric Ozone Levels in Delhi-Reference-Cited by-同舟云学术

A Comparison of Machine Learning Methods to Forecast Tropospheric Ozone Levels in Delhi

Published:2021-12-28 Issue:1 Volume:13 Page:46
ISSN:2073-4433
Container-title:Atmosphere
language:en
Short-container-title:Atmosphere

Author:

Juarez Eliana Kai^ORCID,Petersen Mark R.^ORCID

Abstract

Ground-level ozone is a pollutant that is harmful to urban populations, particularly in developing countries where it is present in significant quantities. It greatly increases the risk of heart and lung diseases and harms agricultural crops. This study hypothesized that, as a secondary pollutant, ground-level ozone is amenable to 24 h forecasting based on measurements of weather conditions and primary pollutants such as nitrogen oxides and volatile organic compounds. We developed software to analyze hourly records of 12 air pollutants and 5 weather variables over the course of one year in Delhi, India. To determine the best predictive model, eight machine learning algorithms were tuned, trained, tested, and compared using cross-validation with hourly data for a full year. The algorithms, ranked by R2 values, were XGBoost (0.61), Random Forest (0.61), K-Nearest Neighbor Regression (0.55), Support Vector Regression (0.48), Decision Trees (0.43), AdaBoost (0.39), and linear regression (0.39). When trained by separate seasons across five years, the predictive capabilities of all models increased, with a maximum R2 of 0.75 during winter. Bidirectional Long Short-Term Memory was the least accurate model for annual training, but had some of the best predictions for seasonal training. Out of five air quality index categories, the XGBoost model was able to predict the correct category 24 h in advance 90% of the time when trained with full-year data. Separated by season, winter is considerably more predictable (97.3%), followed by post-monsoon (92.8%), monsoon (90.3%), and summer (88.9%). These results show the importance of training machine learning methods with season-specific data sets and comparing a large number of methods for specific applications.

Publisher

MDPI AG

Subject

Atmospheric Science,Environmental Science (miscellaneous)

Link

https://www.mdpi.com/2073-4433/13/1/46/pdf

Reference70 articles.

1. Air-Pollution Prediction in Smart Cities through Machine Learning Methods: A Case of Study in Murcia, Spain;Martınez-Espana;J. Univ. Comput. Sci.,2018

2. Outdoor Air Pollution: Ozone Health Effects

3. Predicting ozone levels from climatic parameters and leaf traits of Bel-W3 tobacco variety

4. The DOE E3SM Coupled Model Version 1: Overview and Evaluation at Standard Resolution

5. An Evaluation of the Ocean and Sea Ice Climate of E3SM Using MPAS and Interannual CORE‐II Forcing

Cited by 22 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A bivariate simultaneous pollutant forecasting approach by Unified Spectro-Spatial Graph Neural Network (USSGNN) and its application in prediction of O3 and NO2 for New Delhi, India;Sustainable Cities and Society;2024-11

2. IoT-based monitoring system and air quality prediction using machine learning for a healthy environment in Cameroon;Environmental Monitoring and Assessment;2024-06-15

3. A novel ensemble machine learning method for accurate air quality prediction;International Journal of Environmental Science and Technology;2024-05-06

4. Prediction of Ground-Level Ozone in Bengaluru Using Machine Learning Techniques;2024 IEEE 9th International Conference for Convergence in Technology (I2CT);2024-04-05

5. Urban ozone variability using automated machine learning: inference from different feature importance schemes;Environmental Monitoring and Assessment;2024-03-23