Affiliation:
1. Department of Electronics and Computer Discipline, Indian Institute of Technology Roorkee Saharanpur Campus,
Saharanpur, India
Abstract
Background:
With the increase in populations in urban areas, there is an increase in
pollution also. Air pollution is one of the challenging environmental issues in smart cities.
Objective:
Real-time monitoring of air quality can help the administration to take appropriate decisions on time. Advancement in the Internet of Things based sensors has changed the way to monitor air quality.
Methods:
In this paper, we have applied two-stage regressions. In the first stage, ten regression algorithms (Decision
Tree, Random Forest, Elastic Net, Adaboost, Extra Tree, Linear Regression, Lasso, XGBoost, Light GBM, AdaBoost,
and Multi-Layer Perceptron) is applied and in second stage best four algorithms are picked and stacking ensemble algorithms is applied using python to predict the PM2.5 pollutants in air. Data set of five Chinese cities (Beijing, Chengdu,
Guangzhou, Shanghai, and Shenyang) has taken into consideration and compared based on MAE (Mean Absolute Error),
RMSE (Root Mean Square Error), and R2 parameters.
Results:
We observed that out of ten regression algorithms applied, extra tree algorithm exhibited
the best performance on all the five datasets, and further stacking improved the performance.
Conclusion:
Feature importance for Sheyang and Beijing city was computed using three regression algorithms, and we found that the four most important features are humidity, wind speed, wind direction and dew point.
Publisher
Bentham Science Publishers Ltd.
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献