Data mining methods for prediction of air pollution-Reference-Cited by-同舟云学术

Data mining methods for prediction of air pollution

Published:2016-06-01 Issue:2 Volume:26 Page:467-478
ISSN:2083-8492
Container-title:International Journal of Applied Mathematics and Computer Science
language:en
Short-container-title:

Author:

Siwek Krzysztof¹,Osowski Stanisław¹²

Affiliation:

1. Faculty of Electrical Engineering Warsaw University of Technology, pl. Politechniki 1, 00-661 Warsaw, Poland

2. Faculty of Electronic Engineering Military University of Technology, ul. Kaliskiego 2, 00-908 Warsaw, Poland

Abstract

Abstract The paper discusses methods of data mining for prediction of air pollution. Two tasks in such a problem are important: generation and selection of the prognostic features, and the final prognostic system of the pollution for the next day. An advanced set of features, created on the basis of the atmospheric parameters, is proposed. This set is subject to analysis and selection of the most important features from the prediction point of view. Two methods of feature selection are compared. One applies a genetic algorithm (a global approach), and the other-a linear method of stepwise fit (a locally optimized approach). On the basis of such analysis, two sets of the most predictive features are selected. These sets take part in prediction of the atmospheric pollutants PM10, SO2, NO2 and O3. Two approaches to prediction are compared. In the first one, the features selected are directly applied to the random forest (RF), which forms an ensemble of decision trees. In the second case, intermediate predictors built on the basis of neural networks (the multilayer perceptron, the radial basis function and the support vector machine) are used. They create an ensemble integrated into the final prognosis. The paper shows that preselection of the most important features, cooperating with an ensemble of predictors, allows increasing the forecasting accuracy of atmospheric pollution in a significant way.

Publisher

Walter de Gruyter GmbH

Subject

Applied Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Cited by 57 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Transforming air pollution management in India with AI and machine learning technologies;Scientific Reports;2024-09-02

2. Statistical Characterization of Full-Scale Thermophilic Biological Systems to Inform Process Optimization;Environments;2024-02-17

3. Multimodal Imputation-Based Multimodal Autoencoder Framework for AQI Classification and Prediction of Indian Cities;IEEE Access;2024

4. The ST-GRNN Cooperative Training Model Based on Complex Network for Air Quality Prediction;Lecture Notes in Computer Science;2024

5. Air Quality Index Prediction Using Support Vector Regression Based on African Buffalo Optimization;Communications in Computer and Information Science;2024