Affiliation:
1. Uppsala University, Uppsala, Sweden
2. The University of Hong Kong, Pokfulam Road, Hong Kong, China
Abstract
Air pollution is one of the major concerns in global urbanization. Data science can help to understand the dynamics of air pollution and build reliable statistical models to forecast air pollution levels. To achieve these goals, one needs to learn the statistical models which can capture the dynamics from the historical data and predict air pollution in the future. Furthermore, the large size and heterogeneity of today’s big urban data pose significant challenges on the scalability and flexibility of the statistical models. In this work, we present a scalable belief updating framework that is able to produce reliable predictions, using over millions of historical hourly air pollutant and meteorology records. We also present a non-parametric approach to learn the statistical model which reveals interesting periodical dynamics and correlations of the dataset. Based on the scalable belief update framework and the non-parametric model learning approach, we propose an iterative update algorithm to accelerate Gaussian process, which is notorious for its prohibitive computation with large input data. Finally, we demonstrate how to integrate information from heterogeneous data by regarding the beliefs produced by other models as the informative prior. Numerical examples and experimental results are presented to validate the proposed method.
Publisher
Association for Computing Machinery (ACM)
Reference33 articles.
1. U. S. Environmental Protection Agency. 1999. Nitrogen Oxides (NOx) Why and How They Are Controlled. U. S. Environmental Protection Agency. 1999. Nitrogen Oxides (NOx) Why and How They Are Controlled.
2. Deepairnet: Applying recurrent networks for air quality prediction;Athira V.;Procedia Computer Science,2018
3. Matthias Bauer Mark van der Wilk and Carl Edward Rasmussen. 2016. Understanding probabilistic sparse Gaussian process approximations. In Advances in Neural Information Processing Systems. 1533--1541. Matthias Bauer Mark van der Wilk and Carl Edward Rasmussen. 2016. Understanding probabilistic sparse Gaussian process approximations. In Advances in Neural Information Processing Systems. 1533--1541.
4. A general framework for updating belief distributions
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献