Affiliation:
1. University of Würzburg, Würzburg, Germany
2. Stanford University, Stanford, CA, USA
Abstract
Land-use regression (LUR) models are important for the assessment of air pollution concentrations in areas without measurement stations. While many such models exist, they often use manually constructed features based on restricted, locally available data. Thus, they are typically hard to reproduce and challenging to adapt to areas beyond those they have been developed for.
In this article, we advocate a paradigm shift for LUR models: We propose the
D
ata-driven,
O
pen,
G
lobal (DOG) paradigm that entails models based on purely data-driven approaches using only openly and globally available data. Progress within this paradigm will alleviate the need for experts to adapt models to the local characteristics of the available data sources and thus facilitate the generalizability of air pollution models to new areas on a global scale.
To illustrate the feasibility of the DOG paradigm for LUR, we introduce a deep-learning model called MapLUR. It is based on a convolutional neural network architecture and is trained exclusively on globally and openly available map data without requiring manual feature engineering. We compare our model to state-of-the-art baselines like linear regression, random forests and multi-layer perceptrons using a large data set of modeled NO
2
concentrations in Central London. Our results show that MapLUR significantly outperforms these approaches even though they are provided with manually tailored features.
Furthermore, we illustrate that the automatic feature extraction inherent to models based on the DOG paradigm can learn features that are readily interpretable and closely resemble those commonly used in traditional LUR approaches.
Funder
DFG grant “p2Map: Learning Environmental Maps—Integrating Participatory Sensing and Human Perception”
Publisher
Association for Computing Machinery (ACM)
Subject
Discrete Mathematics and Combinatorics,Geometry and Topology,Computer Science Applications,Modeling and Simulation,Information Systems,Signal Processing
Reference56 articles.
1. Air Quality Team (Greater London Authority). [n.d.]. London Atmospheric Emissions Inventory (LAEI). Retrieved from https://data.london.gov.uk/dataset/london-atmospheric-emissions-inventory-2013. Air Quality Team (Greater London Authority). [n.d.]. London Atmospheric Emissions Inventory (LAEI). Retrieved from https://data.london.gov.uk/dataset/london-atmospheric-emissions-inventory-2013.
2. Exploring the modeling of spatiotemporal variations in ambient air pollution within the land use regression framework: Estimation of PM10 concentrations on a daily basis
3. Hourly PM2.5 concentration forecast using stacked autoencoder model with emphasis on seasonality
4. An ensemble long short-term memory neural network for hourly PM2.5 concentration forecasting
Cited by
10 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献