Efficient multi-scale Gaussian process regression for massive remote sensing data with satGP v0.1.2
-
Published:2020-07-31
Issue:7
Volume:13
Page:3439-3463
-
ISSN:1991-9603
-
Container-title:Geoscientific Model Development
-
language:en
-
Short-container-title:Geosci. Model Dev.
Author:
Susiluoto Jouni,Spantini Alessio,Haario Heikki,Härkönen Teemu,Marzouk Youssef
Abstract
Abstract. Satellite remote sensing provides a global view to processes on Earth that has unique benefits compared to making measurements on the ground, such as global coverage and enormous data volume. The typical downsides are spatial and temporal gaps and potentially low data quality. Meaningful statistical inference from such data requires overcoming these problems and developing efficient and robust computational tools.
We design and implement a computationally efficient multi-scale Gaussian process (GP) software package, satGP, geared towards remote sensing applications. The software is able to handle problems of enormous sizes and to compute marginals and sample from the random field conditioning on at least hundreds of millions of observations. This is achieved by optimizing the computation by, e.g., randomization and splitting the problem into parallel local subproblems which aggressively discard uninformative data. We describe the mean function of the Gaussian process by approximating marginals of a Markov random field (MRF). Variability around the mean is modeled with a multi-scale covariance kernel, which consists of Matérn, exponential, and periodic components. We also demonstrate how winds can be used to inform covariances locally.
The covariance kernel parameters are learned by calculating an approximate marginal maximum likelihood estimate, and the validity of both the multi-scale approach and the method used to learn the kernel parameters is verified in synthetic experiments. We apply these techniques to a moderate size ozone data set produced by an atmospheric chemistry model and to the very large number of observations retrieved from the Orbiting Carbon Observatory 2 (OCO-2) satellite. The satGP software is released under an open-source license.
Publisher
Copernicus GmbH
Reference43 articles.
1. Ambikasaran, S., Foreman-Mackey, D., Greengard, L., Hogg, D. W., and
O'Neil, M.: Fast Direct Methods for Gaussian Processes, IEEE T.
Pattern Anal., 38, 252–265,
https://doi.org/10.1109/TPAMI.2015.2448083, 2016. a 2. Bertaux, J., Hauchecorne, A., Dalaudier, F., Cot, C., Kyrölä, E., Fussen,
D., Tamminen, J., Leppelmeier, G., Sofieva, V., Hassinen, S., Fanton
d'Andon, O., Barrot, G., Mangin, A., Theodore, B., Guirlet, M., Korablev,
O., Snoeij, P., Koopman, R., and Fraisse, R.: First results on GOMOS/ENVISAT,
Adv. Space Res., 33, 1029–1035,
https://doi.org/10.1016/j.asr.2003.09.037,
2004. a 3. Bertaux, J. L., Kyrölä, E., Fussen, D., Hauchecorne, A., Dalaudier, F., Sofieva, V., Tamminen, J., Vanhellemont, F., Fanton d'Andon, O., Barrot, G., Mangin, A., Blanot, L., Lebrun, J. C., Pérot, K., Fehr, T., Saavedra, L., Leppelmeier, G. W., and Fraisse, R.: Global ozone monitoring by occultation of stars: an overview of GOMOS measurements on ENVISAT, Atmos. Chem. Phys., 10, 12091–12148, https://doi.org/10.5194/acp-10-12091-2010, 2010. a 4. Chiles, J.-P. and Delfiner, P.: Geostatistics, Wiley, 2012. a 5. Cressie, N.: Mission CO2ntrol: A Statistical Scientist's Role in Remote Sensing
of Atmospheric Carbon Dioxide, J. Am. Stat.
Assoc., 113, 152–168, https://doi.org/10.1080/01621459.2017.1419136, 2018. a
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|