Lossy compression of Earth system model data based on a hierarchical tensor with Adaptive-HGFDR (v1.0)
-
Published:2021-02-11
Issue:2
Volume:14
Page:875-887
-
ISSN:1991-9603
-
Container-title:Geoscientific Model Development
-
language:en
-
Short-container-title:Geosci. Model Dev.
Author:
Yu Zhaoyuan,Li Dongshuang,Zhang Zhengfang,Luo Wen,Liu Yuan,Wang Zengjie,Yuan Linwang
Abstract
Abstract. Lossy compression has been applied to the data compression of
large-scale Earth system model data (ESMD) due to its advantages of a high
compression ratio. However, few lossy compression methods consider both
global and local multidimensional coupling correlations, which could lead to
information loss in data approximation of lossy compression. Here, an
adaptive lossy compression method, adaptive hierarchical geospatial field data representation (Adaptive-HGFDR), is developed based on the
foundation of a stream compression method for geospatial data called blocked
hierarchical geospatial field data representation (Blocked-HGFDR). In addition, the
original Blocked-HGFDR method is also improved from the following perspectives.
Firstly, the original data are divided into a series of data blocks of a
more balanced size to reduce the effect of the dimensional unbalance of
ESMD. Following this, based on the mathematical relationship between the compression
parameter and compression error in Blocked-HGFDR, the control mechanism is
developed to determine the optimal compression parameter for the given
compression error. By assigning each data block an independent compression
parameter, Adaptive-HGFDR can capture the local variation of
multidimensional coupling correlations to improve the approximation
accuracy. Experiments are carried out based on the Community Earth System
Model (CESM) data. The results show that our method has higher compression
ratio and more uniform error distributions compared with ZFP and
Blocked-HGFDR. For the compression results among 22 climate variables,
Adaptive-HGFDR can achieve good compression performances for most flux
variables with significant spatiotemporal heterogeneity and fast changing rate.
This study provides a new potential method for the lossy compression of the
large-scale Earth system model data.
Publisher
Copernicus GmbH
Reference35 articles.
1. Andrew, P., Joseph, N., Noah, Feldman., Allison, H. B., Alexander, P., and Dorit, M. H.:
A statistical analysis of lossily compressed climate model data, Comput. Geosci., 145, 104599,
https://doi.org/10.1016/j.cageo.2020.104599, 2020. 2. Baker, A. H., Xu, H., Dennis, J. M., Levy, M. N., Nychka, D., Mickelson, S. A.,
Edwards, J., Vertenstein, M., and Wegener, A.: A methodology for evaluating the impact of data
compression on climate simulation data, in: Proceedings of the 23rd International Symposium on
High-Performance Parallel and Distributed Computing, Vancouver, Canada, 23–27 June 2014. 3. Baker, A. H., Hammerling, D. M., Mickelson, S. A., Xu, H., Stolpe, M. B., Naveau, P.,
Sanderson, B., Ebert-Uphoff, I., Samarasinghe, S., De Simone, F., Carbone, F., Gencarelli, C. N.,
Dennis, J. M., Kay, J. E., and Lindstrom, P.: Evaluating lossy data compression on climate
simulation data within a large ensemble, Geosci. Model Dev., 9, 4381–4403,
https://doi.org/10.5194/gmd-9-4381-2016, 2016. 4. Bengua, J. A., Phien, H. N., Tuan, H. D., and Do, M. N.: Matrix product state for
higher-order tensor compression and classification, IEEE Trans. Signal Process., 65, 4019–4030,
https://doi.org/10.1109/TSP.2017.2703882, 2016. 5. Cai, J. Y., Chen, X., and Lu, P.: Non-negative weighted #csps: an effective
complexity dichotomy, Comput. Sci., 6, 45–54, https://doi.org/10.1109/CCC.2011.32, 2012.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Exploring Lossy Compressibility through Statistical Correlations of Scientific Datasets;2021 7th International Workshop on Data Analysis and Reduction for Big Scientific Data (DRBSD-7);2021-11
|
|