A data-driven persistence test for robust (probabilistic) quality control of measured environmental time series: constant value episodes
-
Published:2023-06-21
Issue:12
Volume:16
Page:3085-3100
-
ISSN:1867-8548
-
Container-title:Atmospheric Measurement Techniques
-
language:en
-
Short-container-title:Atmos. Meas. Tech.
Abstract
Abstract. Robust quality control is a prerequisite and an essential
component in any data application. That is especially important for time
series of environmental observations such as air quality due to their
dynamic and irreversible nature. One of the common issues in these data is
constant value episodes (CVEs), where a set of consecutive data values
remains constant over a given period. Although CVEs are often considered to be an indicator of sensor failure or other measurement errors and are removed during quality control procedures, there are situations when CVEs reflect natural environmental phenomena, and they should not be removed from the data or analysis. Assessing whether the CVEs are erroneous data or valid observations is a challenge. As there are no formal procedures established for this, their classification is based on subjective judgment and is therefore uncertain and irreproducible. This paper presents a novel test procedure, i.e., constant value test, to estimate the probability of CVEs being valid data. The theoretical foundation of this test is based on
statistical characteristics and probability theory and takes into account
the numerical precision of the data values. The test is a data-driven
(parametric) approach, which makes it usable for time series analysis in
different environmental research domains, as long as serial dependency is
given and the data distribution is not too different from Gaussian. The
robustness of the test was demonstrated with sensitivity studies using
synthetic data with different distributions. Example applications to
measured air temperature and ozone mixing ratio data confirm the versatility
of the test.
Funder
H2020 European Research Council
Publisher
Copernicus GmbH
Subject
Atmospheric Science
Reference56 articles.
1. Bey, I., Jacob, D. J., Yantosca, R. M., Logan, J. A., Field, B. D., Fiore,
A. M., Li, Q., Liu, H. Y., Mickley, L. J., and Schultz, M. G.: Global
modeling of tropospheric chemistry with assimilated meteorology: Model
description and evaluation, J. Geophys. Res.-Atmos., 106, 23073–23095, https://doi.org/10.1029/2001JD000807, 2001. 2. Box, G. E. P., Jenkins, G. M., Reinsel, G. C., and Ljung, G. M.: Time series
analysis: forecasting and control, 5th edn., John Wiley & Sons,
Inc, Hoboken, New Jersey, 712 pp., ISBN: 978-1-118-67502-1, 2015. 3. Bushnell, M., Waldmann, C., Seitz, S., Buckley, E., Tamburri, M., Hermes, J., Heslop, E., and Lara-Lopez, A.: Quality Assurance of Oceanographic Observations: Standards and Guidance Adopted by an International Partnership, Frontiers in Marine Science, 6, 706, https://doi.org/10.3389/fmars.2019.00706, 2019. 4. Campbell, J. L., Rustad, L. E., Porter, J. H., Taylor, J. R., Dereszynski,
E. W., Shanley, J. B., Gries, C., Henshaw, D. L., Martin, M. E., Sheldon, W.
M., and Boose, E. R.: Quantity is Nothing without Quality: Automated QA/QC
for Streaming Environmental Sensor Data, BioScience, 63, 574–585, https://doi.org/10.1525/bio.2013.63.7.10, 2013. 5. Castelão, G. P.: A Flexible System for Automatic Quality Control of
Oceanographic Data, arXiv [preprint], https://doi.org/10.48550/arXiv.1503.02714, 17 November 2016.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|