Research on outlier detection in CTD conductivity data based on cubic spline fitting
-
Published:2022-11-01
Issue:
Volume:9
Page:
-
ISSN:2296-7745
-
Container-title:Frontiers in Marine Science
-
language:
-
Short-container-title:Front. Mar. Sci.
Author:
Yu Long,Sun Jia,Guo Yanliang,Zhang Baohua,Yang Guangbing,Chen Liang,Ju Xia,Yang Fanlin,Xiong Xuejun,Lv Xianqing
Abstract
Outlier detection is the key to the quality control of marine survey data. For the detection of outliers in Conductivity-Temperature-Depth (CTD) data, previous methods, such as the Wild Edit method and the Median Filter Combined with Maximum Deviation method, mostly set a threshold based on statistics. Values greater than the threshold are treated as outliers, but there is no clear specification for the selection of threshold, thus multiple attempts are required. The process is time-consuming and inefficient, and the results have high false negative and positive rates. In response to this problem, we proposed an outlier detection method in CTD conductivity data, based on a physical constraint, the continuity of seawater. The method constructs a cubic spline fitting function based on the independent points scheme and the cubic spline interpolation to fit the conductivity data. The maximum fitting residual points will be flagged as outliers. The fitting stops when the optimal number of iterations is reached, which is automatically obtained by the minimum value of the sequence of maximum fitting residuals. Verification of the accuracy and stability of the method by means of examples proves that it has a lower false negative rate (17.88%) and false positive rate (0.24%) than other methods. Indeed, rates for the Wild Edit method are 56.96% and 2.19%, while for the Median Filter Combined with Maximum Deviation method rates are 23.28% and 0.31%. The Cubic Spline Fitting method is simple to operate, the result is clear and definite, better solved the problem of conductivity outliers detection.
Publisher
Frontiers Media SA
Subject
Ocean Engineering,Water Science and Technology,Aquatic Science,Global and Planetary Change,Oceanography
Reference37 articles.
1. Distinctive climate signals in reanalysis of global ocean heat content;Balmaseda;Geophysical Res. Letters.,2013
2. World ocean database 2018;Boyer,2018
3. Quality control and processing of historical oceanographic temperature, salinity, and oxygen data;Boyer;NOAA Technical Report NESDIS,1994
4. Data-Driven Science and Engineering
5. A guide to quality control and quality assurance of in-situ temperature and salinity observations;Bushnell,2020
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献