Assessment of Different Methods for Estimation of Missing Rainfall Data-Reference-Cited by-同舟云学术

Assessment of Different Methods for Estimation of Missing Rainfall Data

Published:2024-07-31 Issue: Volume: Page:
ISSN:0920-4741
Container-title:Water Resources Management
language:en
Short-container-title:Water Resour Manage

Author:

Hırca Tuğçe^ORCID,Eryılmaz Türkkan Gökçen^ORCID

Abstract

AbstractMissing data is a common problem encountered in various fields, including clinical research, environmental sciences and hydrology. In order to obtain reliable results from the analysis, the data inventory must be completed. This paper presents a methodology for addressing the missing data problem by examining the missing data structure and missing data techniques. Simulated datasets were created by considering the number of missing data, missing data pattern and missing data mechanism of real datasets containing missing values, which are often overlooked in hydrology. Considering the missing data pattern, the most commonly used methods for missing data analysis in hydrology and other fields were applied to the created simulated datasets. Simple imputation techniques and expectation maximization (EM) were implemented in SPSS software and machine learning techniques such as k-nearest neighbor (kNN), together with the hot-deck were implemented in the Python programming language. In the performance evaluation based on error metrics, it is concluded that the EM method is the most suitable completion method. Homogeneity analyses were performed in the Mathematica programming language to identify possible changes and inconsistencies in the completed rainfall dataset. Homogeneity analyses revealed that most of the completed rainfall datasets are homogeneous at class 1 level, consistent and reliable and do not show systematic changes in time.

Funder

Bayburt University

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s11269-024-03936-3.pdf

Reference89 articles.

1. Addi M, Gyasi-Agyei Y, Obuobie E, Amekudzi LK (2022) Evaluation of imputation techniques for infilling missing daily rainfall records on river basins in Ghana. J Des Sci Hydrologiques 67(4):613–627. https://doi.org/10.1080/02626667.2022.2030868

2. Ahani H, Kherad M, Kousari MR, Zadeh MR, Karampour MA, Ejraee F, Kamali S (2012) An investigation of trends in precipitation volume for the last three decades in different regions of Fars province, Iran. Theor Appl Climatol 109:361–382. https://doi.org/10.1007/s00704-011-0572-z

3. Alexandersson H (1986) A homogeneity test applied to precipitationdata. J Climatol 6:661–675. https://doi.org/10.1002/joc.3370060607

4. Amirteimoori A, Kordrostami S (2010) A Euclidean distance-based measure of efficiency in data envelopment analysis. Optimization 59(7):985–996. https://doi.org/10.1080/02331930902878333

5. Andridge RR, Little RJ (2010) A Review of hot deck ımputation for survey non-response. Int Stat Rev 78:40–64. https://doi.org/10.1111/j.1751-5823.2010.00103.x