Abstract
Abstract. This study proposes a comprehensive benchmark dataset for
streamflow forecasting, WaterBench-Iowa, that follows FAIR (findability, accessibility, interoperability, and reuse) data principles
and is prepared with a focus on convenience for utilizing in data-driven
and machine learning studies, and provides benchmark performance for
state of art deep learning architectures on the dataset for comparative
analysis. By aggregating the datasets of streamflow, precipitation,
watershed area, slope, soil types, and evapotranspiration from federal
agencies and state organizations (i.e., NASA, NOAA, USGS, and Iowa Flood
Center), we provided the WaterBench-Iowa for hourly streamflow forecast
studies. This dataset has a high temporal and spatial resolution with rich
metadata and relational information, which can be used for a variety of deep learning and machine learning research. We defined a sample benchmark task
of predicting the hourly streamflow for the next 5 d for future
comparative studies, and provided benchmark results on this task with sample
linear regression and deep learning models, including long short-term memory
(LSTM), gated recurrent units (GRU), and sequence-to-sequence (S2S). Our
benchmark model results show a median Nash-Sutcliffe efficiency (NSE) of 0.74 and a median Kling-Gupta efficiency (KGE) of 0.79
among 125 watersheds for the 120 h ahead streamflow prediction task.
WaterBench-Iowa makes up for the lack of unified benchmarks in earth science research and can be accessed at Zenodo https://doi.org/10.5281/zenodo.7087806 (Demir et al., 2022a).
Subject
General Earth and Planetary Sciences
Reference63 articles.
1. Agliamzanov, R., Sit, M., and Demir, I.: Hydrology@ Home: a distributed
volunteer computing framework for hydrological research and applications,
J. Hydroinform., 22, 235–248, 2020.
2. Athira, V., Geetha, P., Vinayakumar, R., and Soman, K. P.: Deepairnet:
Applying recurrent networks for air quality prediction, Proc. Comput.
Sci., 132, 1394–1403, 2018.
3. Bai, Y., Bezak, N., Sapač, K., Klun, M., and Zhang, J.: Short-term
streamflow forecasting using the feature-enhanced regression model, Water
Resour. Manage., 33, 4783–4797, 2019.
4. Chung, J., Gulcehre, C., Cho, K., and Bengio, Y.: Empirical evaluation of
gated recurrent neural networks on sequence modeling, arXiv [preprint],
https://doi.org/10.48550/arXiv.1412.3555, 2014.
5. Cybenko, G.: Approximation by superpositions of a sigmoidal function,
Math. Control Signal., 2, 303–314, 1989.
Cited by
15 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献