Machine Learning Framework with Feature Importance Interpretation for Discharge Estimation: A Case Study in Huitanggou Sluice Hydrological Station, China

Author:

He Sheng1,Niu Geng1ORCID,Sang Xuefeng1,Sun Xiaozhong2,Yin Junxian1,Chen Heting3

Affiliation:

1. State Key Laboratory of Simulation and Regulation of Water Cycle in River Basin, China Institute of Water Resources and Hydropower Research, Beijing 100038, China

2. Energy and Water Conservancy Planning Institute, Powerchina Huadong Engineering Corporation Limited, Hangzhou 311100, China

3. Suzhou Hydrology and Water Resources Bureau of Anhui Province, Suzhou 234000, China

Abstract

Accurate and reliable discharge estimation plays an important role in water resource management as well as downstream applications such as ecosystem conservation and flood control. Recently, data-driven machine learning (ML) techniques showed seemingly insurmountable performance in runoff forecasting and other geophysical domains, but they still need to be improved in terms of reliability and interpretability. In this study, focusing on discharge estimation and management, we developed an ML-based framework and applied it to the Huitanggou sluice hydrological station in Anhui Province, China. The framework contains two ML algorithms, the ensemble learning random forest (ELRF) and the ensemble learning gradient boosting decision tree (ELGBDT). The SHapley Additive exPlanation (SHAP) was introduced into our framework to interpret the impact of the model features. In our framework, the correlation analysis of the dataset can provide feature information for modeling, and the quartile method was utilized to solve the outlier problem of the dataset. The Bayesian optimization algorithm was adopted to optimize the hyperparameters of the ensemble ML models. The ensemble ML models are further compared with the traditional stage–discharge rating curve (SDRC) method and the single ML model. The results show that the estimation performance of the ensemble ML models is superior to that of the SDRC and the single ML model. In addition, an analysis of the discharge estimation without considering the flow state was performed. This analysis reveals that the ensemble ML models have strong adaptability. The ensemble ML models accurately estimate the discharge, with a coefficient of determination of 0.963, a root mean squared error of 31.268, and a coefficient of correlation of 0.984. Our framework can prove helpful to improve the efficiency of short-term hydrological estimation and simultaneously provide the interpretation of the impact of the hydrological features on estimation results.

Funder

National Natural Science Foundation of China

National Key Research and Development Program of China

Shenzhen Smart Water Project Phase I, China

Publisher

MDPI AG

Subject

Water Science and Technology,Aquatic Science,Geography, Planning and Development,Biochemistry

Reference49 articles.

1. Estimate stage-discharge relation for rivers using artificial neural networks-Case study: Dost Bayglu hydrometry station over Qara Su River;Nezamkhiavy;Int. J. Water Resour. Environ. Eng.,2014

2. Scenario-based prediction of short-term river stage-discharge process using wavelet-EEMD-based relevance vector machine;Roushangar;J. Hydroinform.,2019

3. Gene-Expression Programming for the Development of a Stage-Discharge Curve of the Pahang River;Azamathulla;Water Resour. Manag.,2011

4. Ghimire, B., and Reddy, M.J. (2010). Development of Stage-Discharge Rating Curve in River Using Genetic Algorithms and Model Tree, International Workshop on Advances in Statistical Hydrology.

5. New Approach for Stage–Discharge Relationship: Gene-Expression Programming;Guven;J. Hydrol. Eng.,2009

Cited by 2 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3