A feature extraction method based on the entropy-minimal description length principle and GBDT for common surface water pollution identification

Author:

Huang Pingjie1, Wang Lixiang1, Hou Dibo1, Lin Wangli1, Yu Jie1, Zhang Guangxin1, Zhang Hongjian1

Affiliation:

1. College of Control Science and Engineering, Zhejiang University, Hangzhou, Zhejiang, China

Abstract

Water quality monitoring is necessary to prevent river pollution effectively. However, existing water quality assessment methods characterize water quality conditions only to a limited extent, and few studies have addressed feature extraction for water pollution identification or obtained accurate information on pollution sources. This study therefore proposes a feature extraction method based on the entropy-minimal description length principle and the gradient boosting decision tree (GBDT) algorithm for identifying the type of surface water pollution, taking into account the distribution characteristics and intrinsic associations of conventional water quality indicators. To improve robustness to noise, coarse-grained discretization features are constructed for each water quality indicator based on information entropy. The GBDT algorithm is then used to mine the nonlinear correlation between the water quality indicators and the pollution classes and to produce tree-transformed features. Water samples collected by the Environmental Monitoring Center of a southern city were used to test the proposed algorithm. The experimental results show that the features extracted by the proposed method are more effective than both the raw water quality indicators without feature engineering and the features extracted by principal component analysis.
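
The entropy-based discretization step can be illustrated with a minimal sketch. It assumes the widely used Fayyad and Irani formulation of the entropy-minimal description length criterion, which the abstract does not confirm as the exact variant used. The function names and the toy indicator values and class labels are hypothetical and are not taken from the study's data.

```python
import bisect
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def best_cut(vals, labs):
    """Find the boundary with the lowest weighted class entropy.

    Expects vals sorted ascending. Returns (entropy, cut, split_index) or None
    when all values are identical and no cut is possible.
    """
    n = len(vals)
    best = None
    for i in range(1, n):
        if vals[i] == vals[i - 1]:
            continue  # a cut must lie between two distinct values
        e = (i * entropy(labs[:i]) + (n - i) * entropy(labs[i:])) / n
        if best is None or e < best[0]:
            best = (e, (vals[i - 1] + vals[i]) / 2, i)
    return best


def _split(vals, labs):
    """Recursively bisect, keeping a cut only if it passes the MDL test."""
    found = best_cut(vals, labs)
    if found is None:
        return []
    e_split, cut, i = found
    left, right = labs[:i], labs[i:]
    n = len(labs)
    gain = entropy(labs) - e_split
    k, k1, k2 = len(set(labs)), len(set(left)), len(set(right))
    delta = math.log2(3 ** k - 2) - (
        k * entropy(labs) - k1 * entropy(left) - k2 * entropy(right)
    )
    # Assumed acceptance rule (Fayyad and Irani): gain must exceed this cost.
    if gain <= (math.log2(n - 1) + delta) / n:
        return []
    return _split(vals[:i], left) + [cut] + _split(vals[i:], right)


def mdlp_cut_points(values, labels):
    """Cut points for one water quality indicator, in ascending order."""
    pairs = sorted(zip(values, labels), key=lambda p: p[0])
    return _split([v for v, _ in pairs], [c for _, c in pairs])


def discretize(value, cuts):
    """Map a raw reading to its coarse-grained bin index."""
    return bisect.bisect_right(cuts, value)


# Hypothetical toy indicator: low readings are "clean", high ones "polluted".
toy_values = [float(i) for i in range(20)]
toy_labels = ["clean"] * 10 + ["polluted"] * 10
toy_cuts = mdlp_cut_points(toy_values, toy_labels)
toy_bins = [discretize(v, toy_cuts) for v in toy_values]
```

Because a cut is accepted only when its information gain outweighs its description-length cost, an indicator whose values carry no class information yields no cut points and collapses to a single bin, which is one way such coarse-grained features can damp noise.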

Funder

the Key Technology Research and Development Program of Zhejiang Province

the National Natural Science Foundation of China

the National Key R&D Program of China

Publisher

IWA Publishing

Subject

Atmospheric Science, Geotechnical Engineering and Engineering Geology, Civil and Structural Engineering, Water Science and Technology
