Affiliation:
1. a Department of Energy, Aalborg University, Esbjerg, Denmark
2. b Krüger A/S, Veolia Water Technologies, Aalborg, Denmark
Abstract
Abstract
In this work, operational data collected from four Danish wastewater treatment plants (WWTP) are assessed for quality issues and analyzed to investigate the feasibility of data-driven modeling for control purposes. All plants have permanent N2O sensors installed in the biological reactors, and N2O data are collected on the same terms as other operational data. We present and deploy a six-dimensional data quality assessment to the operational data evaluating (1) relevance, (2) accuracy, (3) completeness, (4) consistency, (5) comparability, and (6) accessibility. To increase the accuracy and completeness of the stored data, it is suggested that future initiatives are taken toward the collection and storing of metadata in WWTPs. Furthermore, seasonal variations and time-varying relationships between N2O, nitrogenous variables, and oxygen are investigated and compared across various case plants and process designs. Results show that the quality of the operational data varies substantially between plants. The investigation of time-varying interrelation between N2O and nitrogenous variables showed no clear pattern within or across different case plants. Furthermore, it is recommended that future research should consider adapting models so that more influence is linked to reliable measurements, contrary to assuming that all variables are of equal quality.
Reference31 articles.
1. Breck E., Polyzotis N., Roy S., Whang S. & Zinkevich M. 2019 Data validation for machine learning. In: Proceedings of Machine Learning and Systems, pp. 334–347.
2. Budach L., Feuerpfeil M., Ihde N., Nathansen A., Noack N., Patzlaff H., Harmouch H. & Naumann F. 2022 The effects of data quality on machine learning performance. arXiv preprint arXiv:2207.14529, pp. 1–40.
3. BIO-DENITRO and BIO-DENIPHO Systems – Experiences and Advanced Model Development: The Danish Systems for Biological N and P Removal
4. The Challenges of Data Quality and Data Quality Assessment in the Big Data Era
5. The Analysis of Time Series
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献