A novel interpretable machine learning model approach for the prediction of TiO2 photocatalytic degradation of air contaminants

Author:

Schossler Rodrigo Teixeira,Ojo Samuel,Jiang Zhuoying,Hu Jiajie,Yu Xiong

Abstract

AbstractAir contaminants lead to various environmental and health issues. Titanium dioxide (TiO2) features the benefits of autogenous photocatalytic degradation of air contaminants. To evaluate its performance, laboratory experiments are commonly used to determine the kinetics of the photocatalytic-degradation rate, which is labor intensive, time-consuming, and costly. In this study, Machine Learning (ML) models were developed to predict the photo-degradation rate constants of air-borne organic contaminants with TiO2 nanoparticles and ultraviolet irradiation. The hyperparameters of the ML models were optimized, which included Artificial Neural Network (ANN) with Bayesian optimization, gradient booster regressor (GBR) with Bayesian optimization, Extreme Gradient Boosting (XGBoost) with optimization using Hyperopt, and Catboost combined with Adaboost. The organic contaminant was encoded through Molecular fingerprints (MF). Imputation method was applied to deal with the missing data. A generative ML model Vanilla Gan was utilized to create synthetic data to further augment the size of available dataset and the SHapley Additive exPlanations (SHAP) was employed for ML model interpretability. The results indicated that data imputation allowed for the full utilization of the limited dataset, leading to good machine learning prediction performance and preventing common overfitting problems with small-sized data. Additionally, augmenting experimental data with synthetic data significantly improved prediction accuracy and considerably reduced overfitting issues. The results ranked the feature importance and assessed the impacts of different experimental variables on the rate of photo-degradation, which were consistent with physico-chemical laws.

Funder

NSF

Publisher

Springer Science and Business Media LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3