Abstract
Ever more data are freely available today. Trained on these large datasets, deep neural networks (DNNs) are rapidly gaining relevance in computational chemistry. Here, we explore the potential of DNNs to predict chemical properties from chemical structures. As an example, we selected the octanol-water partition coefficient (log P), which plays an essential role in environmental chemistry and toxicology as well as in chemical analysis. The developed DNN predicts well, with a root mean square error (RMSE) of 0.47 log units on the test dataset and an RMSE of 0.33 on an external dataset from the SAMPL6 challenge. To achieve this, we trained the DNN with data augmentation that considers all potential tautomeric forms of the chemicals. We further demonstrate how DNN models can help curate the log P dataset by identifying potential errors, and we address limitations of the dataset itself.
Publisher
Springer Science and Business Media LLC
Subject
Materials Chemistry, Biochemistry, Environmental Chemistry, General Chemistry