Abstract
AbstractTraining neural networks with small and imbalanced datasets often leads to overfitting and disregard of the minority class. For predictive toxicology, however, models with a good balance between sensitivity and specificity are needed. In this paper we introduce conformational oversampling as a means to balance and oversample datasets for prediction of toxicity. Conformational oversampling enhances a dataset by generation of multiple conformations of a molecule. These conformations can be used to balance, as well as oversample a dataset, thereby increasing the dataset size without the need of artificial samples. We show that conformational oversampling facilitates training of neural networks and provides state-of-the-art results on the Tox21 dataset.
Funder
Innovative Medicines Initiative
Austrian Science Fund
Publisher
Springer Science and Business Media LLC
Subject
Library and Information Sciences,Computer Graphics and Computer-Aided Design,Physical and Theoretical Chemistry,Computer Science Applications
Reference51 articles.
1. Russell WMS, Burch RL (1959) The principles of humane experimental technique. Methuen, London. OCLC: 595267154. http://books.google.com/books?id=j75qAAAAMAAJ. Accessed 18 Feb 2019
2. Zurlo J, Rudacille D, Goldberg AM (1996) The three Rs: the way forward. Environ Health Perspect 104(8):878–880. https://doi.org/10.1289/ehp.96104878
3. Executive Committee of the Congress (2009) Background to the three Rs declaration of Bologna, as adopted by the 3rd world congress on alternatives and animal use in the life sciences, Bologna, Italy, on 31 August 1999. Alternatives to laboratory animals: ATLA, vol 37, no 3, pp 286–289
4. Khanna I (2012) Drug discovery in pharmaceutical industry: productivity challenges and trends. Drug Discov Today 17(19):1088–1102. https://doi.org/10.1016/j.drudis.2012.05.007
5. Scannell JW, Blanckley A, Boldon H, Warrington B (2012) Diagnosing the decline in pharmaceutical R&D efficiency. Nat Rev Drug Discov 11(3):191–200. https://doi.org/10.1038/nrd3681
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献