Predicting drug properties with parameter-free machine learning: pareto-optimal embedded modeling (POEM)-Reference-Cited by-同舟云学术

Predicting drug properties with parameter-free machine learning: pareto-optimal embedded modeling (POEM)

Published:2020-05-19 Issue:2 Volume:1 Page:025008
ISSN:2632-2153
Container-title:Machine Learning: Science and Technology
language:
Short-container-title:Mach. Learn.: Sci. Technol.

Author:

Brereton Andrew E^ORCID,MacKinnon Stephen^ORCID,Safikhani Zhaleh,Reeves Shawn,Alwash Sana,Shahani Vijay,Windemuth Andreas^ORCID

Abstract

Abstract The prediction of absorption, distribution, metabolism, excretion, and toxicity (ADMET) of small molecules from their molecular structure is a central problem in medicinal chemistry with great practical importance in drug discovery. Creating predictive models conventionally requires substantial trial-and-error for the selection of molecular representations, machine learning (ML) algorithms, and hyperparameter tuning. A generally applicable method that performs well on all datasets without tuning would be of great value but is currently lacking. Here, we describe pareto-optimal embedded modeling (POEM), a similarity-based method for predicting molecular properties. POEM is a non-parametric, supervised ML algorithm developed to generate reliable predictive models without need for optimization. POEM’s predictive strength is obtained by combining multiple different representations of molecular structures in a context-specific manner, while maintaining low dimensionality. We benchmark POEM relative to industry-standard ML algorithms and published results across 17 classifications tasks. POEM performs well in all cases and reduces the risk of overfitting.

Funder

Ontario Centre of Excellence TalentEdge Data Analytics Internship

Publisher

IOP Publishing

Subject

Artificial Intelligence,Human-Computer Interaction,Software

Link

https://iopscience.iop.org/article/10.1088/2632-2153/ab891b/pdf

Reference39 articles.

1. QSAR—origins and present status: a historical perspective;Craig;Drug Inf. J.,1984

2. Informing the selection of screening hit series with in silico absorption, distribution, metabolism, excretion, and toxicity profiles;Sanders;J. Med. Chem.,2017

3. Computational methods for the prediction of drug-likeness;Clark;Drug Discovery Today,2000