Target identification of drug candidates with machine-learning algorithms: how choosing negative examples for training

Published: 2021-04-06

Author:

Najm Matthieu, Azencott Chloé-Agathe, Playe Benoit, Stoven Véronique

Abstract

(1) Background: Identifying the protein targets of hit molecules is essential in the drug discovery process. Target prediction with machine-learning algorithms can help accelerate this search and limit the number of experiments required. However, the drug-target interaction databases used for training are strongly statistically biased, which leads to a high number of false-positive predicted targets and thus increases the time and cost of experimental validation campaigns. (2) Methods: To minimize the number of false-positive predicted proteins, we propose a new scheme for choosing negative examples, in which each protein and each drug appears an equal number of times in positive and negative examples. We artificially reproduce the process of target identification for 3 particular drugs and, more globally, for 200 approved drugs. (3) Results: For the 3 detailed drug examples and for the larger set of 200 drugs, training with the proposed scheme for choosing negative examples improved target prediction: the average number of false positives among the top-ranked predicted targets decreased, and overall the rank of the true targets improved. (4) Conclusion: Our method corrects the statistical bias of the databases and reduces the number of false-positive predictions, and therefore the number of useless experiments potentially undertaken.
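
The balancing constraint described in the Methods (each drug and each protein appearing equally often in positive and negative examples) can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `balanced_negatives`, the greedy ordering, and the toy drug and protein identifiers are assumptions made for illustration, and every unobserved pair is assumed to be a non-interaction. A greedy pass like this one is also not guaranteed to reach exact balance on every dataset.

```python
from collections import Counter


def balanced_negatives(positives):
    """Greedily pick negative (drug, protein) pairs so that each drug and each
    protein appears about as often in the negatives as in the positives.

    Hypothetical helper for illustration; unobserved pairs are assumed negative.
    """
    positives = set(positives)
    # Each entity may appear in as many negatives as it has positives.
    drug_quota = Counter(drug for drug, _ in positives)
    protein_quota = Counter(protein for _, protein in positives)

    negatives = set()
    # Most-connected proteins first: they are the hardest to balance.
    ordered = sorted(protein_quota.items(), key=lambda kv: (-kv[1], kv[0]))
    for protein, needed in ordered:
        # Drugs with remaining quota that are not known to bind this protein,
        # preferring the drugs that still need the most negatives.
        candidates = sorted(
            (d for d, q in drug_quota.items()
             if q > 0 and (d, protein) not in positives),
            key=lambda d: (-drug_quota[d], d),
        )
        for drug in candidates[:needed]:
            negatives.add((drug, protein))
            drug_quota[drug] -= 1
    return negatives


# Toy data (made up): two proteins, each with two known ligands.
positives = [("d1", "p1"), ("d2", "p1"), ("d3", "p2"), ("d4", "p2")]
negatives = balanced_negatives(positives)
for pair in sorted(negatives):
    print(pair)
```

On this toy input every drug and every protein ends up with the same count in both sets. On real interaction data, a drug or protein with many known partners may not have enough unobserved pairs left, so it is worth checking the resulting counts before training.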

Publisher

Cold Spring Harbor Laboratory

Cited by 1 article.

1. Recent Advances and Techniques for Identifying Novel Antibacterial Targets; Current Medicinal Chemistry; 2024-02