Drug Target Identification with Machine Learning: How to Choose Negative Examples-Reference-Cited by-同舟云学术

Drug Target Identification with Machine Learning: How to Choose Negative Examples

Published:2021-05-12 Issue:10 Volume:22 Page:5118
ISSN:1422-0067
Container-title:International Journal of Molecular Sciences
language:en
Short-container-title:IJMS

Author:

Najm Matthieu^ORCID,Azencott Chloé-Agathe^ORCID,Playe Benoit^ORCID,Stoven Véronique^ORCID

Abstract

Identification of the protein targets of hit molecules is essential in the drug discovery process. Target prediction with machine learning algorithms can help accelerate this search, limiting the number of required experiments. However, Drug-Target Interactions databases used for training present high statistical bias, leading to a high number of false positives, thus increasing time and cost of experimental validation campaigns. To minimize the number of false positives among predicted targets, we propose a new scheme for choosing negative examples, so that each protein and each drug appears an equal number of times in positive and negative examples. We artificially reproduce the process of target identification for three specific drugs, and more globally for 200 approved drugs. For the detailed three drug examples, and for the larger set of 200 drugs, training with the proposed scheme for the choice of negative examples improved target prediction results: the average number of false positives among the top ranked predicted targets decreased, and overall, the rank of the true targets was improved.Our method corrects databases’ statistical bias and reduces the number of false positive predictions, and therefore the number of useless experiments potentially undertaken.

Funder

Vaincre la Mucoviscidose

Publisher

MDPI AG

Subject

Inorganic Chemistry,Organic Chemistry,Physical and Theoretical Chemistry,Computer Science Applications,Spectroscopy,Molecular Biology,General Medicine,Catalysis

Link

https://www.mdpi.com/1422-0067/22/10/5118/pdf

Reference32 articles.

1. How were new medicines discovered?

2. Opportunities and challenges in phenotypic drug discovery: an industry perspective

3. State of the Art Review and Report of New Tool for Drug Discovery

4. Docking-based inverse virtual screening: methods, applications, and challenges

5. Machine Learning for In Silico Virtual Screening and Chemical Genomics: New Strategies

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An Extended Feature Representation Technique for Predicting Sequenced-based Host-pathogen Protein-protein Interaction;Current Bioinformatics;2025-01

2. Drug–Target Interactions Prediction at Scale: The Komet Algorithm with the LCIdb Dataset;Journal of Chemical Information and Modeling;2024-09-05

3. Artificial Intelligence in Drug Identification and Validation: A Scoping Review;Drug Research;2024-06

4. Gtie-Rt: A comprehensive graph learning model for predicting drugs targeting metabolic pathways in human;Journal of Bioinformatics and Computational Biology;2024-06

5. Introduction;Big Data Analysis and Artificial Intelligence for Medical Sciences;2024-05-10