BatchDTA: implicit batch alignment enhances deep learning-based drug–target affinity estimation-Reference-Cited by-同舟云学术

BatchDTA: implicit batch alignment enhances deep learning-based drug–target affinity estimation

Published:2022-07-07 Issue:4 Volume:23 Page:
ISSN:1467-5463
Container-title:Briefings in Bioinformatics
language:en
Short-container-title:

Author:

Luo Hongyu¹,Xiang Yingfei¹,Fang Xiaomin¹,Lin Wei¹,Wang Fan¹,Wu Hua²,Wang Haifeng²

Affiliation:

1. PaddleHelix team, Baidu Inc. , 518000, Shenzhen, China

2. Baidu Inc. , 100000, Beijing, China

Abstract

Abstract Candidate compounds with high binding affinities toward a target protein are likely to be developed as drugs. Deep neural networks (DNNs) have attracted increasing attention for drug–target affinity (DTA) estimation owning to their efficiency. However, the negative impact of batch effects caused by measure metrics, system technologies and other assay information is seldom discussed when training a DNN model for DTA. Suffering from the data deviation caused by batch effects, the DNN models can only be trained on a small amount of ‘clean’ data. Thus, it is challenging for them to provide precise and consistent estimations. We design a batch-sensitive training framework, namely BatchDTA, to train the DNN models. BatchDTA implicitly aligns multiple batches toward the same protein through learning the orders of candidate compounds with respect to the batches, alleviating the impact of the batch effects on the DNN models. Extensive experiments demonstrate that BatchDTA facilitates four mainstream DNN models to enhance the ability and robustness on multiple DTA datasets (BindingDB, Davis and KIBA). The average concordance index of the DNN models achieves a relative improvement of 4.0%. The case study reveals that BatchDTA can successfully learn the ranking orders of the compounds from multiple batches. In addition, BatchDTA can also be applied to the fused data collected from multiple sources to achieve further improvement.

Publisher

Oxford University Press (OUP)

Subject

Molecular Biology,Information Systems

Link

https://academic.oup.com/bib/article-pdf/23/4/bbac260/45017546/bbac260.pdf

Reference49 articles.

1. On the design and analysis of gene expression studies in human populations;Akey;Nat Genet,2007

2. A survey of cross-validation procedures for model selection;Arlot;Statistics surveys,2010

3. High-resolution serum proteomic patterns for ovarian cancer detection;Baggerly;Endocr Relat Cancer,2004

4. Learning to rank using gradient descent

5. Machine learning for drug-target interaction prediction;Chen;Molecules,2018

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Exploring the anti-gout potential of sunflower receptacles alkaloids: A computational and pharmacological analysis;Computers in Biology and Medicine;2024-04

2. DrugMGR: a deep bioactive molecule binding method to identify compounds targeting proteins;Bioinformatics;2024-03-29

3. The discovery of subunit-selective GluN1/GluN2B NMDAR antagonist via pharmacophere-based virtual screening;Experimental Biology and Medicine;2023-12

4. Multi-task bioassay pre-training for protein-ligand binding affinity prediction;Briefings in Bioinformatics;2023-11-22

5. Artificial intelligence in systems biology;Handbook of Statistics;2023