Benchmarking Data Sets from PubChem BioAssay Data: Current Scenario and Room for Improvement

Published:2020-06-19 Issue:12 Volume:21 Page:4380
ISSN:1422-0067
Container-title:International Journal of Molecular Sciences
language:en
Short-container-title:IJMS

Author:

Tran-Nguyen Viet-Khoa, Rognan Didier

Abstract

Developing realistic data sets for evaluating virtual screening methods is a task that has been tackled by the cheminformatics community for many years. Numerous artificially constructed data collections were developed, such as DUD, DUD-E, or DEKOIS. However, they all suffer from multiple drawbacks, one of which is the absence of experimental results confirming the impotence of presumably inactive molecules, leading to possible false negatives in the ligand sets. In light of this problem, the PubChem BioAssay database, an open-access repository providing the bioactivity information of compounds that were already tested on a biological target, is now a recommended source for data set construction. Nevertheless, there exist several issues with the use of such data that need to be properly addressed. In this article, an overview of benchmarking data collections built upon experimental PubChem BioAssay input is provided, along with a thorough discussion of noteworthy issues that one must consider during the design of new ligand sets from this database. The points raised in this review are expected to guide future developments in this regard, in hopes of offering better evaluation tools for novel in silico screening procedures.

Publisher

MDPI AG

Subject

Inorganic Chemistry, Organic Chemistry, Physical and Theoretical Chemistry, Computer Science Applications, Spectroscopy, Molecular Biology, General Medicine, Catalysis

Link

https://www.mdpi.com/1422-0067/21/12/4380/pdf

Reference: 114 articles.

1. PubChem: a public information system for analyzing bioactivities of small molecules

2. PubChem as a public resource for drug discovery

3. An overview of the PubChem BioAssay resource

4. PubChem's BioAssay Database

5. PubChem BioAssay: A Decade’s Development toward Open High-Throughput Screening Data Sharing

Cited by 11 articles.

1. Impact of Artificial Intelligence on Drug Development and Delivery;Current Topics in Medicinal Chemistry;2024-08-12

2. Do Molecular Fingerprints Identify Diverse Active Drugs in Large-Scale Virtual Screening? (No);Pharmaceuticals;2024-07-26

3. ClassyPose: A Machine‐Learning Classification Model for Ligand Pose Selection Applied to Virtual Screening in Drug Discovery;Advanced Intelligent Systems;2024-05-12

4. A practical guide to machine-learning scoring for structure-based virtual screening;Nature Protocols;2023-10-16

5. Copper transporter protein (MctB) as a therapeutic target to elicit antimycobacterial activity against tuberculosis;Journal of Biomolecular Structure and Dynamics;2023-06-20