<scp>PrePCI</scp>: A structure‐ and chemical similarity‐informed database of predicted protein compound interactions-Reference-Cited by-同舟云学术

PrePCI: A structure‐ and chemical similarity‐informed database of predicted protein compound interactions

Published:2023-03-16 Issue:4 Volume:32 Page:
ISSN:0961-8368
Container-title:Protein Science
language:en
Short-container-title:Protein Science

Author:

Trudeau Stephen J.¹²,Hwang Howook¹³,Mathur Deepika¹⁴⁵,Begum Kamrun¹,Petrey Donald¹,Murray Diana¹,Honig Barry¹⁶⁷⁸^ORCID

Affiliation:

1. Department of Systems Biology Columbia University Irving Medical Center New York New York USA

2. Integrated Graduate Program in Cellular, Molecular and Biomedical Studies (CMBS), Columbia University Irving Medical Center New York New York USA

3. Schrodinger, Inc. New York New York USA

4. Department of Genetics and Genomic Sciences Icahn School of Medicine at Mount Sinai New York New York USA

5. Department of Psychiatry Icahn School of Medicine at Mount Sinai New York New York USA

6. Department of Biochemistry and Molecular Biophysics Columbia University Irving Medical Center New York New York USA

7. Department of Medicine Columbia University New York New York USA

8. Zuckerman Mind Brain and Behavior Institute Columbia University New York New York USA

Abstract

AbstractWe describe the Predicting Protein–Compound Interactions (PrePCI) database which comprises over 5 billion predicted interactions between 6.8 million chemical compounds and 19,797 human proteins. PrePCI relies on a proteome‐wide database of structural models based on both traditional modeling techniques and the AlphaFold Protein Structure Database. Sequence‐ and structural similarity‐based metrics are established between template proteins, T, in the Protein Data Bank that bind compounds, C, and query proteins in the model database, Q. When the metrics exceed threshold values, it is assumed that C also binds to Q with a likelihood ratio (LR) derived from machine learning. If the relationship is based on structural similarity, the LR is based on a scoring function that measures the extent to which C is compatible with the binding site of Q as described in the LT‐scanner algorithm. For every predicted complex derived in this way, chemical similarity based on the Tanimoto coefficient identifies other small molecules that may bind to Q. An overall LR for the binding of C to Q is obtained from Naive Bayesian statistics. The PrePCI database can be queried by entering a UniProt ID or gene name for a protein to obtain a list of compounds predicted to bind to it along with associated LRs. Alternatively, entering an identifier for the compound outputs a list of proteins it is predicted to bind. Specific applications of the database to lead discovery, elucidation of drug mechanism of action, and biological function annotation are described.

Publisher

Wiley

Subject

Molecular Biology,Biochemistry

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/pro.4594

Reference66 articles.

1. Advancing the activity cliff concept;Bajorath J;F1000Res,2013

2. Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations?

3. A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking

4. UniProt: the universal protein knowledgebase in 2023;Bateman A;Nucleic Acids Res,2022

5. The Protein Data Bank

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Databases of ligand-binding pockets and protein-ligand interactions;Computational and Structural Biotechnology Journal;2024-12

2. MAGPIE: An interactive tool for visualizing and analyzing protein–ligand interactions;Protein Science;2024-07-11

3. A Review of Protein-Protein Interaction Databases;Reference Module in Life Sciences;2024

4. In silico Screening of Plectranthus ampoinicus and Hyptis suaveolens Phytochemicals: Novel Repellents Targeting Odorant Binding Proteins of Aedes aegypti and Aedes albopictus;2023-11-14

5. PrePPI: A Structure Informed Proteome-wide Database of Protein–Protein Interactions;Journal of Molecular Biology;2023-07