PDBrenum: A webserver and program providing Protein Data Bank files renumbered according to their UniProt sequences

Published:2021-07-06 Issue:7 Volume:16 Page:e0253411
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Faezov Bulat, Dunbrack Roland L.

Abstract

The Protein Data Bank (PDB) was established at Brookhaven National Laboratory in 1971 as an archive for biological macromolecular crystal structures. As of mid-2021, the database holds almost 180,000 structures solved by X-ray crystallography, nuclear magnetic resonance, cryo-electron microscopy, and other methods. Many proteins have been studied under different conditions: with binding partners such as ligands, nucleic acids, or other proteins; with mutations; and with post-translational modifications. This variety enables extensive comparative structure-function studies. However, these studies are made more difficult because the PDB allows authors to number the amino acids in each protein sequence in any manner they wish. As a result, the same protein is often numbered differently across the available PDB entries. For instance, some authors include N-terminal signal peptides or the N-terminal methionine in the sequence numbering and others do not. In addition to the coordinates, many fields in each file contain structural and functional information about specific residues, and these fields also use the author numbering. Here we provide a webserver and Python3 application that fixes the PDB sequence numbering problem by replacing the author numbering with numbering derived from the corresponding UniProt sequences. We obtain this correspondence from the SIFTS database from PDBe. The server and program can take a list of PDB entries or a list of UniProt identifiers (e.g., “P04637” or “P53_HUMAN”) and provide renumbered files in mmCIF format and the legacy PDB format for both asymmetric unit files and biological assembly files provided by PDBe.
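The renumbering idea described in the abstract can be illustrated with a minimal Python sketch. This is not the PDBrenum code: the function name `renumber`, the toy residue list, the mapping values, and the `offset_unmapped` convention for residues with no UniProt counterpart are all assumptions made for illustration. In practice the author-to-UniProt correspondence would come from SIFTS rather than a hand-written dictionary.

```python
from typing import Dict, List, Tuple

# (chain ID, author residue number) -> UniProt residue number.
# Hypothetical toy values; a real mapping would be derived from SIFTS.
AuthorKey = Tuple[str, int]
Residue = Tuple[str, int, str]  # (chain ID, residue number, residue name)


def renumber(
    residues: List[Residue],
    mapping: Dict[AuthorKey, int],
    offset_unmapped: int = 5000,
) -> List[Residue]:
    """Replace author residue numbers with UniProt numbers.

    Residues that have no UniProt counterpart (for example, expression
    tags) are kept but shifted by `offset_unmapped` so they cannot collide
    with mapped numbers. The offset is an assumption of this sketch.
    """
    renumbered = []
    for chain, author_number, name in residues:
        uniprot_number = mapping.get((chain, author_number))
        if uniprot_number is None:
            new_number = author_number + offset_unmapped
        else:
            new_number = uniprot_number
        renumbered.append((chain, new_number, name))
    return renumbered


# Made-up example: the author numbered a two-residue tag as 1-2,
# so the UniProt sequence starts at author residue 3.
residues = [("A", 1, "GLY"), ("A", 2, "SER"), ("A", 3, "MET"), ("A", 4, "GLU")]
mapping = {("A", 3): 1, ("A", 4): 2}
renumbered = renumber(residues, mapping)
```

Keying the mapping on both chain and author number matters because the same author number can refer to different UniProt positions in different chains of one entry.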

Funder

National Institute of General Medical Sciences

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

References (20 articles)

1. The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data;H Berman;Nucleic Acids Res,2007

2. RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy;SK Burley;Nucleic Acids Res,2019

3. PDBe: improved findability of macromolecular structure data in the PDB;DR Armstrong;Nucleic Acids Res,2020

4. New tools and functions in data-out activities at Protein Data Bank Japan (PDBj);AR Kinjo;Protein Sci,2018

5. GenBank;EW Sayers;Nucleic Acids Res,2019

Cited by 26 articles.

1. Ginkgo biloba active compounds can modulate the development of acute mountain sickness and ischemic stroke as discovered by network pharmacology and molecular docking;iLABMED;2024-09-03

2. Next-generation Drosophila protein interactome map and its functional implications;Developmental Cell;2024-06

3. Improving AlphaFold Predicted Contacts for Alpha-Helical Transmembrane Proteins Using Structural Features;International Journal of Molecular Sciences;2024-05-11

4. DrugMap: A quantitative pan-cancer analysis of cysteine ligandability;Cell;2024-05

5. Quercetin alleviates hyperoxia‐induced bronchopulmonary dysplasia by inhibiting ferroptosis through the MAPK/PTGS2 pathway: Insights from network pharmacology, molecular docking, and experimental evaluations;Chemical Biology & Drug Design;2024-04