Abstract
AbstractStructural bioinformatics and molecular modeling of proteins strongly depend on the protein structure selected for investigation. The choice of protein structure relies on direct application from the Protein Data Bank (PDB), homology- or de-novo modeling. Recent de-novo models, such as AlphaFold2, require little preprocessing and omit the need to navigate the many parameters of choosing an experimentally determined model. Yet, the experimentally determined structure still has much to offer, why it should be of interest to the community to ease the choice of experimentally determined models. We provide an open-source software package, PDBminer, to mine both the AlphaFold Database (AlphaFoldDB) and the PDB based on search criteria set by the user. This tool provides an up-to-date, quality-ranked table of structures applicable for further research. PDBminer provides an overview of the available protein structures to one or more input proteins, parallelizing the runs if multiple cores are specified. The output table reports the coverage of the protein structures aligned to the UniProt sequence, overcoming numbering differences in PDB structures, and providing information regarding model quality, protein complexes, ligands, and nucleotide binding. The PDBminer2coverage and PDBminer2network tools assist in visualizing the results. We suggest that PDBminer can be applied to overcome the tedious task of choosing a PDB structure without losing the wealth of additional information available in the PDB. As developers, we will guarantee the introduction of new functionalities, assistance, training of new contributors, and package maintenance. The package is available athttp://github.com/ELELAB/PDBminer.
Publisher
Cold Spring Harbor Laboratory
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献