Comprehensive Database of Circular Permutations: Systematic Detection and Analysis Using Deep Learning

Published: 2024-08-28

Author:

Hu Yue, Huang Bin

Abstract

This study presents a comprehensive approach to detecting circular permutations in the Protein Data Bank (PDB), covering the 287,081 proteins with a sequence length under 800 that were available as of 2024-01-01. We systematically analyzed the PDB to identify circular permutations, leveraging FoldSeek and MMseqs2 for structural and sequence similarity searches. The resulting 143,756,535 candidate pairs were filtered by thresholds before further analysis. TM-align, icarus, or plmCP was then used to align the protein structures and refine detection accuracy, which facilitated the precise identification of circular permutations. In total, we obtained 20,801 candidate circular permutation pairs and 3,351 circular permutation proteins (https://github.com/YueHuLab/Circular-permutation-in-PDB). Our methodology provides a robust framework for uncovering circular permutations in protein databases, enhancing our understanding of protein structural variations and evolutionary adaptations.
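To make the notion of a circular permutation concrete, here is a minimal Python sketch of the idealized sequence-level case: one sequence is a circular permutation of another when it equals the other with its N-terminal segment moved to the C-terminus. This is only a toy illustration and not the authors' pipeline, which relies on structural and sequence search followed by alignment and tolerates divergence between the two proteins. The function name `circular_permutation_offset` and the example sequences are hypothetical.

```python
from typing import Optional


def circular_permutation_offset(seq_a: str, seq_b: str) -> Optional[int]:
    """Return k such that seq_b == seq_a[k:] + seq_a[:k], or None.

    Toy exact-match check (an assumption for illustration only):
    real circular permutants diverge in sequence, so practical
    detection needs alignment-based methods instead.
    """
    if len(seq_a) != len(seq_b):
        return None
    if not seq_a:
        return 0
    # Every rotation of seq_a occurs as a substring of seq_a + seq_a.
    k = (seq_a + seq_a).find(seq_b)
    return k if k != -1 else None


# Hypothetical example: the first three residues move to the end.
print(circular_permutation_offset("ABCDEFGH", "DEFGHABC"))  # 3
print(circular_permutation_offset("ABCDEFGH", "ABCDEFHG"))  # None
```

The returned offset corresponds to the position in the first sequence where the second sequence begins, that is, the location of the new N-terminus in this idealized case.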

Publisher

Cold Spring Harbor Laboratory
