PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions-Reference-Cited by-同舟云学术

PairK: Pairwise k-mer alignment for quantifying protein motif conservation in disordered regions

Published:2024-07-24 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Halpin Jackson C.^ORCID,Keating Amy E.^ORCID

Abstract

ABSTRACTProtein-protein interactions are often mediated by a modular peptide recognition domain binding to a short linear motif (SLiM) in the disordered region of another protein. The ability to predict domain-SLiM interactions would allow researchers to map protein interaction networks, predict the effects of perturbations to those networks, and develop biologically meaningful hypotheses. Unfortunately, sequence database searches for SLiMs generally yield mostly biologically irrelevant motif matches or false positives. To improve the prediction of novel SLiM interactions, researchers employ filters to discriminate between biologically relevant and improbable motif matches. One promising criterion for identifying biologically relevant SLiMs is the sequence conservation of the motif, exploiting the fact that functional motifs are more likely to be conserved than spurious motif matches. However, the difficulty of aligning disordered regions has significantly hampered the utility of this approach. We present PairK (pairwise k-mer alignment), an MSA-free method to quantify motif conservation in disordered regions. PairK outperforms both standard MSA-based conservation scores and a modern LLM-based conservation score predictor on the task of identifying biologically important motif instances. PairK can quantify conservation over wider phylogenetic distances than MSAs, indicating that SLiMs may be more conserved than is implied by MSA-based metrics. PairK is available as open-source code athttps://github.com/jacksonh1/pairk.

Publisher

Cold Spring Harbor Laboratory

Reference71 articles.

1. The Eukaryotic Linear Motif resource: 2022 release

2. Dual epitope recognition by the VASP EVH1 domain modulates polyproline ligand specificity and binding affinity

3. A distributed residue network permits conformational binding specificity in a conserved family of actin remodelers

4. A Noncanonical Binding Site in the EVH1 Domain of Vasodilator-Stimulated Phosphoprotein Regulates Its Interactions with the Proline Rich Region of Zyxin

5. A Thermodynamic Model for Multivalency in 14-3-3 Protein–Protein Interactions