Prediction of antibiotic resistance mechanisms using a protein language model

Published: 2024-05-06

Author:

Yagimoto Kanami, Hosoda Shion, Sato Miwa, Hamada Michiaki

Abstract

Motivation: Antibiotic resistance has emerged as a major global health threat, with an increasing number of bacterial infections becoming difficult to treat. Predicting the underlying resistance mechanisms of antibiotic resistance genes (ARGs) is crucial for understanding and combating this problem. However, existing methods struggle to accurately predict resistance mechanisms for ARGs with low similarity to known sequences and lack sufficient interpretability of the prediction models.

Results: In this study, we present a novel approach for predicting ARG resistance mechanisms using Protein-BERT, a protein language model based on deep learning. Our method outperforms state-of-the-art techniques on diverse ARG datasets, including those with low homology to the training data, highlighting its potential for predicting the resistance mechanisms of unknown ARGs. Attention analysis of the model reveals that it considers biologically relevant features, such as conserved amino acid residues and antibiotic target binding sites, when making predictions. These findings provide valuable insights into the molecular basis of antibiotic resistance and demonstrate the interpretability of protein language models, offering a new perspective on their application in bioinformatics.

Availability: The source code is available for free at https://github.com/hmdlab/ARG-BERT. The output results of the model are published at https://waseda.box.com/v/ARG-BERT-suppl.

Contact: mhamada@waseda.jp

Publisher

Cold Spring Harbor Laboratory
