Affiliation:
1. Department of Bioinformatics, School of Biological and Behavioural Sciences, Queen Mary University of London , London E1 4NS, UK
Abstract
Abstract
Motivation
Antimicrobial peptides (AMPs) are increasingly being used in the development of new therapeutic drugs in areas such as cancer therapy and hypertension. Additionally, they are seen as an alternative to antibiotics due to the increasing occurrence of bacterial resistance. Wet-laboratory experimental identification, however, is both time-consuming and costly, so in silico models are now commonly used in order to screen new AMP candidates.
Results
This paper proposes a novel approach for creating model inputs; using pre-trained language models to produce contextualized embeddings, representing the amino acids within each peptide sequence, before a convolutional neural network is trained as the classifier. The results were validated on two datasets—one previously used in AMP prediction research, and a larger independent dataset created by this paper. Predictive accuracies of 93.33% and 88.26% were achieved, respectively, outperforming previous state-of-the-art classification models.
Availability and implementation
All codes are available and can be accessed here: https://github.com/williamdee1/LMPred_AMP_Prediction.
Supplementary information
Supplementary data are available at Bioinformatics Advances online.
Publisher
Oxford University Press (OUP)
Subject
Cell Biology,Developmental Biology,Embryology,Anatomy
Cited by
16 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献