Amino Acid k-mer Feature Extraction for Quantitative Antimicrobial Resistance (AMR) Prediction by Machine Learning and Model Interpretation for Biological Insights-Reference-Cited by-同舟云学术

Amino Acid k-mer Feature Extraction for Quantitative Antimicrobial Resistance (AMR) Prediction by Machine Learning and Model Interpretation for Biological Insights

Published:2020-10-28 Issue:11 Volume:9 Page:365
ISSN:2079-7737
Container-title:Biology
language:en
Short-container-title:Biology

Author:

ValizadehAslani Taha^ORCID,Zhao Zhengqiao^ORCID,Sokhansanj Bahrad A.^ORCID,Rosen Gail L.^ORCID

Abstract

Machine learning algorithms can learn mechanisms of antimicrobial resistance from the data of DNA sequence without any a priori information. Interpreting a trained machine learning algorithm can be exploited for validating the model and obtaining new information about resistance mechanisms. Different feature extraction methods, such as SNP calling and counting nucleotide k-mers have been proposed for presenting DNA sequences to the model. However, there are trade-offs between interpretability, computational complexity and accuracy for different feature extraction methods. In this study, we have proposed a new feature extraction method, counting amino acid k-mers or oligopeptides, which provides easier model interpretation compared to counting nucleotide k-mers and reaches the same or even better accuracy in comparison with different methods. Additionally, we have trained machine learning algorithms using different feature extraction methods and compared the results in terms of accuracy, model interpretability and computational complexity. We have built a new feature selection pipeline for extraction of important features so that new AMR determinants can be discovered by analyzing these features. This pipeline allows the construction of models that only use a small number of features and can predict resistance accurately.

Publisher

MDPI AG

Subject

General Agricultural and Biological Sciences,General Immunology and Microbiology,General Biochemistry, Genetics and Molecular Biology

Link

https://www.mdpi.com/2079-7737/9/11/365/pdf

Reference127 articles.

1. Attributable deaths and disability-adjusted life-years caused by infections with antibiotic-resistant bacteria in the EU and the European Economic Area in 2015: a population-level modelling analysis

2. Looming Global-Scale Failures and Missing Institutions

3. Antibiotic resistance: a rundown of a global crisis

4. Strategies for achieving global collective action on antimicrobial resistance

5. New Societal Approaches to Empowering Antibiotic Stewardship

Cited by 27 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A machine learning-based strategy to elucidate the identification of antibiotic resistance in bacteria;Frontiers in Antibiotics;2024-06-18

2. Machine learning-based antibiotic resistance prediction models: An updated systematic review and meta-analysis;Technology and Health Care;2024-05-26

3. Tackling the Antimicrobial Resistance “Pandemic” with Machine Learning Tools: A Summary of Available Evidence;Microorganisms;2024-04-23

4. Effect of tokenization on transformers for biological sequences;Bioinformatics;2024-03-29

5. TCRpred: incorporating T-cell receptor repertoire for clinical outcome prediction;Frontiers in Genetics;2024-03-13