Leveraging R (LevR) for fast processing of mass spectrometry data and machine learning: Applications analyzing fingerprints and glycopeptides-Reference-Cited by-同舟云学术

Leveraging R (LevR) for fast processing of mass spectrometry data and machine learning: Applications analyzing fingerprints and glycopeptides

Published:2022-08-23 Issue: Volume:2 Page:
ISSN:2673-9283
Container-title:Frontiers in Analytical Science
language:
Short-container-title:Front. Anal. Sci.

Author:

Pfeifer Leah D.,Patabandige Milani W.,Desaire Heather

Abstract

Applying machine learning strategies to interpret mass spectrometry data has the potential to revolutionize the way in which disease is diagnosed, prognosed, and treated. A persistent and tedious obstacle, however, is relaying mass spectrometry data to the machine learning algorithm. Given the native format and large size of mass spectrometry data files, preprocessing is a critical step. To ameliorate this challenge, we sought to create an easy-to-use, continuous pipeline that runs from data acquisition to the machine learning algorithm. Here, we present a start-to-finish pipeline designed to facilitate supervised and unsupervised classification of mass spectrometry data. The input can be any ESI data set collected by LC-MS or flow injection, and the output is a machine learning ready matrix, in which each row is a feature (an abundance of a particular m/z), and each column is a sample. This workflow provides automated handling of large mass spectrometry data sets for researchers seeking to implement machine learning strategies but who lack expertise in programming/coding to rapidly format the data. We demonstrate how the pipeline can be used on two different mass spectrometry data sets: 1) ESI-MS of fingerprint lipid compositions acquired by direct infusion and, 2) LC-MS of IgG glycopeptides. This workflow is uncomplicated and provides value via its simplicity and effectiveness.

Funder

University of Kansas

Publisher

Frontiers Media SA

Reference48 articles.

1. The translation of lipid profiles to nutritional biomarkers in the study of infant metabolism;Acharjee;Metabolomics,2017

2. Changes in the lipid composition of latent fingerprint residue with time after deposition on a surface;Archer;Forensic Sci. Int.,2005

3. Analysis of amino acids in latent fingerprint residue by capillary electrophoresis-mass spectrometry;Atherton;J. Sep. Sci.,2012

4. Identification and dereplication of endophytic Colletotrichum strains by MALDI TOF mass spectrometry and molecular networking;Barthélemy;Sci. Rep.,2020

5. Lifestyle chemistries from phones for individual profiling;Bouslimani;Proc. Natl. Acad. Sci. U. S. A.,2016

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Skin Surface Sebum Analysis by ESI-MS;Biomolecules;2024-07-03

2. Enabling Lipidomic Biomarker Studies for Protected Populations by Combining Noninvasive Fingerprint Sampling with MS Analysis and Machine Learning;Journal of Proteome Research;2024-01-03

3. Workflow for Evaluating Normalization Tools for Omics Data Using Supervised and Unsupervised Machine Learning;Journal of the American Society for Mass Spectrometry;2023-10-28

4. Learning channel-selective and aberrance repressed correlation filter with memory model for unmanned aerial vehicle object tracking;Frontiers in Neuroscience;2023-01-10