Python workflow for the selection and identification of marker peptides—proof-of-principle study with heated milk-Reference-Cited by-同舟云学术

Python workflow for the selection and identification of marker peptides—proof-of-principle study with heated milk

Published:2024-04-12 Issue:14 Volume:416 Page:3349-3360
ISSN:1618-2642
Container-title:Analytical and Bioanalytical Chemistry
language:en
Short-container-title:Anal Bioanal Chem

Author:

Kuhnen Gesine,Class Lisa-Carina,Badekow Svenja,Hanisch Kim Lara,Rohn Sascha,Kuballa Jürgen

Abstract

AbstractThe analysis of almost holistic food profiles has developed considerably over the last years. This has also led to larger amounts of data and the ability to obtain more information about health-beneficial and adverse constituents in food than ever before. Especially in the field of proteomics, software is used for evaluation, and these do not provide specific approaches for unique monitoring questions. An additional and more comprehensive way of evaluation can be done with the programming language Python. It offers broad possibilities by a large ecosystem for mass spectrometric data analysis, but needs to be tailored for specific sets of features, the research questions behind. It also offers the applicability of various machine-learning approaches. The aim of the present study was to develop an algorithm for selecting and identifying potential marker peptides from mass spectrometric data. The workflow is divided into three steps: (I) feature engineering, (II) chemometric data analysis, and (III) feature identification. The first step is the transformation of the mass spectrometric data into a structure, which enables the application of existing data analysis packages in Python. The second step is the data analysis for selecting single features. These features are further processed in the third step, which is the feature identification. The data used exemplarily in this proof-of-principle approach was from a study on the influence of a heat treatment on the milk proteome/peptidome. Graphical abstract

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s00216-024-05286-w.pdf

Reference65 articles.

1. Parastar H, Tauler R. Big (bio) chemical data mining using chemometric methods : a need for chemists. Angew Chem. 2022;134:1–29. https://doi.org/10.1002/ange.201801134.

2. Mannila H (1996) Data mining: machine learning, statistics, and databases. In: Proceedings - 8th International Conference on Scientific and Statistical Data Base Management, SSDBM 1996. IEEE, pp 2–8.

3. Class L-C, Kuhnen G, Rohn S, Kuballa J. Diving deep into the data : a review of deep learning approaches and potential applications in foodomics. Foods. 2021;10:1–18. https://doi.org/10.3390/foods10081803.

4. Hibbert DB. Vocabulary of concepts and terms in chemometrics (IUPAC Recommendations 2016). Pure Appl Chem. 2016;88:407–43. https://doi.org/10.1515/pac-2015-0605.

5. Hibbert DB, Minkkinen P, Faber NM, Wise BM. IUPAC project: a glossary of concepts and terms in chemometrics. Anal Chim Acta. 2009;642:3–5. https://doi.org/10.1016/j.aca.2009.02.020.