BERNN: Enhancing classification of Liquid Chromatography Mass Spectrometry data with batch effect removal neural networks-Reference-Cited by-同舟云学术

BERNN: Enhancing classification of Liquid Chromatography Mass Spectrometry data with batch effect removal neural networks

Published:2024-05-06 Issue:1 Volume:15 Page:
ISSN:2041-1723
Container-title:Nature Communications
language:en
Short-container-title:Nat Commun

Author:

Pelletier Simon J.,Leclercq Mickaël,Roux-Dalvai Florence^ORCID,de Geus Matthijs B.^ORCID,Leslie Shannon,Wang Weiwei,Lam TuKiet T.^ORCID,Nairn Angus C.^ORCID,Arnold Steven E.,Carlyle Becky C.^ORCID,Precioso Frédéric,Droit Arnaud^ORCID

Abstract

AbstractLiquid Chromatography Mass Spectrometry (LC-MS) is a powerful method for profiling complex biological samples. However, batch effects typically arise from differences in sample processing protocols, experimental conditions, and data acquisition techniques, significantly impacting the interpretability of results. Correcting batch effects is crucial for the reproducibility of omics research, but current methods are not optimal for the removal of batch effects without compressing the genuine biological variation under study. We propose a suite of Batch Effect Removal Neural Networks (BERNN) to remove batch effects in large LC-MS experiments, with the goal of maximizing sample classification performance between conditions. More importantly, these models must efficiently generalize in batches not seen during training. A comparison of batch effect correction methods across five diverse datasets demonstrated that BERNN models consistently showed the strongest sample classification performance. However, the model producing the greatest classification improvements did not always perform best in terms of batch effect removal. Finally, we show that the overcorrection of batch effects resulted in the loss of some essential biological variability. These findings highlight the importance of balancing batch effect removal while preserving valuable biological diversity in large-scale LC-MS experiments.

Publisher

Springer Science and Business Media LLC

Link

https://www.nature.com/articles/s41467-024-48177-5.pdf

Reference47 articles.

1. Banerjee, S. Empowering clinical diagnostics with mass spectrometry. ACS Omega 5, 2041–2048 (2020).

2. de Fátima Cobre, A. et al. Diagnosis and prognosis of COVID-19 employing analysis of patients’ plasma and serum via LC-MS and machine learning. Comput. Biol. Med. 146, 105659 (2022).

3. Califf, R. M. Biomarker definitions and their applications. Exp. Biol. Med. 243, 213 (2018).

4. Han, W. & Li, L. Evaluating and minimizing batch effects in metabolomics. Mass Spectrom. Rev. 41, 421–442 (2022).

5. Niu, J., Yang, J., Guo, Y., Qian, K. & Wang, Q. Joint deep learning for batch effect removal and classification toward MALDI MS based metabolomics. BMC Bioinforma. 23, 1–19 (2022).