Comprehensive detection of analytes in large chromatographic datasets by coupling factor analysis with a decision tree
-
Published:2022-09-05
Issue:17
Volume:15
Page:5061-5075
-
ISSN:1867-8548
-
Container-title:Atmospheric Measurement Techniques
-
language:en
-
Short-container-title:Atmos. Meas. Tech.
Author:
Kim Sungwoo, Lerner Brian M.ORCID, Sueper Donna T., Isaacman-VanWertz GabrielORCID
Abstract
Abstract. Environmental samples typically contain hundreds or thousands of unique organic compounds, and even minor components may provide valuable insight
into their sources and transformations. To understand atmospheric processes, individual components are frequently identified and quantified using
gas chromatography–mass spectrometry. However, due to the complexity and frequently variable nature of such data, data reduction is a significant
bottleneck in analysis. Consequently, only a subset of known analytes is often reported for a dataset, and large amounts of potentially useful data
are discarded. We present an automated approach of cataloging and potentially identifying all analytes in a large chromatographic dataset and
demonstrate the utility of our approach in an analysis of ambient aerosols. We use a coupled factor analysis–decision tree approach to deconvolute
peaks and comprehensively catalog nearly all analytes in a dataset. Positive matrix factorization (PMF) of small subsections of multiple
chromatograms is applied to extract factors that represent chromatographic profiles and mass spectra of potential analytes, in which peaks are
detected. A decision tree based on peak parameters (e.g., location, width, and height), relative ratios of those parameters, peak shape, noise,
retention time, and mass spectrum is applied to discard erroneous peaks and combine peaks determined to represent the same analyte. With our
approach, all analytes within the small section of the chromatogram are cataloged, and the process is repeated for overlapping sections across the
chromatogram, generating a complete list of the retention times and estimated mass spectra of all peaks in a dataset. We validate this approach
using samples of known compounds and demonstrate the separation of poorly resolved peaks with similar mass spectra and the resolution of peaks that
appear in only a fraction of chromatograms. As a case study, this method is applied to a complex real-world dataset of the composition of
atmospheric particles, in which more than 1100 unique chromatographic peaks are resolved, and the corresponding peak information along with mass spectra are cataloged.
Funder
National Oceanic and Atmospheric Administration
Publisher
Copernicus GmbH
Subject
Atmospheric Science
Reference56 articles.
1. Amigo, J. M., Popielarz, M. J., Callejon, R. M., Morales, M. L., Troncoso, A. M., Petersen, M. A., and Toldam-Andersen, T. B.:
Comprehensive analysis of chromatographic data by using PARAFAC2 and principal components analysis, J. Chromatogr. A, 1217, 4422–4429, https://doi.org/10.1016/j.chroma.2010.04.042, 2010. 2. Anderson, A. H., Gibb, T. C., and Littlewood, A. B.:
Computer Resolution of Unresolved Convoluted Gas-Chromatographic Peaks, J. Chromatogr. Sci., 8, 640–646, https://doi.org/10.1093/chromsci/8.11.640, 1970. 3. Apel, E. C., Hills, A. J., Lueb, R., Zindel, S., Eisele, S., and Riemer, D. D.:
A fast-GC/MS system to measure C2 to C4 carbonyls and methanol aboard aircraft, J. Geophys. Res., 108, 8794, https://doi.org/10.1029/2002JD003199, 2003. 4. Bertsch, W.:
Two-Dimensional Gas Chromatography. Concepts, Instrumentation, and Applications – Part 1: Fundamentals, Conventional Two-Dimensional Gas Chromatography, Selected Applications, J. High Res. Chromatog., 22, 647–665, https://doi.org/10.1002/(SICI)1521-4168(19991201)22:12<647::AID-JHRC647>3.0.CO;2-V, 1999. 5. Blaško, J., Kubinec, R., Ostrovský, I., Pavlíková, E., Krupčík, J., and Soják, L.:
Chemometric deconvolution of gas chromatographic unresolved conjugated linoleic acid isomers triplet in milk samples, J. Chromatogr. A, 1216, 2757–2761, https://doi.org/10.1016/j.chroma.2008.11.019, 2009.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|