EXPLANA: A user-friendly workflow for EXPLoratory ANAlysis and feature selection in cross-sectional and longitudinal microbiome studies

Published: 2024-03-23

Author:

Fouquier Jennifer, Stanislawski Maggie, O’Connor John, Scadden Ashley, Lozupone Catherine

Abstract

Motivation

Longitudinal microbiome studies (LMS) are increasingly common but have analytic challenges including non-independent data requiring mixed-effects models and large amounts of data that motivate exploratory analysis to identify factors related to outcome variables. Although change analysis (i.e. calculating deltas between values at different timepoints) can be powerful, how to best conduct these analyses is not always clear. For example, observational LMS measurements show natural fluctuations, so baseline might not be a reference of primary interest; whereas, for interventional LMS, baseline is a key reference point, often indicating the start of treatment.

Results

To address these challenges, we developed a feature selection workflow for cross-sectional and LMS that supports numerical and categorical data called EXPLANA (EXPLoratory ANAlysis). Machine-learning methods were combined with different types of change calculations and downstream interpretation methods to identify statistically meaningful variables and explain their relationship to outcomes. EXPLANA generates an interactive report that textually and graphically summarizes methods and results. EXPLANA had good performance on simulated data, with an average area under the curve (AUC) of 0.91 (range: 0.79-1.0, SD = 0.05), outperformed an existing tool (AUC: 0.95 vs. 0.56), and identified novel order-dependent categorical feature changes. EXPLANA is broadly applicable and simplifies analytics for identifying features related to outcomes of interest.
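
The distinction the abstract draws between reference points in change analysis can be illustrated with a small sketch. The code below is not EXPLANA's implementation; the function name `compute_deltas`, its `reference` parameter, and the toy data are all hypothetical, chosen only to show how deltas against the previous timepoint differ from deltas against baseline for the same measurements.

```python
from collections import defaultdict


def compute_deltas(records, reference="previous"):
    """Return per-subject deltas of a numeric value between timepoints.

    records: iterable of (subject_id, timepoint, value) tuples.
    reference: "previous" compares each timepoint with the one before it;
               "baseline" compares each timepoint with the subject's first.
    Hypothetical helper for illustration, not part of EXPLANA.
    """
    if reference not in ("previous", "baseline"):
        raise ValueError("reference must be 'previous' or 'baseline'")

    # Group measurements by subject so deltas never cross subjects.
    by_subject = defaultdict(list)
    for subject, timepoint, value in records:
        by_subject[subject].append((timepoint, value))

    deltas = []
    for subject, series in by_subject.items():
        series.sort()  # order each subject's measurements by timepoint
        for i in range(1, len(series)):
            ref_idx = i - 1 if reference == "previous" else 0
            ref_time, ref_value = series[ref_idx]
            time, value = series[i]
            deltas.append((subject, ref_time, time, value - ref_value))
    return deltas


# Hypothetical toy data: (subject, timepoint, measurement)
toy = [
    ("s1", 0, 10), ("s1", 1, 12), ("s1", 2, 9),
    ("s2", 0, 5), ("s2", 1, 5), ("s2", 2, 8),
]
previous_deltas = compute_deltas(toy, reference="previous")
baseline_deltas = compute_deltas(toy, reference="baseline")
```

With the "previous" reference, each row describes the change since the last visit, which suits observational data with natural fluctuation; with the "baseline" reference, each row describes the change since the first visit, which suits interventional designs where baseline marks the start of treatment.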

Publisher

Cold Spring Harbor Laboratory
