Combination of multiple omics and machine learning identifies diagnostic genes for ARDS and COVID-19

Author:

Tian Chuanxi1,Guo Yikun2,Guan Huifang3,Ma Kaile4,Hao Rui4,Zhu Wei5,Zhao Jinyue3,Li Min1

Affiliation:

1. Guang’anmen Hospital, China Academy of Chinese Medical Sciences

2. Beijing University of Chinese Medicine

3. Changchun University of Chinese Medicine

4. China Academy of Chinese Medical Sciences

5. Department of Traditional Chinese Medicine,The 942th Hospital of Joint Logistics Support force of Chinese People's Liberation Army

Abstract

Abstract BACKGROUND Acute respiratory distress syndrome (ARDS) is a common acute clinical syndrome of the respiratory system with a high mortality rate and difficult prognosis.COVID-19 is a serious respiratory infectious disease caused by coronaviruses in a global pandemic. Some studies have suggested a possible association between COVID-19 and ARDS, but few studies have investigated the mechanism of interaction between them. METHODS Microarray data of ARDS (GSE32707 and GSE66890) and COVID-19 (GSE213313) were downloaded from the GEO database and searched for common differential genes for enrichment analysis.WGCNA was used to identify co-expression modules and genes associated with ARDS and COVID-19. RF and LASSO were performed for candidate gene identification. Machine learning XGBoost improved the diagnosis of hub genes in ARDS and COVID-19. The degree of immune cell infiltration in ARDS and COVID-19 samples was assessed using the CIBERSORT algorithm, and the relationship between hub genes and infiltrating immune cells was investigated. Changes in pathway activity per cell were visualized using Seurat standard flow down clustering (seurat) to visualize peripheral blood mononuclear cell (PBMC) single-cell RNA sequencing (scRNA-seq) data from patients with sepsis-combined ARDS and patients with sepsis alone. RESULTS Limma difference analysis identified 314 up-regulated genes and 241 down-regulated genes in ARDS and COVID-19.WGCNA identified the purple-red co-expression module as the core module of ARDS and COVID-19. Five candidate genes, namely HIST1H2BK, TCF4, OLFM4, KIF14 and HK1, were screened using two machine learning algorithms, RF and LASSO. XGBoost constructed diagnostic models to evaluate the hub genes with high diagnostic efficacy in ARDS and COVID-19. Single-cell sequencing revealed the presence of alterations in five immune subpopulations, including monocytes, B cells, T cells, NK cells and platelets, with high expression levels and cellular occupancy of TCF4 and HK1, which are involved in oxidative reactions.

Publisher

Research Square Platform LLC

Reference25 articles.

1. Acute Respiratory Distress Syndrome;Thompson BT;N Engl J Med,2017

2. Pathophy siological mechanism of acute respiratory distress syndrome and research progress on diagnostic biomarkers of ARDS;Tang Min;China Journal of Modern Medicine,2022

3. Epidemiology, Patterns of Care, and Mortality for Patients With Acute Respiratory Distress Syndrome in Intensive Care Units in 50 Countries;Bellani G;JAMA,2016

4. Recent Developments on Therapeutic and Diagnostic Approaches for COVID-19;Majumder J;AAPS J,2021

5. COVID-19 and its long-term sequelae: what do we know in 2023?;Lippi G;Pol Arch Intern Med,2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3