Affiliations:
1. Shanghai Literature Institute of Traditional Chinese Medicine
2. ChengZheng Wisdom (Shanghai) Health Sciences and Technology Co., Ltd
3. Zhejiang Zhongwei Medical Research Center
4. Shanghai Daosh Medical Technology Co., Ltd
Abstract
Background
Bronchopulmonary dysplasia (BPD) has a high incidence and affects the health of preterm infants. Cuproptosis is a novel form of cell death, but its mechanism of action in BPD is not yet clear. Machine learning is a recent tool for analyzing biological samples, but it is still rarely used for in-depth analysis and prediction of disease.
Methods and Results
First, the differential expression of cuproptosis-related genes (CRGs) was extracted from the GSE108754 dataset. The heat map showed that NFE2L2 was significantly more highly expressed in the control group and that GLS was significantly more highly expressed in the treatment group. Chromosome location analysis showed that both genes map to chromosome 2, and their expression was positively correlated. Immune infiltration and immune cell differential analysis showed differences in four immune cell types, most notably monocytes. Five new pathways were analyzed by consensus clustering based on the expression of CRGs. Weighted correlation network analysis (WGCNA), with the screening condition set to the top 25%, was used to obtain the disease signature genes. Four machine learning algorithms, Generalized Linear Models (GLM), Random Forest (RF), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGB), were used to screen the disease signature genes, yielding five final marker genes for disease prediction. The model constructed with the GLM method proved more accurate in validation on two datasets, GSE190215 and GSE188944.
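The abstract does not say which quantity the "top 25%" screening condition ranks. One common reading in WGCNA workflows is a variance filter that keeps the most variable quarter of genes before network construction. The sketch below illustrates only that assumed reading. The function name `top_variance_genes`, the `fraction` parameter, and the toy expression values are hypothetical and are not taken from the study or from GSE108754.

```python
from statistics import pvariance


def top_variance_genes(expression, fraction=0.25):
    """Return gene names whose variance across samples is in the top `fraction`.

    `expression` maps a gene name to a list of expression values, one per sample.
    Assumption: the screening ranks genes by variance; the abstract does not say so.
    """
    if not 0 < fraction <= 1:
        raise ValueError("fraction must be in (0, 1]")
    # Rank genes from most to least variable across samples.
    ranked = sorted(
        expression,
        key=lambda gene: pvariance(expression[gene]),
        reverse=True,
    )
    # Keep at least one gene so very small inputs do not yield an empty result.
    keep = max(1, int(len(ranked) * fraction))
    return ranked[:keep]


# Toy, made-up expression values (hypothetical; not study data).
toy = {
    "GENE_A": [1.0, 9.0, 1.0, 9.0],
    "GENE_B": [5.0, 5.1, 4.9, 5.0],
    "GENE_C": [2.0, 4.0, 2.0, 4.0],
    "GENE_D": [3.0, 3.0, 3.0, 3.0],
}
selected = top_variance_genes(toy)
print(selected)
```

With four toy genes and `fraction=0.25`, a single gene is retained. In an actual analysis the retained genes would then be passed on to network construction and module detection, which this sketch does not attempt.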
Conclusion
We identified two cuproptosis-associated genes, NFE2L2 and GLS. A GLM-based machine learning model was constructed to predict the prevalence of BPD, and five disease signature genes, NFATC3, ERMN, PLA2G4A, MTMR9LP and LOC440700, were identified. These genes, identified through bioinformatics analysis, could be potential targets for the identification and treatment of BPD.
Publisher
Research Square Platform LLC