Author:
Guo Mengfei,Yu Yanan,Wen Tiancai,Zhang Xiaoping,Liu Baoyan,Zhang Jin,Zhang Runshun,Zhang Yanning,Zhou Xuezhong
Abstract
Abstract
Background
Disease comorbidity is popular and has significant indications for disease progress and management. We aim to detect the general disease comorbidity patterns in Chinese populations using a large-scale clinical data set.
Methods
We extracted the diseases from a large-scale anonymized data set derived from 8,572,137 inpatients in 453 hospitals across China. We built a Disease Comorbidity Network (DCN) using correlation analysis and detected the topological patterns of disease comorbidity using both complex network and data mining methods. The comorbidity patterns were further validated by shared molecular mechanisms using disease-gene associations and pathways. To predict the disease occurrence during the whole disease progressions, we applied four machine learning methods to model the disease trajectories of patients.
Results
We obtained the DCN with 5702 nodes and 258,535 edges, which shows a power law distribution of the degree and weight. It further indicated that there exists high heterogeneity of comorbidities for different diseases and we found that the DCN is a hierarchical modular network with community structures, which have both homogeneous and heterogeneous disease categories. Furthermore, adhering to the previous work from US and Europe populations, we found that the disease comorbidities have their shared underlying molecular mechanisms. Furthermore, take hypertension and psychiatric disease as instance, we used four classification methods to predicte the disease occurrence using the comorbid disease trajectories and obtained acceptable performance, in which in particular, random forest obtained an overall best performance (with F1-score 0.6689 for hypertension and 0.6802 for psychiatric disease).
Conclusions
Our study indicates that disease comorbidity is significant and valuable to understand the disease incidences and their interactions in real-world populations, which will provide important insights for detection of the patterns of disease classification, diagnosis and prognosis.
Publisher
Springer Science and Business Media LLC
Subject
Genetics(clinical),Genetics
Reference44 articles.
1. Capobianco E, Lio P. Comorbidity: a multidimensional approach. Trends Mol Med. 2013;19(9):515–21.
2. Radner H, Yoshida K, Smolen JS, et al. multimorbidity and rheumatic conditions-enhancing the concept of comorbidity. Nature reviews. Rheumatology. 2014;10(4):252.
3. Rubioperez C, Guney E, Aguilar D, et al. Genetic and functional characterization of disease associations explains comorbidity. Sci Rep. 2017;7(1):6207.
4. Hu JX, Thomas CE, Brunak S. Network biology concepts in complex disease comorbidities. Nat Rev Genet. 2016;17(10):615–29.
5. Bragina EY, Freidin MB, Babuskina NP, et al. The analysis of associations between cytokine network genes and inverse co-morbidity of ronchial asthma and tuberculosis. Biomed Genet Genom. 2016;1(5):Z2–4.
Cited by
35 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献