Author:
Yuan Xiaohan,Chen Shuyu,Sun Chuan,Yuwen Lu
Abstract
AbstractChronic diseases are one of the most severe health issues in the world, due to their terrible clinical presentations such as long onset cycle, insidious symptoms, and various complications. Recently, machine learning has become a promising technique to assist the early diagnosis of chronic diseases. However, existing works ignore the problems of feature hiding and imbalanced class distribution in chronic disease datasets. In this paper, we present a universal and efficient diagnostic framework to alleviate the above two problems for diagnosing chronic diseases timely and accurately. Specifically, we first propose a network-limited polynomial neural network (NLPNN) algorithm to efficiently capture high-level features hidden in chronic disease datasets, which is data augmentation in terms of its feature space and can also avoid over-fitting. Then, to alleviate the class imbalance problem, we further propose an attention-empowered NLPNN algorithm to improve the diagnostic accuracy for sick cases, which is also data augmentation in terms of its sample space. We evaluate the proposed framework on nine public and two real chronic disease datasets (partly with class imbalance). Extensive experiment results demonstrate that the proposed diagnostic algorithms outperform state-of-the-art machine learning algorithms, and can achieve superior performances in terms of accuracy, recall, F1, and G_mean. The proposed framework can help to diagnose chronic diseases timely and accurately at an early stage.
Funder
National Natural Science Foundation of China
Graduate Research and Innovation Foundation of Chongqing
Chongqing Science and Technology Project
Fundamental Research Funds for the Central Universities
Publisher
Springer Science and Business Media LLC
Reference49 articles.
1. Yuan, X., Chen, S., Sun, C. & Yuwen, L. A novel class imbalance-oriented polynomial neural network algorithm for disease diagnosis. In Proceedings of IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2360–2367 (2021).
2. Organization, W. H. WHO reveals leading causes of death and disability worldwide: 2000–2019. https://www.who.int/news/item/09-12-2020-who-reveals-leading-causes-of-death-and-disability-worldwide-2000-2019.
3. Souza-Pereira, L., Pombo, N., Ouhbi, S., Felizardo, V. & Garcia, N. Clinical decision support systems for chronic diseases: A systematic literature review. Comput. Methods Progr. Biomed. 195, 105565 (2020).
4. Alkenani, A. H., Li, Y., Xu, Y. & Zhang, Q. Predicting Alzheimer’s disease from spoken and written language using fusion-based stacked generalization. J. Biomed. Inform. 118, 103803 (2021).
5. Yuan, X., Chen, S., Yuwen, L., An, S., Mei, S. & Chen, T. An improved SEIR model for reconstructing the dynamic transmission of COVID-19. In Proceedings of IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2320–2327 (2020).
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献