Authors:
Li Yi, Han Yujie, Li Zihan, Zhong Yi, Guo Zhifen
Abstract
Glaucoma is an acquired optic neuropathy that can lead to irreversible vision loss. Deep learning (DL), especially convolutional neural networks (CNNs), has achieved considerable success in medical image recognition, driven by the availability of large-scale annotated datasets. However, obtaining fully annotated datasets like ImageNet remains a challenge in the medical field. Meanwhile, single-modal approaches remain unreliable and inaccurate owing to the diversity of glaucoma types and the complexity of its symptoms. In this paper, a new multimodal glaucoma dataset is constructed and a new multimodal neural network for glaucoma diagnosis and classification (GMNNnet) is proposed to address both issues. Specifically, the dataset includes the five most important types of glaucoma labels, electronic medical records, and four kinds of high-resolution medical images. GMNNnet consists of three branches: branch 1, composed of convolutional, recurrent, and transposition layers, processes patient metadata; branch 2 uses a Unet to extract features from glaucoma segmentation based on domain knowledge; and branch 3 uses a ResFormer to process glaucoma medical images directly. The outputs of branches 1 and 2 are concatenated and then processed by a CatBoost classifier. We introduce a gradient-weighted class activation mapping (Grad-CAM) method to increase the interpretability of the model, and a transfer learning method for the case of insufficient training data, i.e., fine-tuning CNN models pre-trained on natural image datasets for medical image tasks. The results show that GMNNnet better captures the high-dimensional information of glaucoma and achieves excellent performance on multimodal data.
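The fusion scheme described in the abstract (two branches concatenated before a final classifier, one branch kept separate) can be sketched as follows. This is a minimal NumPy illustration of the data flow only, not the authors' implementation: the branch functions, feature dimensions, and weight shapes are all hypothetical stand-ins for the paper's Unet, ResFormer, and CatBoost components.

```python
import numpy as np

rng = np.random.default_rng(0)

def branch_metadata(meta, w):
    # Hypothetical stand-in for branch 1: project patient metadata
    # (electronic medical records) to a feature vector.
    return np.tanh(meta @ w)

def branch_segmentation(seg_feats, w):
    # Hypothetical stand-in for branch 2: features pooled from a
    # segmentation backbone (a Unet in the paper).
    return np.tanh(seg_feats @ w)

def fuse_and_classify(f1, f2, w_cls):
    # Branches 1 and 2 are concatenated, then fed to a classifier
    # (a CatBoost classifier in the paper; a linear head here).
    fused = np.concatenate([f1, f2], axis=-1)
    return fused @ w_cls

# Toy batch: 4 patients, 16 metadata fields, 32 pooled segmentation features.
meta = rng.normal(size=(4, 16))
seg = rng.normal(size=(4, 32))

f1 = branch_metadata(meta, rng.normal(size=(16, 8)))
f2 = branch_segmentation(seg, rng.normal(size=(32, 8)))
logits = fuse_and_classify(f1, f2, rng.normal(size=(16, 5)))  # 5 glaucoma classes
print(logits.shape)  # (4, 5)
```

Branch 3 (the ResFormer image branch) is omitted from this sketch; the abstract states that only branches 1 and 2 are mixed before the classifier.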
Publisher
Springer Science and Business Media LLC
Cited by
5 articles.