Deep Learning Architectures for Diagnosis of Diabetic Retinopathy-Reference-Cited by-同舟云学术

Deep Learning Architectures for Diagnosis of Diabetic Retinopathy

Published:2023-03-31 Issue:7 Volume:13 Page:4445
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Solano Alberto¹^ORCID,Dietrich Kevin N.¹,Martínez-Sober Marcelino¹^ORCID,Barranquero-Cardeñosa Regino¹,Vila-Tomás Jorge²,Hernández-Cámara Pablo²^ORCID

Affiliation:

1. Intelligent Data Analysis Laboratory, ETSE (Engineering School), Universitat de València, 46100 Burjassot, Spain

2. Image Processing Lab., Universitat de València, 46980 Paterna, Spain

Abstract

For many years, convolutional neural networks dominated the field of computer vision, not least in the medical field, where problems such as image segmentation were addressed by such networks as the U-Net. The arrival of self-attention-based networks to the field of computer vision through ViTs seems to have changed the trend of using standard convolutions. Throughout this work, we apply different architectures such as U-Net, ViTs and ConvMixer, to compare their performance on a medical semantic segmentation problem. All the models have been trained from scratch on the DRIVE dataset and evaluated on their private counterparts to assess which of the models performed better in the segmentation problem. Our major contribution is showing that the best-performing model (ConvMixer) is the one that shares the approach from the ViT (processing images as patches) while maintaining the foundational blocks (convolutions) from the U-Net. This mixture does not only produce better results (DICE=0.83) than both ViTs (0.80/0.077 for UNETR/SWIN-Unet) and the U-Net (0.82) on their own but reduces considerably the number of parameters (2.97M against 104M/27M and 31M, respectively), showing that there is no need to systematically use large models for solving image problems where smaller architectures with the optimal pieces can get better results.

Funder

MICIIN/FEDER/UE

Spanish MIU

GVA

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/7/4445/pdf

Reference38 articles.

1. McGlinchy, J., Johnson, B., Muller, B., Joseph, M., and Diaz, J. (August, January 28). Application of UNet Fully Convolutional Neural Network to Impervious Surface Segmentation in Urban Environment from High Resolution Satellite Imagery. Proceedings of the IGARSS 2019—2019 IEEE International Geoscience and Remote Sensing Symposium, Yokohama, Japan.

2. A new approach for the morphological segmentation of high-resolution satellite imagery;Pesaresi;IEEE Trans. Geosci. Remote. Sens.,2001

3. Nemni, E., Bullock, J., Belabbes, S., and Bromley, L. (2020). Fully convolutional neural network for rapid flood segmentation in synthetic aperture radar imagery. Remote. Sens., 12.

4. Xie, B., Li, S., Li, M., Liu, C.H., Huang, G., and Wang, G. (2022). SePiCo: Semantic-Guided Pixel Contrast for Domain Adaptive Semantic Segmentation. IEEE Trans. Pattern Anal. Mach. Intell.

5. The Medical Segmentation Decathlon;Antonelli;Nat. Commun.,2021

Cited by 7 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. An efficient approach to detect and segment underwater images using Swin Transformer;Results in Engineering;2024-09

2. An eccentric Iter Net–based Improved Intelligent Water Drop (I2WD) feature selection and Discriminated Multi-Instance Classification (DMIC) models for diabetic retinopathy detection;International Journal of Diabetes in Developing Countries;2024-08-12

3. MT_Net: A Multi-Scale Framework Using the Transformer Block for Retina Layer Segmentation;Photonics;2024-06-27

4. Diabetic retinopathy prediction based on vision transformer and modified capsule network;Computers in Biology and Medicine;2024-06

5. Hierarchical Encoding Method for Retinal Segmentation Evolutionary Architecture Search;IEEE Transactions on Emerging Topics in Computational Intelligence;2024