
A feasibility study on the adoption of a generative denoising diffusion model for the synthesis of fundus photographs using a small dataset

Published:2024-04-03 Issue:4 Volume:6
ISSN:3004-9261
Container-title:Discover Applied Sciences
language:en
Short-container-title:Discov Appl Sci

Author:

Kim Hong Kyu, Ryu Ik Hee, Choi Joon Yul, Yoo Tae Keun

Abstract

The generative diffusion model has been highlighted as a state-of-the-art artificial intelligence technique for image synthesis. Here, we show that a denoising diffusion probabilistic model (DDPM) can be used for a domain-specific task: generating fundus photographs (FPs) from a limited training dataset in an unconditional manner. We trained a DDPM based on the U-Net backbone architecture, which is the most popular form of the generative diffusion model. After training, a series of denoising U-Net steps can generate FPs from random noise seeds. One thousand healthy retinal images were used to train the diffusion model, with the input image size set to 128 × 128 pixels. The trained DDPM successfully generated synthetic FPs at a resolution of 128 × 128 pixels using our small dataset. We failed to train the DDPM for 256 × 256-pixel images because of the limited computational capacity of a personal cloud platform. In a comparative analysis, the progressive growing generative adversarial network (PGGAN) synthesized sharper images than the DDPM, particularly in the retinal vessels and optic discs. The PGGAN (Fréchet inception distance [FID] score: 41.761) achieved a better (lower) FID score than the DDPM (FID score: 65.605). We used a domain-specific generative diffusion model to synthesize FPs based on a relatively small dataset. Because the DDPM has disadvantages with a small dataset, including difficulty in training and lower image quality than generative adversarial networks such as PGGAN, further studies are needed to improve diffusion models for domain-specific medical tasks with small numbers of samples.
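
As a rough illustration of the noising process that a DDPM learns to reverse, the sketch below samples a noised image in closed form, x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * noise. It is not the authors' implementation: the linear noise schedule (1e-4 to 0.02 over 1000 steps) is an assumption borrowed from common DDPM practice, the abstract does not state the schedule used, and the function names and the random stand-in image are hypothetical.

```python
import math
import random


def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances beta_1..beta_T (assumed values)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]


def cumulative_alphas(betas):
    """alpha_bar_t is the running product of (1 - beta_s) for s <= t."""
    alpha_bars, running = [], 1.0
    for beta in betas:
        running *= 1.0 - beta
        alpha_bars.append(running)
    return alpha_bars


def add_noise(pixels, t, alpha_bars, rng):
    """Sample x_t from q(x_t | x_0) in closed form; returns (x_t, noise)."""
    signal_scale = math.sqrt(alpha_bars[t])
    noise_scale = math.sqrt(1.0 - alpha_bars[t])
    noise = [rng.gauss(0.0, 1.0) for _ in pixels]
    noisy = [signal_scale * p + noise_scale * e for p, e in zip(pixels, noise)]
    return noisy, noise


rng = random.Random(0)
betas = linear_beta_schedule(1000)
alpha_bars = cumulative_alphas(betas)

# Hypothetical stand-in for one flattened 128 x 128 image scaled to [-1, 1].
image = [rng.uniform(-1.0, 1.0) for _ in range(128 * 128)]

noisy_early, _ = add_noise(image, 0, alpha_bars, rng)    # almost unchanged
noisy_late, _ = add_noise(image, 999, alpha_bars, rng)   # close to pure noise
```

During training, a U-Net would receive the noised image and the step index and be optimized to predict the added noise; generation then runs the learned denoising step repeatedly, starting from a random noise seed.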

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s42452-024-05871-9.pdf


Cited by 4 articles.

1. Generative artificial intelligence in ophthalmology: current innovations, future applications and challenges;British Journal of Ophthalmology;2024-06-26

2. D2BGAN: Dual Discriminator Bayesian Generative Adversarial Network for Deformable MR–Ultrasound Registration Applied to Brain Shift Compensation;Diagnostics;2024-06-21

3. Advancing the democratization of generative artificial intelligence in healthcare: a narrative review;Journal of Hospital Management and Health Policy;2024-06

4. Denoising diffusion probabilistic models for addressing data limitations in chest X-ray classification;Informatics in Medicine Unlocked;2024