SinGAN-Seg: Synthetic training data generation for medical image segmentation

Published:2022-05-02 Issue:5 Volume:17 Page:e0267976
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Thambawita Vajira, Salehi Pegah, Sheshkal Sajad Amouei, Hicks Steven A., Hammer Hugo L., Parasa Sravanthi, Lange Thomas de, Halvorsen Pål, Riegler Michael A.

Abstract

Analyzing medical data to find abnormalities is a time-consuming and costly task, particularly for rare abnormalities, requiring tremendous efforts from medical experts. Therefore, artificial intelligence has become a popular tool for the automatic processing of medical data, acting as a supportive tool for doctors. However, the machine learning models used to build these tools are highly dependent on the data used to train them. Large amounts of data can be difficult to obtain in medicine due to privacy reasons, expensive and time-consuming annotations, and a general lack of data samples for infrequent lesions. In this study, we present a novel synthetic data generation pipeline, called SinGAN-Seg, to produce synthetic medical images with corresponding masks using a single training image. Our method differs from traditional generative adversarial networks (GANs) because our model needs only a single image and the corresponding ground truth to train. We also show that the synthetic data generation pipeline can be used to produce alternative artificial segmentation datasets with corresponding ground truth masks when real datasets cannot be shared. The pipeline is evaluated using qualitative and quantitative comparisons between real data and synthetic data to show that the style transfer technique used in our pipeline significantly improves the quality of the generated data and that our method is better than other state-of-the-art GANs at preparing synthetic images when the size of the training dataset is limited. By training UNet++ using both real data and the synthetic data generated from the SinGAN-Seg pipeline, we show that the models trained on synthetic data perform very close to those trained on real data when both datasets have a considerable amount of training data. In contrast, we show that synthetic data generated from the SinGAN-Seg pipeline improves the performance of segmentation models when training datasets do not have a considerable amount of data. All experiments were performed using an open dataset and the code is publicly available on GitHub.

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

References: 69 articles.

1. Artificial intelligence in healthcare: past, present and future;F Jiang;Stroke and Vascular Neurology,2017

2. Artificial Intelligence in Medicine and Cardiac Imaging: Harnessing Big Data and Advanced Computing to Provide Personalized Medical Diagnosis and Treatment;SE Dilsizian;Current Cardiology Reports,2013

3. The coming of age of artificial intelligence in medicine;VL Patel;Artificial Intelligence in Medicine,2009

4. Adapting to artificial intelligence: radiologists and pathologists as information specialists;S Jha;JAMA,2016

5. A logical calculus of the ideas immanent in nervous activity;WS McCulloch;The Bulletin of Mathematical Biophysics,1943

Cited by 41 articles.

1. Guided image generation for improved surgical image segmentation;Medical Image Analysis;2024-10

2. Generative AI-Assisted Novel View Synthesis of Coronary Arteries for Angiography;2024 IEEE International Symposium on Medical Measurements and Applications (MeMeA);2024-06-26

3. Using diffusion models to generate synthetic labeled data for medical image segmentation;International Journal of Computer Assisted Radiology and Surgery;2024-06-20

4. Few-Shot Learning for Medical Image Segmentation Using 3D U-Net and Model-Agnostic Meta-Learning (MAML);Diagnostics;2024-06-07

5. A synthetic data generation system based on the variational-autoencoder technique and the linked data paradigm;Progress in Artificial Intelligence;2024-06