Generative Adversarial Networks for Video-to-Video Domain Adaptation-Reference-Cited by-同舟云学术

Generative Adversarial Networks for Video-to-Video Domain Adaptation

Published:2020-04-03 Issue:04 Volume:34 Page:3462-3469
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Chen Jiawei,Li Yuexiang,Ma Kai,Zheng Yefeng

Abstract

Endoscopic videos from multicentres often have different imaging conditions, e.g., color and illumination, which make the models trained on one domain usually fail to generalize well to another. Domain adaptation is one of the potential solutions to address the problem. However, few of existing works focused on the translation of video-based data. In this work, we propose a novel generative adversarial network (GAN), namely VideoGAN, to transfer the video-based data across different domains. As the frames of a video may have similar content and imaging conditions, the proposed VideoGAN has an X-shape generator to preserve the intra-video consistency during translation. Furthermore, a loss function, namely color histogram loss, is proposed to tune the color distribution of each translated frame. Two colonoscopic datasets from different centres, i.e., CVC-Clinic and ETIS-Larib, are adopted to evaluate the performance of domain adaptation of our VideoGAN. Experimental results demonstrate that the adapted colonoscopic video generated by our VideoGAN can significantly boost the segmentation accuracy, i.e., an improvement of 5%, of colorectal polyps on multicentre datasets. As our VideoGAN is a general network architecture, we also evaluate its performance with the CamVid driving video dataset on the cloudy-to-sunny translation task. Comprehensive experiments show that the domain gap could be substantially narrowed down by our VideoGAN.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 14 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A non-aligned translation with a neoplastic classifier regularization to include vascular NBI patterns in standard colonoscopies;Computers in Biology and Medicine;2024-03

2. Comparative analysis of Vid2Vid and Fast Vid2Vid Models for Video-to-Video Synthesis on Cityscapes Dataset;2023 International Conference on Computer, Electronics & Electrical Engineering & their Applications (IC2E3);2023-06-08

3. Deep learning in precision medicine and focus on glioma;Bioengineering & Translational Medicine;2023-05-31

4. Deep Gait Recognition: A Survey;IEEE Transactions on Pattern Analysis and Machine Intelligence;2023-01-01

5. Ani-GIFs: A benchmark dataset for domain generalization of action recognition from GIFs;Frontiers in Computer Science;2022-09-26