GP-Net: Image Manipulation Detection and Localization via Long-Range Modeling and Transformers
-
Published:2023-11-05
Issue:21
Volume:13
Page:12053
-
ISSN:2076-3417
-
Container-title:Applied Sciences
-
language:en
-
Short-container-title:Applied Sciences
Author:
Peng Jin1, Liu Chengming1ORCID, Pang Haibo1, Gao Xiaomeng1, Cheng Guozhen2, Hao Bing3
Affiliation:
1. School of Cyber Science and Engineering, Zhengzhou University, No. 97, Wenhua Road, Zhengzhou 450002, China 2. Institute of Information Technology, Information Engineering University, Zhengzhou 450002, China 3. Songshan Laboratory, Zhengzhou 450002, China
Abstract
With the rise of image manipulation techniques, an increasing number of individuals find it easy to manipulate image content. Undoubtedly, this presents a significant challenge to the integrity of multimedia data, thereby fueling the advancement of image forgery detection research. A majority of current methods employ convolutional neural networks (CNNs) for image manipulation localization, yielding promising outcomes. Nevertheless, CNN-based approaches possess limitations in establishing explicit long-range relationships. Consequently, addressing the image manipulation localization task necessitates a solution that adeptly builds global context while preserving a robust grasp of low-level details. In this paper, we propose GPNet to address this challenge. GPNet combines Transformer and CNN in parallel which can build global dependency and capture low-level details efficiently. Additionally, we devise an effective fusion module referred to as TcFusion, which proficiently amalgamates feature maps generated by both branches. Thorough extensive experiments conducted on diverse datasets showcase that our network outperforms prevailing state-of-the-art manipulation detection and localization approaches.
Funder
Nature Science Foundation of China Key science and technology project of Henan Province technological research projects in Henan province
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference31 articles.
1. Razavi, A., Oord, A., and Vinyals, O. (2019, January 8–14). Generating diverse high-fidelity images with vq-vae-2. Proceedings of the Neural Information Processing Systems, Vancouver, BC, Canada. 2. Goodfellow, I., Pouget-Abadie, J., and Mirza, M. (2014, January 8–13). Generative adversarial nets. Proceedings of the Neural Information Processing Systems, Montreal, QC, Canada. 3. Park, T., Zhu, J.-Y., and Wang, O. (2020, January 6–12). Swapping autoencoder for deep image manipulation. Proceedings of the Neural Information Processing Systems, Online. 4. Dhamo, H., Farshad, A., and Laina, I. (2020, January 13–19). Semantic image manipulation using scene graphs. Proceedings of the Computer Vision and Pattern Recognition, Seattle, WA, USA. 5. Li, B., Qi, X., and Lukasiewicz, T. (2020, January 13–19). Manigan: Text-guided image manipulation. Proceedings of the Computer Vision and Pattern Recognition, Seattle, WA, USA.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|