GP-Net: Image Manipulation Detection and Localization via Long-Range Modeling and Transformers

Published: 2023-11-05 Issue: 21 Volume: 13 Page: 12053
ISSN: 2076-3417
Container-title: Applied Sciences
Language: en
Short-container-title: Applied Sciences

Author:

Peng Jin¹, Liu Chengming¹, Pang Haibo¹, Gao Xiaomeng¹, Cheng Guozhen², Hao Bing³

Affiliation:

1. School of Cyber Science and Engineering, Zhengzhou University, No. 97, Wenhua Road, Zhengzhou 450002, China

2. Institute of Information Technology, Information Engineering University, Zhengzhou 450002, China

3. Songshan Laboratory, Zhengzhou 450002, China

Abstract

As image manipulation techniques spread, more and more people find it easy to alter image content. This poses a significant challenge to the integrity of multimedia data and has driven research on image forgery detection. Most current methods use convolutional neural networks (CNNs) for image manipulation localization, with promising results. However, CNN-based approaches are limited in their ability to model explicit long-range relationships. Image manipulation localization therefore calls for a solution that builds global context while retaining a firm grasp of low-level details. In this paper, we propose GPNet to address this challenge. GPNet combines a Transformer and a CNN in parallel, which lets it build global dependencies and capture low-level details efficiently. We also design a fusion module, TcFusion, which merges the feature maps generated by the two branches. Extensive experiments on diverse datasets show that our network outperforms prevailing state-of-the-art manipulation detection and localization approaches.

Funder

Nature Science Foundation of China

Key science and technology project of Henan Province

technological research projects in Henan province

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

Link

https://www.mdpi.com/2076-3417/13/21/12053/pdf

Reference: 31 articles.

1. Razavi, A., Oord, A., and Vinyals, O. (2019, January 8–14). Generating diverse high-fidelity images with VQ-VAE-2. Proceedings of the Neural Information Processing Systems, Vancouver, BC, Canada.

2. Goodfellow, I., Pouget-Abadie, J., and Mirza, M. (2014, January 8–13). Generative adversarial nets. Proceedings of the Neural Information Processing Systems, Montreal, QC, Canada.

3. Park, T., Zhu, J.-Y., and Wang, O. (2020, January 6–12). Swapping autoencoder for deep image manipulation. Proceedings of the Neural Information Processing Systems, Online.

4. Dhamo, H., Farshad, A., and Laina, I. (2020, January 13–19). Semantic image manipulation using scene graphs. Proceedings of the Computer Vision and Pattern Recognition, Seattle, WA, USA.

5. Li, B., Qi, X., and Lukasiewicz, T. (2020, January 13–19). ManiGAN: Text-guided image manipulation. Proceedings of the Computer Vision and Pattern Recognition, Seattle, WA, USA.

Cited by 3 articles.

1. MTFDN: An image copy-move forgery detection method based on multi-task learning;Expert Systems;2024-09-14

2. Image splicing detection using low-dimensional feature vector of texture features and Haralick features based on Gray Level Co-occurrence Matrix;Signal Processing: Image Communication;2024-07

3. Closing Editorial for Computer Vision and Pattern Recognition Based on Deep Learning;Applied Sciences;2024-04-25