Affiliation:
1. School of Information and Communication Engineering, Beijing Information Science and Technology University, Key Laboratory of Information and Communication Systems, Ministry of Information Industry, Beijing, China
Abstract
Transforming optical facial images into sketches while preserving realism and facial features is a significant challenge. Current methods that rely on paired training data are costly and resource-intensive. Furthermore, they often fail to capture the intricate features of faces, resulting in substandard sketches. To address these challenges, we propose a novel hierarchical contrast generative adversarial network (HCGAN). First, HCGAN consists of a global sketch synthesis module, which generates sketches with well-defined global features, and a local sketch refinement module, which improves feature extraction in critical facial areas. Second, we introduce a local refinement loss, based on the local sketch refinement module, that refines sketches at a granular level. Finally, we propose an association strategy called “warmup-epoch” and a local consistency loss between the two modules to ensure that HCGAN is optimized effectively. Evaluations on the CUFS and SKSF-A datasets demonstrate that our method produces high-quality sketches and outperforms existing state-of-the-art methods in fidelity and realism. Compared with the current state-of-the-art methods, HCGAN reduces FID by 12.6941, 4.9124, and 9.0316 on the three datasets of CUFS, respectively, and by 7.4679 on the SKSF-A dataset. It also achieves the best scores for content fidelity (CF), global effects (GE), and local patterns (LP). The proposed HCGAN model provides a promising solution for realistic sketch synthesis under unpaired data training.
Funder
The National Natural Science Foundation of China