A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space-Reference-Cited by-同舟云学术

A Semantic Segment Encoder (SSE): Improving human face inversion quality through minimized learning space

Published:2023-12-05 Issue:12 Volume:18 Page:e0295316
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Kang Byungseok^ORCID,Jo Youngjae

Abstract

Recently, Generative Adversarial Networks (GAN) has been greatly developed and widely used in image synthesis. A Style-Based Generator Architecture for Generative Adversarial Networks (StyleGAN) which is the foremost, continues to develop human face inversion domain. StyleGAN uses insufficient vector space to express more than one million pixels. It is difficult to apply in real business due to distortion-edit tradeoff problem in latent space. To overcome this, we propose a novel semantic segment encoder (SSE) with improved face inversion quality by narrowing the size of restoration latent space. Encoder’s learning area is minimized to logical semantic-segment units that can be recognized by humans. The proposed encoder does not affect other segments because only one segment is edited at a time. To verify the face inversion quality, we compared with the latest encoders both Pixel2style2Pixel and RestyleEncoder. Experimental result shows that the proposed encoder improved distortion quality around 20% while maintain editing performance.

Funder

Ministry of Culture, Sports and Tourism

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference24 articles.

1. Generative adversarial networks: An overview;A. Creswell;IEEE signal processing magazine,2018

2. Encoding in style: a stylegan encoder for image-to-image translation;E. Richardson;In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition,2021

3. Analyzing and improving the image quality of stylegan;T. Karras;In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition,2020

4. Designing an encoder for stylegan image manipulation;O. Tov;ACM Transactions on Graphics (TOG),2021