Author:
Zhao Jiaojiao, Han Jungong, Shao Ling, Snoek Cees G. M.
Abstract
While many image colorization algorithms have recently shown the capability of producing plausible color versions from gray-scale photographs, they still suffer from limited semantic understanding. To address this shortcoming, we propose to exploit pixelated object semantics to guide image colorization. The rationale is that human beings perceive and distinguish colors based on the semantic categories of objects. Starting from an autoregressive model, we generate image color distributions, from which diverse colored results are sampled. We propose two ways to incorporate object semantics into the colorization model: through a pixelated semantic embedding and a pixelated semantic generator. Specifically, the proposed network includes two branches. One branch learns what the object is, while the other branch learns the object colors. The network jointly optimizes a color embedding loss, a semantic segmentation loss and a color generation loss, in an end-to-end fashion. Experiments on Pascal VOC2012 and COCO-stuff reveal that our network, when trained with semantic segmentation labels, produces more realistic and finer results compared to the colorization state-of-the-art.
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence, Computer Vision and Pattern Recognition, Software
Cited by
48 articles.
1. Versatile Vision Foundation Model for Image and Video Colorization;Special Interest Group on Computer Graphics and Interactive Techniques Conference Conference Papers '24;2024-07-13
2. PSANet: Automatic colourisation using position‐spatial attention for natural images;IET Computer Vision;2024-06-16
3. Shadow-aware image colorization;The Visual Computer;2024-06-04
4. Gallatic pallet: A review over the deep learning methods for colorization.;2023 6th International Conference on Recent Trends in Advance Computing (ICRTAC);2023-12-14
5. Brighten-and-Colorize: A Decoupled Network for Customized Low-Light Image Enhancement;Proceedings of the 31st ACM International Conference on Multimedia;2023-10-26