Dirty Pixels: Towards End-to-end Image Processing and Perception-Reference-Cited by-同舟云学术

Dirty Pixels: Towards End-to-end Image Processing and Perception

Published:2021-05-06 Issue:3 Volume:40 Page:1-15
ISSN:0730-0301
Container-title:ACM Transactions on Graphics
language:en
Short-container-title:ACM Trans. Graph.

Author:

Diamond Steven¹,Sitzmann Vincent²,Julca-Aguilar Frank³^ORCID,Boyd Stephen¹,Wetzstein Gordon¹,Heide Felix⁴

Affiliation:

1. Stanford University

2. Stanford University, MIT

3. Algolux

4. Princeton University

Abstract

Real-world, imaging systems acquire measurements that are degraded by noise, optical aberrations, and other imperfections that make image processing for human viewing and higher-level perception tasks challenging. Conventional cameras address this problem by compartmentalizing imaging from high-level task processing. As such, conventional imaging involves processing the RAW sensor measurements in a sequential pipeline of steps, such as demosaicking, denoising, deblurring, tone-mapping, and compression. This pipeline is optimized to obtain a visually pleasing image. High-level processing, however, involves steps such as feature extraction, classification, tracking, and fusion. While this silo-ed design approach allows for efficient development, it also dictates compartmentalized performance metrics without knowledge of the higher-level task of the camera system. For example, today’s demosaicking and denoising algorithms are designed using perceptual image quality metrics but not with domain-specific tasks such as object detection in mind. We propose an end-to-end differentiable architecture that jointly performs demosaicking, denoising, deblurring, tone-mapping, and classification (see Figure 1). The architecture does not require any intermediate losses based on perceived image quality and learns processing pipelines whose outputs differ from those of existing ISPs optimized for perceptual quality, preserving fine detail at the cost of increased noise and artifacts. We show that state-of-the-art ISPs discard information that is essential in corner cases, such as extremely low-light conditions, where conventional imaging and perception stacks fail. We demonstrate on captured and simulated data that our model substantially improves perception in low light and other challenging conditions, which is imperative for real-world applications such as autonomous driving, robotics, and surveillance. Finally, we found that the proposed model also achieves state-of-the-art accuracy when optimized for image reconstruction in low-light conditions, validating the architecture itself as a potentially useful drop-in network for reconstruction and analysis tasks beyond the applications demonstrated in this work. Our proposed models, datasets, and calibration data are available at https://github.com/princeton-computational-imaging/DirtyPixels .

Funder

Stanford Graduate Fellowship in Science and Engineering

National Science Foundation (NSF) CAREER award

Sloan Fellowship

PECASE from the ARO

KAUST Office of Sponsored Research through the Visual Computing Center CCF

NSF CAREER Award

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Link

https://dl.acm.org/doi/pdf/10.1145/3446918

Reference62 articles.

1. A fast iterative shrinkage-thresholding algorithm for linear inverse problems;Beck A.;SIAM J. Imag. Sci.,2009

2. Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems;Beck A.;IEEE Trans. Image Proc.,2009

3. Optimizing image acquisition systems for autonomous driving;Blasinski Henryk;Electron. Imag.,2018

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Learning real-world heterogeneous noise models with a benchmark dataset;Pattern Recognition;2024-12

2. Deep learning optimization for small object classification in lensfree holographic microscopy;Optics Express;2024-09-13

3. Test-time Adaptation Meets Image Enhancement: Improving Accuracy via Uncertainty-aware Logit Switching;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. A Vision-Centric Approach for Static Map Element Annotation;2024 IEEE International Conference on Robotics and Automation (ICRA);2024-05-13

5. ISP Parameter Optimization and FPGA Implementation for Object Detection in Low-Light Conditions;2024 IEEE Symposium in Low-Power and High-Speed Chips (COOL CHIPS);2024-04-17