Optical implementation and robustness validation for multi-scale masked autoencoder-Reference-Cited by-同舟云学术

Optical implementation and robustness validation for multi-scale masked autoencoder

Published:2023-04-01 Issue:4 Volume:8 Page:
ISSN:2378-0967
Container-title:APL Photonics
language:en
Short-container-title:

Author:

Xue Yizheng¹^ORCID,Su Xiongfei¹^ORCID,Zhang Shiyu¹^ORCID,Yuan Xin¹^ORCID

Affiliation:

1. Research Center for Industries of the Future (RCIF) and School of Engineering, Westlake University , Hangzhou 310030, Zhejiang, China

Abstract

Masked Autoencoders (MAEs), the state-of-the-art self-supervised neural network architecture in miscellaneous vision tasks, show surprisingly effective potential in reconstructing images distorted by random masking. This paper first introduces an optical implementation of MAEs, employing digital micromirror devices in the optical path to capture partially blocked images. MAEs with multi-scale patches are deployed in the reconstruction procedure. By using an optical-specialized version of the reconstruction network, the system can reconstruct original scenes of high quality. Simulations and experimental measurements showed a significant performance, achieving 24.41 dB average peak-signal-to-noise on Davis2017 datasets and 29.92 dB (masked areas) on authentic captured images under 70% of pixels being blocked. This paves the way for the application of low-bandwidth sampling of high-throughput high-resolution images.

Funder

National Natural Science Foundation of China

Zhejiang Provincial Natural Science Foundation of China

Publisher

AIP Publishing

Subject

Computer Networks and Communications,Atomic and Molecular Physics, and Optics

Link

https://pubs.aip.org/aip/app/article-pdf/doi/10.1063/5.0139050/16821965/046106_1_5.0139050.pdf

Reference39 articles.

1. I. Turc , M.-W.Chang, K.Lee, and K.Toutanova, “Well-read students learn better: On the importance of pre-training compact models,” arXiv:1908.08962v2 (2019).

2. An image is worth 16 × 16 words: Transformers for image recognition at scale

3. CvT: Introducing convolutions to vision transformers,2021

4. An empirical study of training self-supervised vision transformers,2021

5. Emerging properties in self-supervised vision transformers,2021

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Model-Guided Iterative Diffusion Sampling for NLOS Reconstruction;IEEE Journal of Selected Topics in Quantum Electronics;2024-01