Efficient Hybrid Zoom Using Camera Fusion on Mobile Phones

Author:

Wu Xiaotong1,Lai Wei-Sheng1,Shih Yichang1,Herrmann Charles1,Krainin Michael1,Sun Deqing1,Liang Chia-Kai1

Affiliation:

1. Google, USA

Abstract

DSLR cameras can achieve multiple zoom levels via shifting lens distances or swapping lens types. However, these techniques are not possible on smart-phone devices due to space constraints. Most smartphone manufacturers adopt a hybrid zoom system: commonly a Wide ( W ) camera at a low zoom level and a Telephoto ( T ) camera at a high zoom level. To simulate zoom levels between W and T , these systems crop and digitally upsample images from W , leading to significant detail loss. In this paper, we propose an efficient system for hybrid zoom super-resolution on mobile devices, which captures a synchronous pair of W and T shots and leverages machine learning models to align and transfer details from T to W. We further develop an adaptive blending method that accounts for depth-of-field mismatches, scene occlusion, flow uncertainty, and alignment errors. To minimize the domain gap, we design a dual-phone camera rig to capture real-world inputs and ground-truths for supervised training. Our method generates a 12-megapixel image in 500ms on a mobile platform and compares favorably against state-of-the-art methods under extensive evaluation on real-world scenarios.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Graphics and Computer-Aided Design

Reference56 articles.

1. Symmetrical Dense Optical Flow Estimation with Occlusions Detection

2. Sameer Ansari , Neal Wadhwa , Rahul Garg , and Jiawen Chen . 2019. Wireless software synchronization of multiple distributed cameras . In ICCP. IEEE , Tokyo, Japan , 1--9. Sameer Ansari, Neal Wadhwa, Rahul Garg, and Jiawen Chen. 2019. Wireless software synchronization of multiple distributed cameras. In ICCP. IEEE, Tokyo, Japan, 1--9.

3. Kelvin C.K. Chan , Xintao Wang , Xiangyu Xu , Jinwei Gu , and Chen Change Loy . 2021 . GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution . In CVPR. IEEE , Virtual/ Online , 14245--14254. Kelvin C.K. Chan, Xintao Wang, Xiangyu Xu, Jinwei Gu, and Chen Change Loy. 2021. GLEAN: Generative Latent Bank for Large-Factor Image Super-Resolution. In CVPR. IEEE, Virtual/Online, 14245--14254.

4. Ferenc Huszar Jose Caballero Andrew Cunningham Alejandro Acosta Andrew Aitken Alykhan Tejani Johannes Totz Zehan Wang Wenzhe Shi Christian Ledig Lucas Theis. 2017. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network. In CVPR. Ferenc Huszar Jose Caballero Andrew Cunningham Alejandro Acosta Andrew Aitken Alykhan Tejani Johannes Totz Zehan Wang Wenzhe Shi Christian Ledig Lucas Theis. 2017. Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network. In CVPR.

5. Xiaodong Cun and Chi-Man Pun. 2020. Defocus blur detection via depth distillation. In ECCV. Xiaodong Cun and Chi-Man Pun. 2020. Defocus blur detection via depth distillation. In ECCV.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3