Joint estimation of depth and motion from a monocular endoscopy image sequence using a multi-loss rebalancing network-Reference-Cited by-同舟云学术

Joint estimation of depth and motion from a monocular endoscopy image sequence using a multi-loss rebalancing network

Published:2022-04-11 Issue:5 Volume:13 Page:2707
ISSN:2156-7085
Container-title:Biomedical Optics Express
language:en
Short-container-title:Biomed. Opt. Express

Author:

Liu Shiyuan¹,Fan Jingfan¹,Song Dengpan¹,Fu Tianyu¹,Lin Yucong¹,Xiao Deqiang¹,Song Hong²,Wang Yongtian¹,Yang Jian¹

Affiliation:

1. School of Optics and Photonics, Beijing Institute of Technology

2. School of Computer Science and Technology, Beijing Institute of Technology

Abstract

Building an in vivo three-dimensional (3D) surface model from a monocular endoscopy is an effective technology to improve the intuitiveness and precision of clinical laparoscopic surgery. This paper proposes a multi-loss rebalancing-based method for joint estimation of depth and motion from a monocular endoscopy image sequence. The feature descriptors are used to provide monitoring signals for the depth estimation network and motion estimation network. The epipolar constraints of the sequence frame is considered in the neighborhood spatial information by depth estimation network to enhance the accuracy of depth estimation. The reprojection information of depth estimation is used to reconstruct the camera motion by motion estimation network with a multi-view relative pose fusion mechanism. The relative response loss, feature consistency loss, and epipolar consistency loss function are defined to improve the robustness and accuracy of the proposed unsupervised learning-based method. Evaluations are implemented on public datasets. The error of motion estimation in three scenes decreased by 42.1%,53.6%, and 50.2%, respectively. And the average error of 3D reconstruction is 6.456 ± 1.798mm. This demonstrates its capability to generate reliable depth estimation and trajectory reconstruction results for endoscopy images and meaningful applications in clinical.

Funder

National Natural Science Foundation of China

Beijing Nova Program

National Key R&D Program of Zhejiang Province

Beijing Institute of Technology Research Fund Program for Young Scholars

Publisher

Optica Publishing Group

Subject

Atomic and Molecular Physics, and Optics,Biotechnology

Reference36 articles.

1. A Robotic System With Multichannel Flexible Parallel Manipulators for Single Port Access Surgery

2. Live Tracking and Dense Reconstruction for Handheld Monocular Endoscopy

3. Perception enhancement using importance-driven hybrid rendering for augmented reality based endoscopic surgical navigation

4. Multimodal endoscopic system based on multispectral and photometric stereo imaging and analysis

5. ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Multi-scale, multi-dimensional binocular endoscopic image depth estimation network;Computers in Biology and Medicine;2023-09

2. 多模态图像引导手术导航进展;Acta Optica Sinica;2023

3. Where do we stand in AI for endoscopic image analysis? Deciphering gaps and future directions;npj Digital Medicine;2022-12-20