Affiliation:
1. Department of Automation, Shanghai Jiao Tong University, Shanghai 200240, China
2. Institute of Medical Robotics, Shanghai Jiao Tong University, Shanghai 200240, China
Abstract
With stereo cameras becoming widely used in invasive surgery systems, stereo endoscopic images provide important depth information for delicate surgical tasks. However, the small size of sensors and their limited lighting conditions lead to low-quality and low-resolution endoscopic images and videos. In this paper, we propose a stereo endoscopic video super-resolution method using transformer with a hybrid attention mechanism named HA-VSR. Stereo video SR aims to reconstruct high-resolution (HR) images from corresponding low-resolution (LR) videos. In our method, the stereo correspondence and temporal correspondence are incorporated into the HA-VSR model. Specifically, the Swin transformer architecture is utilized in proposed framework with hybrid attention mechanisms. The parallel attention mechanism is utilized by using the symmetry and consistency of left and right images, and the temporal attention mechanism is utilized by using the consistency of consecutive frames. Detailed quantitative evaluation and experiments on two datasets show the proposed model achieves advanced SR reconstruction performance, showing that the proposed stereo VSR framework outperforms alternative approaches.
Funder
Ministry of Science and Technology, China
Subject
Physics and Astronomy (miscellaneous),General Mathematics,Chemistry (miscellaneous),Computer Science (miscellaneous)
Reference31 articles.
1. Review of emerging surgical robotic technology;Peters;Surg. Endosc.,2018
2. Possibilities and limitations of current stereo-endoscopy;Limberger;Surg. Endosc.,2004
3. Recent development of computer vision technology to improve capsule endoscopy;Park;Clin. Endosc.,2019
4. Wang, C.C., Chiu, Y.C., Chen, W.L., Yang, T.W., Tsai, M.C., and Tseng, M.H. (2021). A deep learning model for classification of endoscopic gastroesophageal reflux disease. Int. J. Environ. Res. Public Health, 18.
5. Deep learning for detection and segmentation of artefact and disease instances in gastrointestinal endoscopy;Ali;Med. Image Anal.,2021
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献