Surgical Gesture Recognition in Laparoscopic Tasks Based on the Transformer Network and Self-Supervised Learning-Reference-Cited by-同舟云学术

Surgical Gesture Recognition in Laparoscopic Tasks Based on the Transformer Network and Self-Supervised Learning

Published:2022-11-29 Issue:12 Volume:9 Page:737
ISSN:2306-5354
Container-title:Bioengineering
language:en
Short-container-title:Bioengineering

Author:

Gazis Athanasios^ORCID,Karaiskos Pantelis^ORCID,Loukas Constantinos^ORCID

Abstract

In this study, we propose a deep learning framework and a self-supervision scheme for video-based surgical gesture recognition. The proposed framework is modular. First, a 3D convolutional network extracts feature vectors from video clips for encoding spatial and short-term temporal features. Second, the feature vectors are fed into a transformer network for capturing long-term temporal dependencies. Two main models are proposed, based on the backbone framework: C3DTrans (supervised) and SSC3DTrans (self-supervised). The dataset consisted of 80 videos from two basic laparoscopic tasks: peg transfer (PT) and knot tying (KT). To examine the potential of self-supervision, the models were trained on 60% and 100% of the annotated dataset. In addition, the best-performing model was evaluated on the JIGSAWS robotic surgery dataset. The best model (C3DTrans) achieves an accuracy of 88.0%, a 95.2% clip level, and 97.5% and 97.9% (gesture level), for PT and KT, respectively. The SSC3DTrans performed similar to C3DTrans when training on 60% of the annotated dataset (about 84% and 93% clip-level accuracies for PT and KT, respectively). The performance of C3DTrans on JIGSAWS was close to 76% accuracy, which was similar to or higher than prior techniques based on a single video stream, no additional video training, and online processing.

Publisher

MDPI AG

Subject

Bioengineering

Link

https://www.mdpi.com/2306-5354/9/12/737/pdf

Reference31 articles.

1. Computer vision in surgery;Ward;Surgery,2021

2. Machine learning for surgical phase recognition: A systematic review;Garrow;Ann. Surg.,2021

3. Gesture Recognition in Robotic Surgery: A Review;Clarkson;IEEE Trans. Biomed. Eng.,2021

4. Gao, Y., Vedula, S.S., Reiley, C.E., Ahmidi, N., Varadarajan, B., Lin, H.C., Tao, L., Zappella, L., Béjar, B., and Yuh, D.D. (2014, January 25). JHU-ISI Gesture and Skill Assessment Working Set (JIGSAWS): A Surgical Activity Dataset for Human Motion Modeling. Proceedings of the Modeling and Monitoring of Computer Assisted Interventions (M2CAI)—MICCAI Workshop, Boston, MA, USA.

5. Tao, L., Zappella, L., Hager, G., and Vidal, R. (2013, January 22–26). Surgical Gesture Segmentation and Recognition. Proceedings of the International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), Nagoya, Japan.

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Research on Surgical Gesture Recognition in Open Surgery Based on Fusion of R3D and Multi-Head Attention Mechanism;Applied Sciences;2024-09-07

2. Automated performance metrics and surgical gestures: two methods for assessment of technical skills in robotic surgery;Journal of Robotic Surgery;2024-07-27

3. Surgical gestures can be used to assess surgical competence in robot-assisted surgery;Journal of Robotic Surgery;2024-01-20

4. Artificial Intelligence and Robotics-Based Minimally Invasive Surgery;Advances in Healthcare Information Systems and Administration;2023-06-30

5. Artificial Intelligence for Personalized Genetics and New Drug Development: Benefits and Cautions;Bioengineering;2023-05-19