Convolutional-de-convolutional neural networks for recognition of surgical workflow-Reference-Cited by-同舟云学术

Convolutional-de-convolutional neural networks for recognition of surgical workflow

Published:2022-09-07 Issue: Volume:16 Page:
ISSN:1662-5188
Container-title:Frontiers in Computational Neuroscience
language:
Short-container-title:Front. Comput. Neurosci.

Author:

Chen Yu-wen,Zhang Ju,Wang Peng,Hu Zheng-yu,Zhong Kun-hua

Abstract

Computer-assisted surgery (CAS) has occupied an important position in modern surgery, further stimulating the progress of methodology and technology. In recent years, a large number of computer vision-based methods have been widely used in surgical workflow recognition tasks. For training the models, a lot of annotated data are necessary. However, the annotation of surgical data requires expert knowledge and thus becomes difficult and time-consuming. In this paper, we focus on the problem of data deficiency and propose a knowledge transfer learning method based on artificial neural network to compensate a small amount of labeled training data. To solve this problem, we propose an unsupervised method for pre-training a Convolutional-De-Convolutional (CDC) neural network for sequencing surgical workflow frames, which performs neural convolution in space (for semantic abstraction) and neural de-convolution in time (for frame level resolution) simultaneously. Specifically, through neural convolution transfer learning, we only fine-tuned the CDC neural network to classify the surgical phase. We performed some experiments for validating the model, and it showed that the proposed model can effectively extract the surgical feature and determine the surgical phase. The accuracy (Acc), recall, precision (Pres) of our model reached 91.4, 78.9, and 82.5%, respectively.

Funder

National Key Research and Development Program of China

Youth Innovation Promotion Association of the Chinese Academy of Sciences

Publisher

Frontiers Media SA

Subject

Cellular and Molecular Neuroscience,Neuroscience (miscellaneous)

Reference55 articles.

1. YouTube-8M: A large-scale video classification benchmark.;Abu-El-Haija;arXiv,2016

2. Real-time identification of operating room state from video;Bhatia;Proceedings of the 19th Conference on Innovative Applications of Artificial Intelligence (IAAI),2007

3. Automatic annotation of human actions in video;Calder;Paper Presented at the IEEE International Conference on Computer Vision.,2009

4. Semi-supervised spatio-temporal CNN for recognition of surgical workflow.;Chen;EURASIP J. Image Video Proc.,2018

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Orthopedic Joint Preservation: A Comprehensive Review;Advances in Surgical Sciences;2024-04-28

2. Development of a deep learning model for safe direct optical trocar insertion in minimally invasive surgery: an innovative method to prevent trocar injuries;Surgical Endoscopy;2023-08-09