Authors
Chen Yu-wen, Zhang Ju, Wang Peng, Hu Zheng-yu, Zhong Kun-hua
Abstract
Computer-assisted surgery (CAS) occupies an important position in modern surgery and continues to stimulate progress in methodology and technology. In recent years, many computer vision-based methods have been applied to surgical workflow recognition. Training these models requires a large amount of annotated data. However, annotating surgical data requires expert knowledge and is therefore difficult and time-consuming. In this paper, we focus on the problem of data deficiency and propose a knowledge transfer learning method based on artificial neural networks to compensate for the small amount of labeled training data. Specifically, we propose an unsupervised method for pre-training a Convolutional-De-Convolutional (CDC) neural network on the task of sequencing surgical workflow frames; the network simultaneously performs convolution in space (for semantic abstraction) and de-convolution in time (for frame-level resolution). Through transfer learning, we then only fine-tune the CDC neural network to classify the surgical phase. Experiments validating the model showed that it can effectively extract surgical features and determine the surgical phase. The accuracy (Acc), recall, and precision (Pres) of our model reached 91.4%, 78.9%, and 82.5%, respectively.
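The abstract reports accuracy, recall, and precision for phase recognition without stating how they are computed. As a minimal sketch, assuming each video frame carries one phase label and that precision and recall are macro-averaged over phases (an assumption, since the abstract does not specify the averaging), the three figures could be computed as follows. The function name `phase_metrics` and the toy labels are hypothetical and are not taken from the paper.

```python
from collections import Counter


def phase_metrics(y_true, y_pred):
    """Return (accuracy, macro precision, macro recall) for per-frame phase labels.

    Assumption: one label per frame, macro averaging over all phases that
    appear in either sequence. The paper may use a different protocol.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label sequences must be non-empty and equal in length")

    phases = sorted(set(y_true) | set(y_pred))
    true_count = Counter(y_true)   # frames that truly belong to each phase
    pred_count = Counter(y_pred)   # frames predicted as each phase
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)  # correct frames per phase

    accuracy = sum(hits.values()) / len(y_true)
    # A phase that is never predicted (or never occurs) contributes 0.0.
    precision = sum(
        hits[c] / pred_count[c] if pred_count[c] else 0.0 for c in phases
    ) / len(phases)
    recall = sum(
        hits[c] / true_count[c] if true_count[c] else 0.0 for c in phases
    ) / len(phases)
    return accuracy, precision, recall


# Toy example: eight frames, three phases (made-up labels, not study data).
y_true = [0, 0, 0, 1, 1, 2, 2, 2]
y_pred = [0, 0, 1, 1, 1, 2, 2, 0]
acc, prec, rec = phase_metrics(y_true, y_pred)
print(f"Acc={acc:.3f} Pres={prec:.3f} Recall={rec:.3f}")
```

Under macro averaging, short phases weigh as much as long ones, which is one possible reason recall and precision can sit well below frame-level accuracy when phase durations are unbalanced.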
Funder
National Key Research and Development Program of China
Youth Innovation Promotion Association of the Chinese Academy of Sciences
Subject
Cellular and Molecular Neuroscience, Neuroscience (miscellaneous)
Cited by
2 articles.