Challenges in multi-centric generalization: phase and step recognition in Roux-en-Y gastric bypass surgery

Published:2024-05-18
ISSN:1861-6429
Container-title:International Journal of Computer Assisted Radiology and Surgery
language:en
Short-container-title:Int J CARS

Author:

Lavanchy Joël L., Ramesh Sanat, Dall’Alba Diego, Gonzalez Cristians, Fiorini Paolo, Müller-Stich Beat P., Nett Philipp C., Marescaux Jacques, Mutter Didier, Padoy Nicolas

Abstract

Purpose: Most studies on surgical activity recognition using artificial intelligence (AI) have focused on recognizing one type of activity from small, mono-centric surgical video datasets. It remains speculative whether those models would generalize to other centers.

Methods: In this work, we introduce a large multi-centric multi-activity dataset consisting of 140 surgical videos (MultiBypass140) of laparoscopic Roux-en-Y gastric bypass (LRYGB) surgeries performed at two medical centers, i.e., the University Hospital of Strasbourg, France (StrasBypass70) and Inselspital, Bern University Hospital, Switzerland (BernBypass70). The dataset has been fully annotated with phases and steps by two board-certified surgeons. Furthermore, we assess the generalizability and benchmark different deep learning models for the task of phase and step recognition in 7 experimental studies: (1) training and evaluation on BernBypass70; (2) training and evaluation on StrasBypass70; (3) training and evaluation on the joint MultiBypass140 dataset; (4) training on BernBypass70, evaluation on StrasBypass70; (5) training on StrasBypass70, evaluation on BernBypass70; (6) training on MultiBypass140, evaluation on BernBypass70; and (7) training on MultiBypass140, evaluation on StrasBypass70.

Results: The model’s performance is markedly influenced by the training data. The worst results were obtained in experiments (4) and (5), confirming the limited generalization capabilities of models trained on mono-centric data. The use of multi-centric training data, as in experiments (6) and (7), improves the generalization capabilities of the models, bringing them beyond the level of independent mono-centric training and validation (experiments (1) and (2)).

Conclusion: MultiBypass140 shows considerable variation in surgical technique and workflow of LRYGB procedures between centers. Accordingly, the generalization experiments show marked differences in model performance. These results highlight the importance of multi-centric datasets for AI model generalization to account for variance in surgical technique and workflows. The dataset and code are publicly available at https://github.com/CAMMA-public/MultiBypass140.
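
The seven train/evaluation configurations listed in the abstract can be written out as a small lookup table. The sketch below is illustrative only and is not taken from the MultiBypass140 repository: the dataset labels come from the abstract, while the `EXPERIMENTS` table, the `setting` helper, and the category names it returns are hypothetical.

```python
# Illustrative sketch only: dataset labels are from the abstract; the table,
# the helper function, and the category names are assumptions for this example.
BERN = "BernBypass70"
STRAS = "StrasBypass70"
MULTI = "MultiBypass140"

# experiment number -> (training set, evaluation set), as listed in the abstract
EXPERIMENTS = {
    1: (BERN, BERN),
    2: (STRAS, STRAS),
    3: (MULTI, MULTI),
    4: (BERN, STRAS),
    5: (STRAS, BERN),
    6: (MULTI, BERN),
    7: (MULTI, STRAS),
}


def setting(train, evaluation):
    """Describe how the training and evaluation data relate (hypothetical labels)."""
    if train == evaluation:
        return "joint" if train == MULTI else "mono-centric"
    if train == MULTI:
        return "multi-centric training"
    return "cross-center"


for number, (train, evaluation) in EXPERIMENTS.items():
    print(f"({number}) train={train} eval={evaluation} -> {setting(train, evaluation)}")
```

Read this way, experiments (4) and (5) are the only cross-center settings, which is where the abstract reports the worst results, and experiments (6) and (7) differ from them only in the training set.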

Funder

Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung

Novartis Stiftung für Medizinisch-Biologische Forschung

Horizon 2020 Framework Programme

Agence Nationale de la Recherche

University of Basel

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s11548-024-03166-3.pdf

References (24 articles)

1. Maier-Hein L, Eisenmann M, Sarikaya D et al (2022) Surgical data science - from concepts toward clinical translation. Med Image Anal 76:102306

2. Pedrett R, Mascagni P, Beldi G, Padoy N, Lavanchy JL (2023) Technical skill assessment in minimally invasive surgery using artificial intelligence: A systematic review. Surg Endosc 37:7412–7424

3. Meireles OR, Rosman G, Altieri MS, Carin L, Hager G, Madani A, Padoy N, Pugh CM, Sylla P, Ward TM, Hashimoto DA (2021) SAGES consensus recommendations on an annotation framework for surgical video. Surg Endosc 35(9):4918–4929

4. Garrow CR, Kowalewski K-F, Li L, Wagner M, Schmidt MW, Engelhardt S, Hashimoto DA, Kenngott HG, Bodenstedt S, Speidel S, Müller-Stich BP, Nickel F (2020) Machine learning for surgical phase recognition. Ann Surg 273(4):684–693

5. Twinanda AP, Shehata S, Mutter D, Marescaux J, de Mathelin M, Padoy N (2017) EndoNet: A deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imaging 36(1):86–97