Deep multiple instance learning versus conventional deep single instance learning for interpretable oral cancer detection-Reference-Cited by-同舟云学术

Deep multiple instance learning versus conventional deep single instance learning for interpretable oral cancer detection

Published:2024-04-30 Issue:4 Volume:19 Page:e0302169
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Koriakina Nadezhda^ORCID,Sladoje Nataša,Bašić Vladimir,Lindblad Joakim

Abstract

The current medical standard for setting an oral cancer (OC) diagnosis is histological examination of a tissue sample taken from the oral cavity. This process is time-consuming and more invasive than an alternative approach of acquiring a brush sample followed by cytological analysis. Using a microscope, skilled cytotechnologists are able to detect changes due to malignancy; however, introducing this approach into clinical routine is associated with challenges such as a lack of resources and experts. To design a trustworthy OC detection system that can assist cytotechnologists, we are interested in deep learning based methods that can reliably detect cancer, given only per-patient labels (thereby minimizing annotation bias), and also provide information regarding which cells are most relevant for the diagnosis (thereby enabling supervision and understanding). In this study, we perform a comparison of two approaches suitable for OC detection and interpretation: (i) conventional single instance learning (SIL) approach and (ii) a modern multiple instance learning (MIL) method. To facilitate systematic evaluation of the considered approaches, we, in addition to a real OC dataset with patient-level ground truth annotations, also introduce a synthetic dataset—PAP-QMNIST. This dataset shares several properties of OC data, such as image size and large and varied number of instances per bag, and may therefore act as a proxy model of a real OC dataset, while, in contrast to OC data, it offers reliable per-instance ground truth, as defined by design. PAP-QMNIST has the additional advantage of being visually interpretable for non-experts, which simplifies analysis of the behavior of methods. For both OC and PAP-QMNIST data, we evaluate performance of the methods utilizing three different neural network architectures. Our study indicates, somewhat surprisingly, that on both synthetic and real data, the performance of the SIL approach is better or equal to the performance of the MIL approach. Visual examination by cytotechnologist indicates that the methods manage to identify cells which deviate from normality, including malignant cells as well as those suspicious for dysplasia. We share the code as open source.

Funder

VINNOVA

Vetenskapsrådet

Cancerfonden

Publisher

Public Library of Science (PLoS)

Reference28 articles.

1. Clinical study on primary screening of oral cancer and precancerous lesions by oral cytology;S Sukegawa;Diagnostic Pathology,2020

2. Lu J, Sladoje N, Stark CR, Ramqvist ED, Hirsch JM, Lindblad J. A deep learning based pipeline for efficient oral cancer screening on whole slide images. In: International Conference on Image Analysis and Recognition. Springer; 2020. p. 249–261.

3. A review of computational methods for cervical cells segmentation and abnormality classification;T Conceição;International journal of molecular sciences,2019

4. Not-so-supervised: A survey of semi-supervised, multi-instance, and transfer learning in medical image analysis;V Cheplygina;Medical Image Analysis,2019

5. Deep learning for computational cytology: A survey;H Jiang;Medical Image Analysis,2022