TDT-MIL: a framework with a dual-channel spatial positional encoder for weakly-supervised whole slide image classification-Reference-Cited by-同舟云学术

TDT-MIL: a framework with a dual-channel spatial positional encoder for weakly-supervised whole slide image classification

Published:2024-09-13 Issue:10 Volume:15 Page:5831
ISSN:2156-7085
Container-title:Biomedical Optics Express
language:en
Short-container-title:Biomed. Opt. Express

Author:

Zhang Hongbin^ORCID,Feng Ya,Zhang Jin,Li Guangli,Wu Jianguo¹,Ji Donghong²

Affiliation:

1. The Second Affiliated Hospital of Nanchang University

2. Wuhan University

Abstract

The classic multiple instance learning (MIL) paradigm is harnessed for weakly-supervised whole slide image (WSI) classification. The spatial position relationship located between positive tissues is crucial for this task due to the small percentage of these tissues in billions of pixels, which has been overlooked by most studies. Therefore, we propose a framework called TDT-MIL. We first serially connect a convolutional neural network and transformer for basic feature extraction. Then, a novel dual-channel spatial positional encoder (DCSPE) module is designed to simultaneously capture the complementary local and global positional information between instances. To further supplement the spatial position relationship, we construct a convolutional triple-attention (CTA) module to attend to the inter-channel information. Thus, the spatial positional and inter-channel information is fully mined by our model to characterize the key pathological semantics in WSI. We evaluated TDT-MIL on two publicly available datasets, including CAMELYON16 and TCGA-NSCLC, with the corresponding classification accuracy and AUC up to 91.54%, 94.96%, and 90.21%, 94.36%, respectively, outperforming state-of-the-art baselines. More importantly, our model possesses a satisfactory capability in solving the imbalanced WSI classification task using an ingenious but interpretable structure.

Funder

National Natural Science Foundation of China

Key Research and Development Plan of Jiangxi Provincial Science and Technology Department

Humanities and Social Science Fund of Ministry of Education of China

Natural Science Foundation of Jiangxi Provincial Department of Science and Technology

Humanity and Social Science Foundation of the Jiangxi Province

Publisher

Optica Publishing Group

Reference35 articles.

1. Whole-slide Imaging

2. Review of the current state of whole slide imaging in pathology

3. Clinical-grade computational pathology using weakly supervised deep learning on whole slide images

4. Single image super-resolution for whole slide image using convolutional neural networks and self-supervised color normalization

5. A survey on artificial intelligence in histopathology image analysis