Pathological Insights: Enhanced Vision Transformers for the Early Detection of Colorectal Cancer

Published: 2024-04-08 Issue: 7 Volume: 16 Page: 1441
ISSN: 2072-6694
Container-title: Cancers
Language: en
Short-container-title: Cancers

Author:

Ayana Gelan¹,², Barki Hika³, Choe Se-woon¹,⁴,⁵

Affiliation:

1. Department of Medical IT Convergence Engineering, Kumoh National Institute of Technology, Gumi 39253, Republic of Korea

2. School of Biomedical Engineering, Jimma University, Jimma 378, Ethiopia

3. Department of Artificial Intelligence Convergence, Pukyong National University, Busan 48513, Republic of Korea

4. Department of IT Convergence Engineering, Kumoh National Institute of Technology, Gumi 39253, Republic of Korea

5. Emerging Pathogens Institute, University of Florida, Gainesville, FL 32608, USA

Abstract

Endoscopic pathological findings of the gastrointestinal tract are crucial for the early diagnosis of colorectal cancer (CRC). Previous deep learning works, aimed at improving CRC detection performance and reducing subjective analysis errors, are limited to polyp segmentation. Pathological findings were not considered and only convolutional neural networks (CNNs), which are not able to handle global image feature information, were utilized. This work introduces a novel vision transformer (ViT)-based approach for early CRC detection. The core components of the proposed approach are ViTCol, a boosted vision transformer for classifying endoscopic pathological findings, and PUTS, a vision transformer-based model for polyp segmentation. Results demonstrate the superiority of this vision transformer-based CRC detection method over existing CNN and vision transformer models. ViTCol exhibited an outstanding performance in classifying pathological findings, with an area under the receiver operating characteristic curve (AUC) value of 0.9999 ± 0.001 on the Kvasir dataset. PUTS provided outstanding results in segmenting polyp images, with mean intersection over union (mIoU) of 0.8673 and 0.9092 on the Kvasir-SEG and CVC-Clinic datasets, respectively. This work underscores the value of spatial transformers in localizing input images, which can seamlessly integrate into the main vision transformer network, enhancing the automated identification of critical image features for early CRC detection.
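
For readers unfamiliar with the segmentation metric quoted above, the following is a minimal sketch of how a mean intersection over union could be computed for binary polyp masks. It is illustrative only and is not taken from the paper: the function names `iou` and `mean_iou`, the per-image averaging, and the convention that two empty masks score 1.0 are all assumptions, and the authors may define mIoU differently (for example, by averaging over foreground and background classes).

```python
from typing import Sequence

Mask = Sequence[Sequence[int]]  # 2-D binary mask: 1 = polyp, 0 = background


def iou(pred: Mask, target: Mask) -> float:
    """Intersection over union of two same-sized binary masks."""
    inter = union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            inter += 1 if (p and t) else 0
            union += 1 if (p or t) else 0
    # Assumed convention: two empty masks count as a perfect match.
    return inter / union if union else 1.0


def mean_iou(preds: Sequence[Mask], targets: Sequence[Mask]) -> float:
    """Average the per-image IoU over a set of images (assumed definition)."""
    scores = [iou(p, t) for p, t in zip(preds, targets)]
    return sum(scores) / len(scores)


pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
print(iou(pred, target))  # 2 overlapping pixels / 4 pixels in the union = 0.5
```

Under this reading, a reported mIoU of 0.8673 would mean that, averaged over the test images, the predicted and ground-truth polyp regions share about 87% of their combined area.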

Funder

National Research Foundation of Korea

Korea Ministry of SMEs and Startups

Publisher

MDPI AG

Link

https://www.mdpi.com/2072-6694/16/7/1441/pdf
