Classification of Mobile-Based Oral Cancer Images Using the Vision Transformer and the Swin Transformer
Author:
Song Bofan1, KC Dharma Raj2ORCID, Yang Rubin Yuchan2, Li Shaobai1, Zhang Chicheng2, Liang Rongguang1
Affiliation:
1. Wyant College of Optical Sciences, The University of Arizona, Tucson, AZ 85721, USA 2. Computer Science Department, The University of Arizona, Tucson, AZ 85721, USA
Abstract
Oral cancer, a pervasive and rapidly growing malignant disease, poses a significant global health concern. Early and accurate diagnosis is pivotal for improving patient outcomes. Automatic diagnosis methods based on artificial intelligence have shown promising results in the oral cancer field, but the accuracy still needs to be improved for realistic diagnostic scenarios. Vision Transformers (ViT) have outperformed learning CNN models recently in many computer vision benchmark tasks. This study explores the effectiveness of the Vision Transformer and the Swin Transformer, two cutting-edge variants of the transformer architecture, for the mobile-based oral cancer image classification application. The pre-trained Swin transformer model achieved 88.7% accuracy in the binary classification task, outperforming the ViT model by 2.3%, while the conventional convolutional network model VGG19 and ResNet50 achieved 85.2% and 84.5% accuracy. Our experiments demonstrate that these transformer-based architectures outperform traditional convolutional neural networks in terms of oral cancer image classification, and underscore the potential of the ViT and the Swin Transformer in advancing the state of the art in oral cancer image analysis.
Funder
National Institute of Cancers National Institute of Dental and Craniofacial Research National Institutes of Health Tobacco-Related Disease Research Program
Reference42 articles.
1. (2023, December 01). WHO Mortality Database. Available online: https://platform.who.int/mortality/themes/theme-details/topics/indicator-groups/indicators/indicator-details/MDB/a-lip-and-oral-cavity-cancers. 2. Cancer statistics for the year 2020: An overview;Ferlay;Int. J. Cancer,2021 3. The impact of delayed diagnosis on the outcomes of oral cancer patients: A retrospective cohort study;Thomas;Int. J. Oral Maxillofac. Surg.,2021 4. Challenges in early diagnosis of oral cancer: Cases series;Sujir;Acta Stomatol. Croat.,2019 5. Uthoff, R.D., Song, B., Sunny, S., Patrick, S., Suresh, A., Kolur, T., Keerthi, G., Spires, O., Anbarani, A., and Wilder-Smith, P. (2018). Point-of-care, smartphone-based, dual-modality, dual-view, oral cancer screening device with neural network classification for low-resource communities. PLoS ONE, 13.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|