High-Speed and Accurate Diagnosis of Gastrointestinal Disease: Learning on Endoscopy Images Using Lightweight Transformer with Local Feature Attention-Reference-Cited by-同舟云学术

High-Speed and Accurate Diagnosis of Gastrointestinal Disease: Learning on Endoscopy Images Using Lightweight Transformer with Local Feature Attention

Published:2023-12-13 Issue:12 Volume:10 Page:1416
ISSN:2306-5354
Container-title:Bioengineering
language:en
Short-container-title:Bioengineering

Author:

Wu Shibin¹,Zhang Ruxin¹,Yan Jiayi¹,Li Chengquan²,Liu Qicai³,Wang Liyang²,Wang Haoqian¹

Affiliation:

1. Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, China

2. School of Clinical Medicine, Tsinghua University, Beijing 100084, China

3. Vanke School of Public Health, Tsinghua University, Beijing 100084, China

Abstract

In response to the pressing need for robust disease diagnosis from gastrointestinal tract (GIT) endoscopic images, we proposed FLATer, a fast, lightweight, and highly accurate transformer-based model. FLATer consists of a residual block, a vision transformer module, and a spatial attention block, which concurrently focuses on local features and global attention. It can leverage the capabilities of both convolutional neural networks (CNNs) and vision transformers (ViT). We decomposed the classification of endoscopic images into two subtasks: a binary classification to discern between normal and pathological images and a further multi-class classification to categorize images into specific diseases, namely ulcerative colitis, polyps, and esophagitis. FLATer has exhibited exceptional prowess in these tasks, achieving 96.4% accuracy in binary classification and 99.7% accuracy in ternary classification, surpassing most existing models. Notably, FLATer could maintain impressive performance when trained from scratch, underscoring its robustness. In addition to the high precision, FLATer boasted remarkable efficiency, reaching a notable throughput of 16.4k images per second, which positions FLATer as a compelling candidate for rapid disease identification in clinical practice.

Funder

Shenzhen Science and Technology Project

Publisher

MDPI AG

Subject

Bioengineering

Link

https://www.mdpi.com/2306-5354/10/12/1416/pdf

Reference43 articles.

1. Cancer statistics in China, 2015;Chen;Cancer J. Clin.,2016

2. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries;Bray;Cancer J. Clin.,2018

3. Lee, J.M., Park, Y.M., Yun, J.S., Ahn, Y.B., Lee, K.M., Kim, D.B., Lee, J.M., Han, K., and Ko, S.H. (2020). The association between nonalcoholic fatty liver disease and esophageal, stomach, or colorectal cancer: National population-based cohort study. PLoS ONE, 15.

4. Factors influencing the miss rate of polyps in a back-to-back colonoscopy study;Leufkens;Endoscopy,2012

5. Chronic ulcerative colitis and colorectal cancer;Rogler;Cancer Lett.,2014

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Diagnostic Accuracy of Artificial Intelligence in Endoscopy: Umbrella Review;JMIR Medical Informatics;2024-07-15

2. An Overview Comparison between Convolutional Neural Networks and Vision Transformers;Proceedings of the 7th International Conference on Networking, Intelligent Systems and Security;2024-04-18