Author:
Chincholi Farheen,Koestler Harald
Abstract
Although the Vision Transformer architecture has become widely accepted as the standard for image classification tasks, using it for object detection in computer vision poses significant challenges. This research aims to explore the potential of extending the Vision Transformer for object detection in medical imaging, specifically for glaucoma detection, and also includes an examination of the Detection Transformer for comparative analysis. The analysis involves assessing the cup-to-disc ratio and identifying signs of vertical thinning of the neuroretinal rim. A diagnostic threshold is proposed, flagging a cup-to-disc ratio exceeding 0.6 as a potential indicator of glaucoma. The experimental results demonstrate a remarkable 90.48% accuracy achieved by the pre-trained Detection Transformer, while the Vision Transformer exhibits competitive accuracy at 87.87%. Comparative evaluations leverage a previously untapped dataset from the Standardized Fundus Glaucoma Dataset available on Kaggle, providing valuable insights into automated glaucoma detection. The evaluation criteria and results are comprehensively validated by medical experts specializing in the field of glaucoma.
Reference19 articles.
1. AhalliA. D.
Smdg modified2023
2. “Automated glaucoma diagnosis using deep learning approach,”;Al-Bander;2017 14th International Multi-Conference on Systems, Signals and Devices (SSD),2017
3. Glaucoma diagnosis with machine learning based on optical coherence tomography and color fundus images;An;J. Healthc. Eng,2019
4. “End-to-end object detection with transformers,”;Carion,2020
5. “Glaucoma detection based on deep convolutional neural network,”;Chen;2015 37th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC),2015
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献