Affiliation:
1. School of Computer Science and Technology Zhejiang Sci‐Tech University Hangzhou China
2. Department of Neurosurgery The 903rd Hospital of The Chinese People's Liberation Army Hangzhou China
3. National Heart and Lung Institute Imperial College London London UK
4. Bioengineering Department and Imperial‐X Imperial College London London UK
5. Cardiovascular Research Centre Royal Brompton Hospital London UK
6. School of Biomedical Engineering and Imaging Sciences King's College London London UK
Abstract
ABSTRACTIn the field of medical image analysis, object detection plays a crucial role by providing interpretable diagnostic information to healthcare professionals. Although current object detection models have achieved remarkable success in conventional images, their performance in detecting abnormalities in medical images has not been as satisfactory. This is primarily due to the complexity of anatomical structures in medical images, and the fact that some lesions may have subtle features, particularly in the case of early‐stage, small‐scale abnormalities. To address this challenge, we introduce SOCR‐YOLO, a novel lesion detection model with online convolutional reparameterization based on channel shuffling. First, it employs the SOCR (Shuffled Channel with Online Convolutional Re‐parameterization) module to establish a connection between feature concatenation and computational efficiency, aiming to extract more comprehensive information while reducing time consumption. Second, it incorporates the Bi‐FPN structure to achieve multiscale feature fusion. Lastly, the loss function has been optimized to improve the model training process. We evaluated two datasets, chest x‐ray (Vindr‐CXR) and brain tumor (Br35H), provided by the Kaggle competition. Experimental results show that the proposed method has outperformed several state‐of‐the‐art models, including YOLOv8, YOLO‐NAS, and RT‐DETR, in both speed and accuracy. Notably, in the context of chest x‐ray anomaly detection, SOCR‐YOLO exhibits a 1.8% enhancement in accuracy over YOLOv8 while simultaneously reducing floating‐point operations by 26.3%. Additionally, a similar 1.8% improvement in accuracy is observed in the detection of brain tumors. The results indicate the superior ability of our model to detect multiscale variations and small lesions.
Funder
National Natural Science Foundation of China
Key Research and Development Program of Ningxia
Horizon 2020 Framework Programme
Medical Research Council
Royal Society
Reference49 articles.
1. VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations
2. W.Lv S.Xu Y.Zhao et al. “Detrs Beat Yolos on Real‐Time Object Detection ”arXiv Preprint arXiv:2304.08069 2023.
3. A.Vaswani N.Shazeer N.Parmar et al. “Attention Is all You Need ”arXiv Preprint arXiv: 1706.03762 2022.
4. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks