Affiliation:
1. Key Laboratory of Light Field Manipulation and System Integration Applications in Fujian Province, School of Physics and Information Engineering, Minnan Normal University, Zhangzhou, Fujian, China
Abstract
Recently, the medical image segmentation scheme combining Vision Transformer (ViT) and multilayer perceptron (MLP) has been widely used. However, one of its disadvantages is that the feature fusion ability of different levels is weak and lacks flexible localization information. To reduce the semantic gap between the encoding and decoding stages, we propose a mixture conv-MLP network with multi-scale features fusion Unet (MCNMF-Unet) for medical image segmentation. MCNMF-Unet is a U-shaped network based on convolution and MLP, which not only inherits the advantages of convolutional in extracting underlying features and visual structures, but also utilizes MLP to fuse local and global information of each layer of the network. MCNMF-Unet performs multi-layer fusion and multi-scale feature map skip connections in each network stage so that all the feature information can be fully utilized and the gradient disappearance problem can be alleviated. Additionally, MCNMF-Unet incorporates a multi-axis and multi-windows MLP module. This module is fully end-to-end and eliminates the need to consider the negative impact of image cropping. It not only fuses information from multiple dimensions and receptive fields but also reduces the number of parameters and computational complexity. We evaluated the proposed model on BUSI, ISIC2018 and CVC-ClinicDB datasets. The experimental results show that the performance of our proposed model is superior to most existing networks, with an IoU of 84.04% and a F1-score of 91.18%.
Funder
The Natural Science Foundation of Fujian Province
The Principal Foundation of Minnan Normal University
Reference59 articles.
1. Dataset of breast ultrasound images;Al-Dhabyani;Data in Brief,2020
2. DAE-Former: dual attention-guided efficient transformer for medical image segmentation;Azad,2022
3. Enhancing medical image segmentation with transception: a multi-scale feature fusion approach;Azad,2023
4. WM-DOVA maps for accurate polyp highlighting in colonoscopy: validation vs. saliency maps from physicians;Bernal;Computerized Medical Imaging and Graphics,2015
5. Swin-unet: Unet-like pure transformer for medical image segmentation;Cao,2023
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献