MCNMF-Unet: a mixture Conv-MLP network with multi-scale features fusion Unet for medical image segmentation-Reference-Cited by-同舟云学术

MCNMF-Unet: a mixture Conv-MLP network with multi-scale features fusion Unet for medical image segmentation

Published:2024-01-12 Issue: Volume:10 Page:e1798
ISSN:2376-5992
Container-title:PeerJ Computer Science
language:en
Short-container-title:

Author:

Yuan Lei¹,Song Jianhua¹,Fan Yazhuo¹

Affiliation:

1. Key Laboratory of Light Field Manipulation and System Integration Applications in Fujian Province, School of Physics and Information Engineering, Minnan Normal University, Zhangzhou, Fujian, China

Abstract

Recently, the medical image segmentation scheme combining Vision Transformer (ViT) and multilayer perceptron (MLP) has been widely used. However, one of its disadvantages is that the feature fusion ability of different levels is weak and lacks flexible localization information. To reduce the semantic gap between the encoding and decoding stages, we propose a mixture conv-MLP network with multi-scale features fusion Unet (MCNMF-Unet) for medical image segmentation. MCNMF-Unet is a U-shaped network based on convolution and MLP, which not only inherits the advantages of convolutional in extracting underlying features and visual structures, but also utilizes MLP to fuse local and global information of each layer of the network. MCNMF-Unet performs multi-layer fusion and multi-scale feature map skip connections in each network stage so that all the feature information can be fully utilized and the gradient disappearance problem can be alleviated. Additionally, MCNMF-Unet incorporates a multi-axis and multi-windows MLP module. This module is fully end-to-end and eliminates the need to consider the negative impact of image cropping. It not only fuses information from multiple dimensions and receptive fields but also reduces the number of parameters and computational complexity. We evaluated the proposed model on BUSI, ISIC2018 and CVC-ClinicDB datasets. The experimental results show that the performance of our proposed model is superior to most existing networks, with an IoU of 84.04% and a F1-score of 91.18%.

Funder

The Natural Science Foundation of Fujian Province

The Principal Foundation of Minnan Normal University

Publisher

PeerJ

Link

https://peerj.com/articles/cs-1798.pdf

Reference59 articles.

1. Dataset of breast ultrasound images;Al-Dhabyani;Data in Brief,2020

2. DAE-Former: dual attention-guided efficient transformer for medical image segmentation;Azad,2022

3. Enhancing medical image segmentation with transception: a multi-scale feature fusion approach;Azad,2023

4. WM-DOVA maps for accurate polyp highlighting in colonoscopy: validation vs. saliency maps from physicians;Bernal;Computerized Medical Imaging and Graphics,2015

5. Swin-unet: Unet-like pure transformer for medical image segmentation;Cao,2023

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. AFC-Unet: Attention-fused full-scale CNN-transformer unet for medical image segmentation;Biomedical Signal Processing and Control;2025-01

2. HCT-Unet: multi-target medical image segmentation via a hybrid CNN-transformer Unet incorporating multi-axis gated multi-layer perceptron;The Visual Computer;2024-09-06

3. UTAC-Net: A Semantic Segmentation Model for Computer-Aided Diagnosis for Ischemic Region Based on Nuclear Medicine Cerebral Perfusion Imaging;Electronics;2024-04-12

4. MSR-Net: Multi-Scale Residual Network Based on Attention Mechanism for Pituitary Adenoma MRI Image Segmentation;IEEE Access;2024