SFE-TRANSUNET: A TRANSFORMER-BASED U-NET WITH SKIPPED FEATURES ENHANCER FOR MEDICAL IMAGE SEGMENTATION

Author:

HUANG JIANUO12345ORCID,LAN QUAN12345ORCID,DENG WEN678ORCID,HUANG CHENXI9ORCID,ZHANG SUZHEN10ORCID

Affiliation:

1. Guangxi Key Laboratory of Eye Health, Nanning 530021, P. R. China

2. Department of Ophthalmology, The People’s Hospital of Guangxi Zhuang Autonomous Region, Nanning 530021, P. R. China

3. Institute of Ophthalmic Diseases, Guangxi Academy of Medical Sciences, Nanning 530021, P. R. China

4. School of Computing and Data Science, Xiamen University Malaysia, Sepang 43900, Malaysia

5. Department of Neurology and Department of Neuroscience, The First Affiliated Hospital of Xiamen University, Xiamen 361003, P. R. China

6. Guangxi Key Laboratory of Eye Health & Department of Ophthalmology, Nanning 530021, P. R. China

7. The People’s Hospital of Guangxi Zhuang Autonomous, Region & Institute of Ophthalmic Diseases, Nanning 530021, P. R. China

8. Guangxi Academy of Medical Sciences, Nanning 530021, P. R. China

9. School of Computing and Data Science, Xiamen University Malaysia, Xiamen 361003, P. R. China

10. Department of Vascular Surgery, Zhongshan Hospital of Xiamen University, Xiamen, 361015, P. R. China

Abstract

In the field of medical image segmentation, deep convolutional neural networks have achieved satisfying performance in the past decade or so. However, there are some shortcomings. First, the convolutional neural network model cannot provide good insight into the remote dependencies in the image. Second, medical imaging datasets are typically small, which leads to a much higher risk of overfitting in model training. To address these limitations, we innovatively designed the skipped features enhancer (SFE) to enhance the impact of preserved details. To gain insight into remote dependencies in images, this model (SFE-TransUNet) is based on transformer. Additionally, a different scale convolutional layer (additional information capturer) before and after the Transformer Encoder to fuse the features to retain more information of the original data. In addition, a gate mechanism was introduced in the multi-head self-attention (MHSA), and. Finally, an attention block with residuals. SFE-TransUNet was evaluated on two public medical image segmentation datasets. Experimental results show that it achieves better performance than other related Transformer-based architectures. Code available at https://github.com/xackz/SFE-TransUNet .

Funder

the Project of Guangxi Key Laboratory of Eye Health

Fujian Provincial Clinical Research Center for Brain Diseases

Publisher

World Scientific Pub Co Pte Ltd

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3