Improving Polyp Segmentation with Boundary-Assisted Guidance and Cross-Scale Interaction Fusion Transformer Network

Author:

Jiang Lincen12,Hui Yan1ORCID,Fei Yuan1,Ji Yimu1,Zeng Tao3

Affiliation:

1. School of Communications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210023, China

2. School of Computer and Software, Nanjing Vocational University of Industry Technology, Nanjing 210023, China

3. College of Electronic and Optical Engineering & College of Flexible Electronics (Future Technology), Nanjing University of Posts and Telecommunications, Nanjing 210023, China

Abstract

Efficient and precise colorectal polyp segmentation has significant implications for screening colorectal polyps. Although network variants derived from the Transformer network have high accuracy in segmenting colorectal polyps with complex shapes, they have two main shortcomings: (1) multi-level semantic information at the output of the encoder may result in information loss during the fusion process and (2) failure to adequately suppress background noise during segmentation. To address these challenges, we propose a cross-scale interaction fusion transformer for polyp segmentation (CIFFormer). Firstly, a novel feature supplement module (FSM) supplements the missing details and explores potential features to enhance the feature representations. Additionally, to mitigate the interference of background noise, we designed a cross-scale interactive fusion module (CIFM) that combines feature information between different layers to obtain more multi-scale and discriminative representative features. Furthermore, a boundary-assisted guidance module (BGM) is proposed to help the segmentation network obtain boundary-enhanced details. Extensive experiments on five typical datasets have demonstrated that CIFFormer has an obvious advantage in segmenting polyps. Specifically, CIFFormer achieved an mDice of 0.925 and an mIoU of 0.875 on the Kvasir-SEG dataset, achieving superior segmentation accuracy to competing methods.

Funder

National Key R&D Program of China

Jiangsu Key Development Planning Project

Natural Science Foundation of Jiangsu Province

The 14th Five-Year Plan project of Equipment Development Department

Jiangsu Hongxin Information Technology Co., Ltd. Project

Future Network Scientific Research Fund Project

2021 Jiangsu Higher Education Teaching Reform Research General Project

Publisher

MDPI AG

Reference34 articles.

1. Kim, T., Lee, H., and Kim, D. (2021, January 20–24). UACANet: Uncertainty Augmented Context Attention for Polyp Segmentation. Proceedings of the 29th ACM International Conference on Multimedia, Virtual.

2. Polyp Segmentation in NBI Colonoscopy;Gross;DBLP,2009

3. Attention Is All You Need;Vaswani;Adv. Neural Inf. Process. Syst.,2017

4. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., and Houlsby, N. (2020). An Image is Worth 16×16 Words: Transformers for Image Recognition at Scale. arXiv.

5. Intriguing Properties of Vision Transformers;Naseer;Adv. Neural Inf. Process. Syst.,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3