MEDUSA: Multi-Scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis

Author:

Aboutalebi Hossein,Pavlova Maya,Gunraj Hayden,Shafiee Mohammad Javad,Sabri Ali,Alaref Amer,Wong Alexander

Abstract

Medical image analysis continues to hold interesting challenges given the subtle characteristics of certain diseases and the significant overlap in appearance between diseases. In this study, we explore the concept of self-attention for tackling such subtleties in and between diseases. To this end, we introduce, a multi-scale encoder-decoder self-attention (MEDUSA) mechanism tailored for medical image analysis. While self-attention deep convolutional neural network architectures in existing literature center around the notion of multiple isolated lightweight attention mechanisms with limited individual capacities being incorporated at different points in the network architecture, MEDUSA takes a significant departure from this notion by possessing a single, unified self-attention mechanism with significantly higher capacity with multiple attention heads feeding into different scales in the network architecture. To the best of the authors' knowledge, this is the first “single body, multi-scale heads” realization of self-attention and enables explicit global context among selective attention at different levels of representational abstractions while still enabling differing local attention context at individual levels of abstractions. With MEDUSA, we obtain state-of-the-art performance on multiple challenging medical image analysis benchmarks including COVIDx, Radiological Society of North America (RSNA) RICORD, and RSNA Pneumonia Challenge when compared to previous work. Our MEDUSA model is publicly available.

Publisher

Frontiers Media SA

Subject

General Medicine

Reference53 articles.

1. Deep learning;LeCun;Nature.,2015

2. Learning Deep Architectures for AI

3. Understanding the difficulty of training deep feedforward neural networks;Glorot,2010

4. Deep residual learning for image recognition;He;CoRR.,2015

5. Very deep convolutional networks for large-scale image recognition;Simonyan;arXiv preprint.,2014

Cited by 13 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Optimizing image captioning: The effectiveness of vision transformers and VGG networks for remote sensing;Big Data Research;2024-08

2. Convolutional neural network with parallel convolution scale attention module and ResCBAM for breast histology image classification;Heliyon;2024-05

3. Convolutional Recurrent Neural Networks for Medical Image Recognition;2024 International Conference on Optimization Computing and Wireless Communication (ICOCWC);2024-01-29

4. Multi-Scale Recurrent Neural Networks for Medical Image Classification;2024 International Conference on Optimization Computing and Wireless Communication (ICOCWC);2024-01-29

5. Exploring the Potential of Recurrent Neural Networks for Medical Image Segmentation;2024 International Conference on Optimization Computing and Wireless Communication (ICOCWC);2024-01-29

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3