Abstract
Abstract
Objective. Chest x-ray image representation and learning is an important problem in computer-aided diagnostic area. Existing methods usually adopt CNN or Transformers for feature representation learning and focus on learning effective representations for chest x-ray images. Although good performance can be obtained, however, these works are still limited mainly due to the ignorance of mining the correlations of channels and pay little attention on the local context-aware feature representation of chest x-ray image. Approach. To address these problems, in this paper, we propose a novel spatial-channel high-order attention model (SCHA) for chest x-ray image representation and diagnosis. The proposed network architecture mainly contains three modules, i.e. CEBN, SHAM and CHAM. To be specific, firstly, we introduce a context-enhanced backbone network by employing multi-head self-attention to extract initial features for the input chest x-ray images. Then, we develop a novel SCHA which contains both spatial and channel high-order attention learning branches. For the spatial branch, we develop a novel local biased self-attention mechanism which can capture both local and long-range global dependences of positions to learn rich context-aware representation. For the channel branch, we employ Brownian Distance Covariance to encode the correlation information of channels and regard it as the image representation. Finally, the two learning branches are integrated together for the final multi-label diagnosis classification and prediction. Main results. Experiments on the commonly used datasets including ChestX-ray14 and CheXpert demonstrate that our proposed SCHA approach can obtain better performance when comparing many related approaches. Significance. This study obtains a more discriminative method for chest x-ray classification and provides a technique for computer-aided diagnosis.
Funder
University Synergy Innovation Program of Anhui Province
Natural Science Foundation of Anhui Province
National Natural Science Foundation of China
Reference66 articles.
1. Anaxnet: anatomy aware multi-label finding classification in chest x-ray;Agu,2021
2. Mrscatt: a spatio-channel attention-guided network for mars rover image classification;Chakravarthy,2021
3. Label co-occurrence learning with graph convolutional networks for multi-label chest x-ray image classification;Chen;IEEE J. Biomed. Health Inf.,2020
4. Generating radiology reports via memory-driven transformer;Chen,2022
5. Keras: Deep learning library for theano and tensorflow;Chollet,2015