Attention-driven tree-structured convolutional LSTM for high dimensional data understanding
-
Published:2023-03-27
Issue:
Volume:11
Page:
-
ISSN:2296-424X
-
Container-title:Frontiers in Physics
-
language:
-
Short-container-title:Front. Phys.
Author:
Lu Yi,Kong Bin,Gao Feng,Cao Kunlin,Lyu Siwei,Zhang Shaoting,Hu Shu,Yin Youbing,Wang Xin
Abstract
Modeling sequential information for image sequences is a vital step of various vision tasks and convolutional long short-term memory (ConvLSTM) has demonstrated its superb performance in such spatiotemporal problems. Nevertheless, the hierarchical data structures (e.g., human body parts and vessel/airway tree in biomedical images) in various tasks cannot be properly modeled by sequential models. Thus, ConvLSTM is not suitable for analyzing tree-structured image data that has a rich relation among its elements. In order to address this limitation, we present a tree-structured ConvLSTM model for tree-structured image analysis which can be trained end-to-end. To demonstrate its effectiveness, we first evaluate the proposed tree-structured ConvLSTM model on a synthetic Tree-Moving-MNIST dataset for tree-structured modeling. Experimental results demonstrate the superiority of the tree-structured ConvLSTM model for tree-structured image analysis compared with other alternatives. Additionally, we present a tree-structured segmentation framework which consists of a tree-structured ConvLSTM layer and an attention fully convolutional network (FCN) model. The proposed framework is validated on four large-scale coronary artery datasets. The results demonstrate the effectiveness and efficiency of the proposed method, showing its potential use cases in the analysis of tree-structured image data.
Publisher
Frontiers Media SA
Subject
Physical and Theoretical Chemistry,General Physics and Astronomy,Mathematical Physics,Materials Science (miscellaneous),Biophysics
Reference55 articles.
1. Convolutional lstm network: A machine learning approach for precipitation nowcasting;Shi,2015
2. Deep learning for precipitation nowcasting: A benchmark and a new model;Shi,2017
3. Spatio-temporal video autoencoder with differentiable memory;Patraucean,2016
4. Videolstm convolves, attends and flows for action recognition;Li;Computer Vis Image Understanding,2018
5. Improving deep pancreas segmentation in ct and mri images via recurrent neural contextual learning and direct loss function;Jinzheng,2017