Author:
Cao Xianghai,Zhang Kai,Jiao Licheng
Abstract
Road segmentation from remote sensing images is an important task in many applications. However, due to the high density of roads and the complex background, the roads are often occluded by trees. This makes accurate road segmentation a challenge task. Most existing road segmentation networks rely on convolutions with small kernels; however, these methods often cannot obtain satisfying results because the long-range dependencies are not captured and the intrinsic relationships between feature maps at different scales are not fully exploited. In this paper, a deep neural network based on a cross-scale axial attention mechanism is proposed to address this problem. This model enables low-resolution features to aggregate global contextual information from high-resolution features. Among them, the axial attention mechanism realizes global attention by using vertical and horizontal attention sequentially. With this strategy, the dense long-range dependencies can be captured with extremely low computational cost. The cross-scale mechanism enables the model to effectively combine the high-resolution fine-grained features and the low-resolution coarse-grained features. The proposed method enables the network to propagate the information without losing details. Our method achieves IoUs of 58.98 and 65.28 on the Massachusetts Roads dataset and DeepGlobe dataset and outperforms other methods.
Funder
National Natural Science Foundation of China
Subject
General Earth and Planetary Sciences
Reference46 articles.
1. icurb: Imitation learning-based detection of road curbs using aerial images for autonomous driving;Xu;IEEE Robot. Autom. Lett.,2021
2. Topo-boundary: A benchmark dataset on topological road-boundary detection using aerial images for autonomous driving;Xu;IEEE Robot. Autom. Lett.,2021
3. Steger, C., Glock, C., Eckstein, W., Mayer, H., and Radig, B. (1995). Automatic Extraction of Man-Made Objects from Aerial and Space Images, Springer.
4. Road extraction from very high resolution remote sensing optical images based on texture analysis and beamlet transform;Sghaier;IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens.,2015
5. Máttyus, G., Luo, W., and Urtasun, R. (2017, January 22–29). Deeproadmapper: Extracting road topology from aerial images. Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy.
Cited by
9 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献