GeoSparseNet: A Multi-Source Geometry-Aware CNN for Urban Scene Analysis

Author:

Afzal Muhammad Kamran123ORCID,Liu Weiquan12ORCID,Zang Yu2ORCID,Chen Shuting4ORCID,Afzal Hafiz Muhammad Rehan56ORCID,Adam Jibril Muhammad2ORCID,Yang Bai3,Li Jonathan7ORCID,Wang Cheng2ORCID

Affiliation:

1. College of Computer Engineering, Jimei University, Xiamen 361021, China

2. Fujian Key Laboratory of Sensing and Computing for Smart Cities, School of Informatics, Xiamen University, Xiamen 361005, China

3. Center for Integrative Conservation & Yunnan Key Laboratory for Conservation of Tropical Rainforests and Asian Elephants, Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences, Mengla 666303, China

4. Mathematics and Digital Science School, Chengyi College, Jimei University, Xiamen 361021, China

5. The School of Life Sciences, Northwestern Polytechnical University, Xi’an 710072, China

6. School of Electrical Engineering and Computing, University of Newcastle, Callaghan, NSW 2308, Australia

7. Departments of Geography and Environmental Management and Systems Design Engineering, University of Waterloo, Waterloo, ON N2L 3G1, Canada

Abstract

The convolutional neural networks (CNNs) functioning on geometric learning for the urban large-scale 3D meshes are indispensable because of their substantial, complex, and deformed shape constitutions. To address this issue, we proposed a novel Geometry-Aware Multi-Source Sparse-Attention CNN (GeoSparseNet) for the urban large-scale triangular mesh classification task. GeoSparseNet leverages the non-uniformity of 3D meshes to depict both broad flat areas and finely detailed features by adopting the multi-scale convolutional kernels. By operating on the mesh edges to prepare for subsequent convolutions, our method exploits the inherent geodesic connections by utilizing the Large Kernel Attention (LKA) based Pooling and Unpooling layers to maintain the shape topology for accurate classification predictions. Learning which edges in a mesh face to collapse, GeoSparseNet establishes a task-oriented process where the network highlights and enhances crucial features while eliminating unnecessary ones. Compared to previous methods, our innovative approach outperforms them significantly by directly processing extensive 3D mesh data, resulting in more discerning feature maps. We achieved an accuracy rate of 87.5% when testing on an urban large-scale model dataset of the Australian city of Adelaide.

Funder

China Postdoctoral Science Foundatio

National Natural Science Foundation of China

The Educational Project Foundation of Young and Middle-aged Teacher of Fujian Province of China

The 14th Five-Year Plan of the Xishuangbanna Tropical Botanical Garden, Chinese Academy of Sciences

FuXiaQuan National Independent Innovation Demonstration Zone Collaborative Innovation Platform

Publisher

MDPI AG

Reference57 articles.

1. SUM: A benchmark dataset of semantic urban meshes;Gao;ISPRS J. Photogramm. Remote Sens.,2021

2. Skyline-based geometric simplification for urban solar analysis;Besuievsky;Graph. Model.,2018

3. Deep learning-based semantic segmentation of urban-scale 3D meshes in remote sensing: A survey;Adam;Int. J. Appl. Earth Obs. Geoinf.,2023

4. Fast semantic segmentation of 3D point clouds with strongly varying density;Hackel;ISPRS J. Photogramm. Remote Sens.,2016

5. Thomas, H., Qi, C.R., Deschaud, J.E., Marcotegui, B., Goulette, F., and Guibas, L.J. (November, January 27). Kpconv: Flexible and deformable convolution for point clouds. Proceedings of the IEEE/CVF, Seoul, Republic of Korea.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3