Research on Image Classification and Retrieval Using Deep Learning with Attention Mechanism on Diaspora Chinese Architectural Heritage in Jiangmen, China
-
Published:2023-01-17
Issue:2
Volume:13
Page:275
-
ISSN:2075-5309
-
Container-title:Buildings
-
language:en
-
Short-container-title:Buildings
Author:
Gao Le, Wu Yanqing, Yang Tian, Zhang Xin, Zeng ZhiqiangORCID, Chan Chak Kwan Dickson, Chen Weihui
Abstract
The study of the architectural heritage of the Chinese diaspora has an important role and significance in China’s historical and cultural background in the preservation of cultural data, the restoration of images, and in the analysis of human social and ideological conditions. The images from the architectural heritage of the Chinese diaspora usually include frescos, decorative patterns, chandelier base patterns, various architectural styles and other major types of architecture. Images of the architectural heritage of the Chinese diaspora in Jiangmen City, Guangdong Province, China are the research object of this study. A total of 5073 images of diaspora Chinese buildings in 64 villages and 16 towns were collected. In view of the fact that different types of image vary greatly in features while there are only small differences among the features of the same type of image, this study uses the depth learning method to design the Convolutional Neural Network Attention Retrieval Framework (CNNAR Framework). This approach can be divided into two stages. In the first stage, the transfer learning method is used to classify the image in question by transferring the trained parameters of the Paris500K datasets image source network to the target network for training, and thus the classified image is obtained. The advantage of this method is that it narrows the retrieval range of the target image. In the second stage, the fusion attention mechanism is used to extract the features of the images that have been classified, and the distance between similar images of the same type is reduced by loss of contrast. When we retrieve images, we can use the features extracted in the second stage to measure the similarities among them and return the retrieval results. The results show that the classification accuracy of the proposed method reaches 98.3% in the heritage image datasets of the JMI Chinese diaspora architectures. The mean Average Precision (mAP) of the proposed algorithm can reach 76.6%, which is better than several mainstream model algorithms. At the same time, the image results retrieved by the algorithm in this paper are very similar to those of the query image. In addition, the CNNAR retrieval framework proposed in this paper achieves accuracies of 71.8% and 72.5% on the public data sets Paris500K and Corel5K, respectively, which can be greatly generalized and can, therefore, also be effectively applied to other topics datasets. The JMI architectural heritage image database constructed in this study, which is rich in cultural connotations of diaspora Chinese homeland life, can provide strong and reliable data support for the follow-up study of the zeitgeist of the culture reflected in architecture and the integration of Chinese and Western aesthetics. At the same time, through the rapid identification, classification, and retrieval of precious architectural images stored in the database, similar target images can be retrieved reasonably and accurately; then, accurate techniques can be provided to restore old and damaged products of an architectural heritage.
Funder
Wuyi University-Hong Kong-Macao Unite Research Funds Wuyi University Youth Team Funds Guangdong Province Philosophy and Social Science Planning Discipline Joint Project National I & E Program for College Student
Subject
Building and Construction,Civil and Structural Engineering,Architecture
Reference56 articles.
1. Caciora, T., Herman, G.V., Ilies, A., Baias, S., Ilies, D.C., Josan, I., and Hodor, N. (2021). The use of virtual reality to promote sustainable tourism: A case study of wooden churches historical monuments from Romania. Remote Sens., 13. 2. A review of building detecting from very high resolution optical remote sensing images;Li;Giscience Remote Sens.,2022 3. Cai, Y.M., Ding, Y.L., Zhang, H.W., Xiu, J.H., and Liu, Z.M. (2020). Geo-Location algorithm for building targets in oblique remote sensing images based on deep learning and height estimation. Remote Sens., 12. 4. Munawar, H.S., Aggarwal, R., Qadir, Z., Khan, S.I., Kouzani, A.Z., and Mahmud, M.A.P. (2021). A gabor filter-based protocol for automated image-based building detection. Buildings, 11. 5. Cao, D.G., Xing, H.F., Wong, M.S., Kwan, M.P., Xing, H.Q., and Meng, Y. (2021). A stacking ensemble deep learning model for building extraction from remote sensing images. Remote Sens., 13.
Cited by
3 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|