Abstract
The sustainable development of cities with cultural landscapes has attracted wide attention, as they are composite carriers of urban greening and cultural space. Suzhou, China, and Kyoto, Japan, are renowned for their cultural heritage. In addition to protecting the rich tangible cultural landscapes, it is also important to integrate visitors' subjective perceptions, which relate more to intangible heritage, into heritage protection and green sustainable development strategies. Due to the limitations in quantifying non-visual sensory elements, previous perceptual evaluations have mainly focused on visual elements. However, online reviews include multi-sensory perception descriptions. This study employs deep learning methods to process photos and text from online reviews to obtain landscape elements and sensory information from both cities. Significant differences were found in the perception of various landscape elements and multi-sensory descriptions between the two cities, and different senses affect overall perception to varying degrees. These findings and the application of new technologies facilitate the incorporation of multi-sensory public perceptions into the protection of green spaces with cultural significance.