Affiliation:
1. College of Geography and Remote Sensing Sciences, Xinjiang University, Urumqi 830049, China
2. Xinjiang Key Laboratory of Oasis Ecology, Xinjiang University, Urumqi 830017, China
Abstract
The urban agglomeration on the north slope of the Tianshan Mountains is a pivotal place in Western China; it is essential for the economic growth of Xinjiang and acts as a critical bridge between China’s interior and the Asia–Europe continent. Due to unique natural conditions, the local population distribution exhibits distinct regional characteristics. This study employs the spatial lag model (SLM) from conventional spatial analysis and the random forest model (RFM) from contemporary machine learning techniques. It integrates traditional geographic data, including land cover data and nighttime light data, with geographical big data, such as POI (points of interest) and OSM (OpenStreetMap), to build a comprehensive indicator database. Subsequently, it simulates the spatial population distribution within the urban agglomeration on the northern slopes of the Tianshan Mountains in 2020. The accuracy of the results is then compared and assessed against the accuracy of other available population raster datasets, and the spatial distribution pattern in 2020 is analyzed. The findings reveal the following: (1) The result of SLM, combined with multi-source data, predicts the population distribution as a relatively uniform and nearly circular structure, with minimal spatial differentiation. (2) The result of RFM, employing multi-source data, better captures the spatial population distribution, resulting in irregular boundaries that are indicative of strong spatial heterogeneity. (3) Both models demonstrate superior accuracy in simulating population distribution. The spatial lag model’s accuracy surpasses that of the GHS and GPW datasets, albeit still trailing behind WorldPop and LandScan. Meanwhile, the random forest model significantly outperforms the four aforementioned population raster datasets. (4) The population spatial pattern in the urban agglomeration on the north slope of the Tianshan Mountains predominantly consists of four distinct circles, illustrating a “one axis, one center, and multiple focal points” distribution characteristic. Combining the random forest model with geographic big data for spatialized population simulation offers robust scientific validity and practicality. It holds potential for broader application within the urban agglomeration on the Tianshan Mountains and across Xinjiang. This study can offer insights for studies on regional population spatial distributions and inform sustainable development strategies for cities and their populations.
Funder
the Third Xinjiang Scientific Expedition Program of Ministry of Science and Technology of the People’s Republic of China
Reference59 articles.
1. Research on the suitability of population distribution at the county level in China;Feng;Acta Geogr. Sin.,2014
2. Spatial heterogeneity and its changes of population on the two sides of Hu Line;Li;Acta Geogr. Sin.,2017
3. Improved population mapping for China using remotely sensed and points-of-interest data within a random forests model;Ye;Sci. Total Environ.,2019
4. Research on Demographic Statistical Data Spatialization Based on Residential Area Classifying;Chen;Geospat. Inf.,2016
5. A review on the spatial disaggregation of socioeconomic statistical data;Guo;Acta Geogr. Sin.,2022