Abstract
With the widespread use of GPS equipment, a large amount of mobile location data is recorded, and urban hotspot areas extracted from GPS data can be applied to location-based services, such as tourist recommendations and point of interest positioning. It can also provide decision support for the analysis of population migration distribution and land use and planning. However, taxi GPS location data has a large amount of data and sparse points. How to avoid the influence of noise and efficiently detect hotspots in cities have become urgent problems to be solved. This paper proposes a clustering algorithm based on stay points and grid density. Firstly, a filtering pre-processing algorithm using stay points classification and stay points thresholds is proposed, so the influence of stop points is avoided. Then, the data space is divided into rectangular grid cells; each grid cell is determined to be a dense or non-dense grid according to the defined density threshold, and the cluster boundary points and noise points are judged in the non-dense grid cells to avoid normal sampling points being treated as noise. Finally, the associated dense grids are connected into clusters. The sampling points mapped to the grid cells are the elements in the clusters. Our method is more efficient than the DBSCAN algorithm because the grid cells are calculated. The superiority of the proposed algorithm in terms of clustering accuracy and time efficiency is verified in the real data set compared to traditional algorithms.
Subject
Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Geography, Planning and Development
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献