Affiliation:
1. Department of Civil and Environmental Engineering, University of Maryland, College Park, MD
Abstract
Mobile device location data (MDLD) have been popularly utilized in various fields. Yet large-scale applications are limited because of either biased or insufficient spatial coverage of the data from individual data vendors. One approach to improve the data coverage is to leverage the data from different data vendors and integrate them to build a more representative dataset. To extract reliable statistics from MDLD, certain data preprocessing steps are crucial to ensure the accuracy of the analysis. One of these steps is the development of a framework to remove duplicated devices or several devices that belong to the same data subject. This treatment is especially necessary when using a multiplicity of data sources, as the same device may be captured by more than one data provider. We propose a data integration methodology for multisourced data to investigate the feasibility of integrating data from several sources. By leveraging the uniqueness of travel pattern of each device, duplicate devices are identified. The proposed methodology is shown to be cost-effective through a national-level analysis. The method is successfully applied to a dataset from January 2020 consisting of more than 270 million raw devices nationwide. Our findings suggest that devices sharing the same imputed home location and the same top-five most-visited locations during a month can represent the same user in the MDLD. It is shown that more than 99.6% of the sample devices having the aforementioned attribute in common are observed at the same location simultaneously.
Subject
Mechanical Engineering,Civil and Structural Engineering
Reference21 articles.
1. Federal Highway Administration. 2017 National Household Travel Survey. U.S. Department of Transportation, Washington, D.C., 2017. https://nhts.ornl.gov.
2. Baltimore Metropolitan Council. Maryland statewide household travel survey. https://www.baltometro.org/transportation/data-maps/maryland-travel-survey.
3. Freight Traffic Analytics from National Truck GPS Data in Thailand
4. The promises of big data and small data for travel behavior (aka human mobility) analysis
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献